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NOVEL GENES, COMPOSITIONS, KITS, AND METHODS FOR 
IDENTIFICATION, ASSESSMENT, PREVENTION, 
AND THERAPY OF BREAST CANCER 

5 RELATED APPLICATIONS 

The present application claims priority to U.S. provisional application serial no. 
60/176,077, filed January 14, 2000, U.S. provisional application serial no. 60/189,167, 
filed March 14, 2000, U.S. provisional application serial no. 60/192,099, filed March 24, 
2000, U.S. provisional application serial no. 60/193,480, filed March 29, 2000, U.S. 
10 provisional application serial no. 60/205,230, filed May 15, 2000, U.S. provisional 

application serial no. 60/21 1,315, filed June 9, 2000, U.S. provisional application serial 
no. 60/220,534, filed July 25, 2000, all of which are expressly incorporated by reference. 

FIELD OF THE INVENTION 
1 5 The field of the invention is breast cancer, including diagnosis, characterization, 

management, and therapy of breast cancer. 

BACKGROUND OF THE INVENTION 
The increased number of cancer cases reported in the United States, and, indeed, 

20 around the world, is a major concern. Currently there are only a handful of treatments 
available for specific types of cancer, and these provide no absolute guarantee of 
success. In order to be most effective, these treatments require not only an early 
detection of the malignancy, but a reliable assessment of the severity of the malignancy. 
The incidence of breast cancer, a leading cause of death in women, has been 

25 gradually increasing in the United States over the last thirty years. In 1997, it was 
estimated that 181 ,000 new cases were reported in the U.S., and that 44,000 people 
would die of breast cancer (Parker et al, 1997, CA Cancer 1 Clin. 47:5-27; Chu et al, 
1996, 7. Nat Cancer Inst. 88:1571-1579). While the pathogenesis of breast cancer is 
unclear, transformation of normal breast epithelium to a malignant phenotype may be 

30 the result of genetic factors, especially in women under 30 (Miki et al, 1994, Science, 
266:66-71). The discovery and characterization of BRCA1 and BRCA2 has recently 
expanded our knowledge of genetic factors which can contribute to familial breast 
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cancer. Germ-line mutations within these two loci are associated with a 50 to 85% 

lifetime risk of breast and/or ovarian cancer (Casey, 1997, Curr. Opin. Oncol 9:88-93; 

Marcus et al, 1996, Cancer 77:697-709). However, it is likely that other, non-genetic 

factors also have a significant effect on the etiology of the disease. Regardless of its 
5 origin, breast cancer morbidity and mortality increases significantly if it is not detected 

early in its progression. Thus, considerable effort has focused on the early detection of 

cellular transformation and tumor formation in breast tissue. 

Currently, the principal manner of identifying breast cancer is through detection 

of the presence of dense tumorous tissue. This may be accomplished to varying degrees 
10 of effectiveness by direct examination of the outside of the breast, or through 

mammography or other X-ray imaging methods (Jatoi, 1999, Am. J. Surg. 177:51 8-524). 

The latter approach is not without considerable cost, however. Every time a 

mammogram is taken, the patient incurs a small risk of having a breast tumor induced 

by the ionizing properties of the radiation used during the test. In addition, the process is 
1 5 expensive and the subjective interpretations of a technician can lead to imprecision, e.g. y 

one study showed major clinical disagreements for .about one-third of a set of 

mammograms that were interpreted individually by a surveyed group of radiologists. 

Moreover, many women find that undergoing a mammogram is a painful experience. 

Accordingly, the National Cancer Institute has not recommended mammograms for 
20 women under fifty years of age, since this group is not as likely to develop breast 

cancers as are older women. It is compelling to note, however, that while only about 

22% of breast cancers occur in women under fifty, data suggests that breast cancer is 

more aggressive in pre-menopausal women. 

It would therefore be beneficial to provide specific methods and reagents for the 
25 diagnosis, staging, prognosis, monitoring, and treatment of diseases associated with 

breast cancer, or to indicate a predisposition to such for preventative measures. 

SUMMARY OF THE INVENTION 
The invention relates to novel genes associated with breast cancer as well as 
30 methods of assessing whether a patient is afflicted with breast cancer. The methods of 
the present invention comprise the step of comparing the level of expression of a marker 
in a patient sample, wherein the marker is listed in Tables 1 -6 and the normal level of 
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expression of the marker in a control, e.g., a sample from a patient without breast 
cancer. A significant difference between the level of expression of the marker in the 
patient sample and the normal level is an indication that the patient is afflicted with 
breast cancer. Preferably, a protein corresponding to the marker is a secreted protein or 
5 is predicted to correspond to a secreted protein. Alternatively, the marker can 
correspond to a protein having an extracellular portion, to one which is normally 
expressed in breast tissue at a detectable level, or both. 

In one method, the marker(s) are preferably selected such that the positive 
predictive value of the method is at least about 10%. Also preferred are embodiments of 

10 the method wherein the marker is over- or under-expressed by at least two-fold in at 

least about 20% of stage 0 breast cancer patients, stage I breast cancer patients, stage HA 
breast cancer patients, stage IIB breast cancer patients, stage II1A breast cancer patients, 
stage IIIB breast cancer patients, stage IV breast cancer patients, grade I breast cancer 
patients, grade II breast cancer patients, grade III breast cancer patients, malignant breast 

1 5 cancer patients, ductal carcinoma breast cancer patients, and lobular carcinoma breast 
cancer patients. 

In one embodiment of the methods of the present invention, the patient sample is 
a breast tissue-associated body fluid. Such fluids include, for example, blood fluids, 
lymph and cystic fluids, as well as nipple aspirates. In another embodiment, the sample 
20 comprises cells obtained from the patient. In another embodiment, the patient sample is 
in vivo. 

In accordance with the methods of the present invention, the level of expression 
of the marker in a sample can be assessed, for example, by detecting the presence in the 
sample of: 

25 • a protein or a fragment of the protein corresponding to the marker (e.g. 

using a reagent, such as an antibody, an antibody derivative, or an antibody 
fragment, which binds specifically with the protein or a fragment of the 
protein) 

• a metabolite which is produced directly (i.e., catalyzed) or indirectly by a 
30 protein corresponding to the marker 

• a transcribed polynucleotide (e.g. an mRNA or a cDNA), or fragment 
thereof, having at least a portion with which the marker is substantially 
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homologous (e.g. by contacting a mixture of transcribed polynucleotides 
obtained from the sample with a substrate having one or more of the markers 
listed in Tables 1-6 fixed thereto at selected positions) 
• a transcribed polynucleotide or fragment thereof, wherein the 
5 polynucleotide anneals with the marker under stringent hybridization 

conditions. 

The methods of the present invention are particularly useful for patients with an 
identified breast mass or symptoms associated with breast cancer. The methods of the 
present invention can also be of particular use with patients having an enhanced risk of 
1 0 developing breast cancer (e.g. , patients having a familial history of breast cancer, 

patients identified as having a mutant oncogene, and patients at least about 50 years of 
age). The methods of the present invention may further be of particular use in 
monitoring the efficacy of treatment of a breast cancer patient (e.g. the efficacy of 
chemotherapy). 

1 5 The methods of the present invention may be performed using a plurality (e.g. 2, 

3, 5, or 1 0 or more) of markers. According to a method involving a plurality of markers, 
the level of expression in the sample of each of a plurality of markers independently 
selected from the markers listed in Tables 1-6 is compared with the normal level of 
expression of each of the plurality of markers in samples of the same type obtained from 

20 control humans not afflicted with breast cancer. A significantly enhanced level of 
expression of one or more of the markers listed in Tables 1-6 in the sample, relative to 
the corresponding normal levels, is an indication that the patient is afflicted with breast 
cancer. The markers of Tables 1-6 may also be used in combination with known breast 
cancer markers in the methods of the present invention. 

25 In a preferred method of assessing whether a patient is afflicted with breast cancer 

(e.g., new detection ("screening"), detection of recurrence, reflex testing), the method 
comprises comparing: 

a) the level of expression of a marker in a patient sample, wherein at least 
one marker is selected from the markers of Tables 1-6, and 

30 b) the normal level of expression of the marker in a control non-breast 

cancer sample. 
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A significant difference between the level of expression of the marker in the patient 
sample and the normal level is an indication that the patient is afflicted with breast 
cancer. 

The methods of the present invention further include a method of assessing the 
5 efficacy of a test compound for inhibiting breast cancer in a patient. This method 
comprises comparing: 

a) expression of a marker in a first sample obtained from the patient and 
maintained in the presence of the test compound, wherein the marker is 
selected from the group consisting of the markers listed in Tables 1-6, 

10 and 

b) expression of the marker in a second sample obtained from the patient 
and maintained in the absence of the test compound. 

A significantly lower level of expression of the marker in the first sample, relative to the 
second sample, is an indication that the test compound is efficacious for inhibiting breast 
1 5 cancer in the patient. For example, the first and second samples can be portions of a 
single sample obtained from the patient or portions of pooled samples obtained from the 
patient. 

The invention further relates to a method of assessing the efficacy of a therapy for 
inhibiting breast cancer in a patient. This method comprises comparing: 
20 a) expression of a marker in a first sample obtained from the patient prior 

to providing at least a portion of the therapy to the patient, wherein the 
marker is selected from the group consisting of the markers listed in 
Tables 1-6, and 

b) expression of the marker in a second sample obtained from the patient 
25 following provision of the portion of the therapy. 

A significantly lower level of expression of the marker in the second sample, relative to 
the first sample, is an indication that the therapy is efficacious for inhibiting breast 
cancer in the patient. 

It will be appreciated that in these methods the "therapy" may be any therapy for 
30 treating breast cancer including, but not limited to, chemotherapy, radiation therapy and 
surgical removal of tissue, e.g., a breast tumor. Thus, the methods of the invention may 
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be used to evaluate a patient before, during and after therapy, for example, to evaluate 
the reduction in tumor burden. 

The present invention therefore further comprises a method for monitoring the 
progression of breast cancer in a patient, the method comprising: 
5 a) detecting in a patient sample at a first time point, the expression of a marker, 

wherein the marker is selected from the group consisting of the markers listed in Tables 
1-6; 

b) repeating step a) at a subsequent time point; and 

c) comparing the level of expression detected in steps a) and b), and therefrom 
10 monitoring the progression of breast cancer in the patient. . 

The invention also includes a method of selecting a composition for inhibiting 
breast cancer in a patient. This method comprises the steps of: 

a) obtaining a sample comprising cancer cells from the patient; 

b) separately maintaining aliquots of the sample in the presence of a 
1 5 plurality of test compositions; 

c) comparing expression of a marker listed in Tables 1 -6 in each of the 
aliquots; and 

d) selecting one of the test compositions which induces a lower level of 
expression of the marker in the aliquot containing that test composition, 

20 relative to other test compositions. 

In addition, the invention includes a method of inhibiting breast cancer in a 
patient. This method comprises the steps of: 

a) obtaining a sample comprising cancer cells from the patient; 

b) separately maintaining aliquots of the sample in the presence of a 
25 plurality of test compositions; 

c) comparing expression of a marker listed in Tables 1-6 in each of the 
\ aliquots; and 

d) administering to the patient at least one of the test compositions which 
induces a lower level of expression of the marker in the aliquot 

30 containing that test composition, relative to other test compositions. 
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The invention also includes a kit for assessing whether a patient is afflicted with 
breast cancer. This kit comprises reagents for assessing expression of a marker listed in 
Tables 1-6. 

In another aspect, the invention relates to a kit for assessing the suitability of each 
5 of a plurality of compounds for inhibiting breast cancer in a patient. The kit comprises a 
reagent for assessing expression of a marker listed in Tables 1-6, and may also comprise 
a plurality of compounds. 

In another aspect, the invention relates to a kit for assessing the presence of breast 
cancer cells. This kit comprises an antibody, wherein the antibody binds specifically 
1 0 with a protein corresponding to a marker listed in Tables 1 -6. The kit may also 

comprise a plurality of antibodies, wherein the plurality binds specifically with a protein 
corresponding to a different marker which is also listed in Tables 1 -6. 

The invention also includes a kit for assessing the presence of breast cancer cells, 
wherein the kit comprises a nucleic acid probe. The probe binds specifically with a 
15 transcribed polynucleotide corresponding to a marker listed in Tables 1-6. The kit may 
also comprise a plurality of probes, wherein each of the probes binds specifically with a 
transcribed polynucleotide corresponding to a different marker listed in Tables 1-6. 

The invention further relates to a method of making an isolated hybridoma which 
produces an antibody useful for assessing whether a patient is afflicted with breast 
20 cancer. The method comprises isolating a protein or protein fragment corresponding to 
a marker listed in Tables 1 -6, immunizing a mammal using the isolated protein or 
protein fragment, isolating splenocytes from the immunized mammal, fusing the isolated 
splenocytes with an immortalized cell line to form hybridomas, and screening individual 
hybridomas for production of an antibody which specifically binds with the protein or 
25 protein fragment to isolate the hybridoma. The invention also includes an antibody 
produced by this method. 

The invention further includes a method of assessing the breast carcinogenic or 
irregular growth promoting potential of a test compound. This method comprises the 
steps of: 

30 a) maintaining separate aliquots of breast cells in the presence and 

absence of the test compound; and 

b) comparing expression of a marker in each of the aliquots. 
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The marker is selected from those listed in Tables 1-6. A significantly enhanced 
level of expression of the marker in the aliquot maintained in the presence of (or 
exposed to) the test compound, relative to the aliquot maintained in the absence of the 
test compound, is an indication that the test compound possesses breast carcinogenic or 
5 irregular growth promoting potential. 

Additionally, the invention includes a kit for assessing the breast carcinogenic 
potential of a test compound. The kit comprises breast cells and a reagent for assessing 
expression of a marker in each of the aliquots. The marker is selected from those listed 
in Tables 1-6. 

1 0 The invention further includes a method of treating a patient afflicted with breast 

cancer, comprising providing to cells of the patient an antisense oligonucleotide 
complementary to a polynucleotide corresponding to a marker listed in Tables 1-6. 

The invention includes a method of inhibiting breast cancer in a patient at risk for 
developing breast cancer. This method comprises inhibiting expression or 

15 overexpression of a gene corresponding to a marker listed in Tables 1-6. 

It will be appreciated that the methods and kits of the present invention may also 
include known cancer markers including known breast cancer markers. It will further be 
appreciated that the methods and kits may be used to identify cancers other than breast 
cancer. 

20 

DETAILED DESCRIPTION OF THE INVENTION 
The invention relates to newly discovered correlations between expression of 
certain markers and the cancerous state of breast cells. It has been discovered that the 
level of expression of individual markers and combinations of markers described herein 
25 correlates with the presence of breast cancer in a patient. Methods are provided for 
detecting the presence of breast cancer in a sample, the absence of breast cancer in a 
sample, the stage of breast cancer, and other characteristics of breast cancer that are 
relevant to prevention, diagnosis, characterization, and therapy of breast cancer in a 
patient. 
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Definitions 

As used herein, each of the following terms has the meaning associated with it in 
this section. 

The articles "a" and "an" are used herein to refer to one or to more than one (i.e. 
5 to at least one) of the grammatical object of the article. By way of example, "an 
element" means one element or more than one element. 

A "marker" is a naturally-occurring polymer corresponding to at least one of the 
novel nucleic acids listed in Tables 1-6. For example, markers include, without 
limitation, sense and anti-sense strands of genomic DNA (i.e. including any introns 
10 occurring therein), RNA generated by transcription of genomic DNA (i.e. prior to 
splicing), RNA generated by splicing of RNA transcribed from genomic DNA, and 
proteins generated by translation of spliced RNA (e.g. including proteins both before 
and after cleavage of normally cleaved regions such as transmembrane signal 
sequences). As used herein, "marker" may also include a cDNA made by reverse 
1 5 transcription of an RNA generated by transcription of genomic DNA (including spliced 
RNA). 

As used herein a "polynucleotide corresponds to" another (a first) polynucleotide 
if it is related to the first polynucleotide by any of the following relationships: 1) The 
second polynucleotide comprises the first polynucleotide and the second polynucleotide 

20 encodes a gene product. 2) The second polynucleotide is 5 r or 3 ' to the first 
polynucleotide in cDNA, RNA, genomic DNA, or fragment of any of these 
polynucleotides. For example, a second polynucleotide may be fragment of a gene that 
includes the first and second polynucleotides. The first and second polynucleotides are 
related in that they are components of the gene coding for a gene product, such as a 

25 protein or antibody. However, it is not necessary that the second polynucleotide 
comprises or overlaps with the first polynucleotide to be encompassed within the 
definition of "corresponding to" as used herein. For example, the first polynucleotide 
may be a fragment of a 3' untranslated region of the second polynucleotide. The first 
and second polynucleotide may be fragments of a gene coding for a gene product. The 

30 second polynucleotide may be an exon of the gene while the first polynucleotide may be 
an intron of the gene. 3) The second polynucleotide is the complement of the first 
polynucleotide. 
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The term "probe" refers to any molecule which is capable of selectively binding 
to a specifically intended target molecule, for example a marker of the invention. 
Probes can be either synthesized by one skilled in the art, or derived from appropriate 
biological preparations. For purposes of detection of the target molecule, probes may be 
5 specifically designed to be labeled, as described herein. Examples of molecules that can 
be utilized as probes include, but are not limited to, RNA, DNA, proteins, antibodies, 
and organic monomers. 

A "breast-associated" body fluid is a fluid which, when in the body of a patient, 
contacts or passes through breast cells or into which cells, nucleic acids or proteins shed 
10 from breast cells are capable of passing. Exemplary breast-associated body fluids 
include blood fluids, lymph, cystic fluid, urine and nipple aspirates. 

The "normal" level of expression of a marker is the level of expression of the 
marker in breast cells of a patient, e.g. a human, not afflicted with breast cancer. 

"Over-expression" and "under-expression" of a marker refer to expression of the 
1 5 marker of a patient at a greater or lesser level, respectively, than normal level of 
expression of the marker {e.g. at least two-fold greater or lesser level). 

As used herein, the term "promoter/regulatory sequence" means a nucleic acid 
sequence which is required for expression of a gene product operably linked to the 
promoter/regulatory sequence. In some instances, this sequence may be the core 
20 promoter sequence and in other instances, this sequence may also include an enhancer 
sequence and other regulatory elements which are required for expression of the gene 
product. The promoter/regulatory sequence may, for example, be one which expresses 
the gene product in a tissue-specific manner. 

A "constitutive" promoter is a nucleotide sequence which, when operably linked 
25 with a polynucleotide which encodes or specifies a gene product, causes the gene 

product to be produced in a living human cell under most or all physiological conditions 
of the cell. 

An "inducible" promoter is a nucleotide sequence which, when operably linked 
with a polynucleotide which encodes or specifies a gene product, causes the gene 
30 product to be produced in a living human cell substantially only when an inducer which 
corresponds to the promoter is present in the cell. 
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A "tissue-specific" promoter is a nucleotide sequence which, when operably 
linked with a polynucleotide which encodes or specifies a gene product, causes the gene 
product to be produced in a living human cell substantially only if the cell is a cell of the 
tissue type corresponding to the promoter. 
5 A "transcribed polynucleotide" is a polynucleotide (e.g. an RNA, a cDNA, or an 

analog of one of an RNA or cDN A) which is complementary to or homologous with all 
or a portion of a mature RNA made by transcription of a genomic DNA corresponding 
to a marker of the invention and normal post-transcriptional processing (e.g. splicing), if 
any, of the transcript. 

1 0 "Complementary" refers to the broad concept of sequence complementarity 

between regions of two nucleic acid strands or between two regions of the same nucleic 
acid strand. It is known that an adenine residue of a first nucleic acid region is capable 
of forming specific hydrogen bonds ("base pairing") with a residue of a second nucleic 
acid region which is antiparallel to the first region if the residue is thymine or uracil. 

1 5 Similarly, it is known that a cytosine residue of a first nucleic acid strand is capable of 
base pairing with a residue of a second nucleic acid strand which is antiparallel to the 
first strand if the residue is guanine. A first region of a nucleic acid is complementary to 
a second region of the same or a different nucleic acid if, when the two regions are 
arranged in an antiparallel fashion, at least one nucleotide residue of the first region is 

20 capable of base pairing with a residue of the second region. Preferably, the first region 
comprises a first portion and the second region comprises a second portion, whereby, 
when the first and second portions are arranged in an antiparallel fashion, at least about 
50%, and preferably at least about 75%, at least about 90%, or at least about 95% of the 
nucleotide residues of the first portion are capable of base pairing with nucleotide 

25 residues in the second portion. More preferably, all nucleotide residues of the first 
portion are capable of base pairing with nucleotide residues in the second portion. 

"Homologous" as used herein, refers to nucleotide sequence similarity between 
two regions of the same nucleic acid strand or between regions of two different nucleic 
acid strands. When a nucleotide residue position in both regions is occupied by the 

30 same nucleotide residue, then the regions are homologous at that position. A first region 
is homologous to a second region if at least one nucleotide residue position of each 
region is occupied by the same residue. Homology between two regions is expressed in 
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terms of the proportion of nucleotide residue positions of the two regions that are 
occupied by the same nucleotide residue. By way of example, a region having the 
nucleotide sequence 5 ? -ATTGCC-3' and a region having the nucleotide sequence 5'- 
TATGGC-3' share 50% homology. Preferably, the first region comprises a first portion 
5 and the second region comprises a second portion, whereby, at least about 50%, and 
preferably at least about 75%, at least about 90%, or at least about 95% of the nucleotide 
residue positions of each of the portions are occupied by the same nucleotide residue. 
More preferably, all nucleotide residue positions of each of the portions are occupied by 
the same nucleotide residue. 
10 A marker is "fixed" to a substrate if it is covalently or non-covalently associated 

with the substrate such the substrate can be rinsed with a fluid (e.g. standard saline 
citrate, pH 7.4) without a substantial fraction of the marker dissociating from the 
substrate. 

As used herein, a "naturally-occurring" nucleic acid molecule refers to an RNA or 
1 5 DNA molecule having a nucleotide sequence that occurs in nature (e.g. encodes a 
natural protein). 

Expression of a marker in a patient is "significantly" higher or lower than the 
normal level of expression of a marker if the level of expression of the marker is greater 
or less, respectively, than the normal level by an amount greater than the standard error 

20 of the assay employed to assess expression, and preferably at least twice, and more 
preferably three, four, five or ten times that amount. Alternately, expression of the 
marker in the patient can be considered "significantly" higher or lower than the normal 
level of expression if the level of expression is at least about two, and preferably at least 
about three, four, or five times, higher or lower, respectively, than the normal level of 

25 expression of the marker. 

Breast cancer is "inhibited" if at least one symptom of the cancer is alleviated, 
terminated, slowed, or prevented. As used herein, breast cancer is also "inhibited" if 
recurrence or metastasis of the cancer is reduced, slowed, delayed, or prevented. 

A kit is any manufacture (e.g. a package or container) comprising at least one 

30 reagent, e.g. a probe, for specifically detecting a marker of the invention, the 

manufacture being promoted, distributed, or sold as a unit for performing the methods of 
the present invention. 
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Description 

The present invention is based, in part, on identification of novel markers which 
are expressed at a different level in breast cancer cells than they are in normal (ie. non- 

' 5 cancerous) breast cells. The markers of the invention correspond to nucleic acid and 
polypeptide molecules which can be detected in one or both of normal and cancerous 
breast cells. The presence, absence, or level of expression of one or more of these 
markers in breast cells is herein correlated with the cancerous state of the tissue. The 
invention thus includes compositions, kits, and methods for assessing the cancerous state 

10 of breast cells (e.g. cells obtained from a human, cultured human cells, archived or 
preserved human cells and in vivo cells). 

The compositions, kits, and methods of the invention have the following uses, 
among others: 

I) assessing whether a patient is afflicted with breast cancer; 
1 5 2) assessing the stage of breast cancer in a human patient; 

3) assessing the grade of breast cancer in a patient; 

4) assessing the benign or malignant nature of breast cancer in a patient; 

5) assessing the histological type of neoplasm (e.g. ductal, lobular, etc.) 

associated with breast cancer in a patient; 
20 6) making an isolated hybridoma which produces an antibody useful for 

assessing whether a patient is afflicted with breast cancer; 

7) assessing the presence of breast cancer cells; 

8) assessing the efficacy of one or more test compounds for inhibiting breast 

cancer in a patient; 

25 9) assessing the efficacy of a therapy for inhibiting breast cancer in a 

patient; 

1 0) monitoring the progression of breast cancer in a patient; 

I I) selecting a composition or therapy for inhibiting breast cancer in a 

patient; 

30 1 2) treating a patient afflicted with breast cancer; 

1 3) inhibiting breast cancer in a patient; 
1 4) assessing the carcinogenic potential of a test compound; and 
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1 5) inhibiting breast cancer in a patient at risk for developing breast 
cancer. 

The invention thus includes a method of assessing whether a patient is afflicted 
with breast cancer. This method comprises comparing the level of expression of a 
5 marker in a patient sample and the normal level of expression of the marker in a control, 
e.g., a non-breast cancer sample. A significant difference between the level of 
expression of the marker in the patient sample and the normal level is an indication that 
the patient is afflicted with breast cancer. The marker is selected from the group 
consisting of the markers listed in Tables 1-6. 

10 The polynucleotides set forth in Tables 1-6 represent previously unidentified 

nucleotide sequences. These nucleotide sequences were identified through subtracted 
library experiments described herein. In Tables 1 and 3, SEQ ID NOS 316-470, 793- 
890, 1255-1363, 2125-2454 and 3352-3626 are preferred and SEQ ID NOS 1-315, 676- 
792, 1056-1254, 1645-2124 and 2942-3351 are most preferred. In Tables 2 and 4, SEQ 

15 ID NOS: 1879-1959 are preferred and SEQ ID NOS: 1-1878 are most preferred. Also 
provided by this invention are polynucleotides that correspond to the polynucleotides of 
Tables 1-6. In one embodiment, these polynucleotides are obtained by identification of 
a larger fragment or full-length coding sequence of these polynucleotides. Gene 
delivery vehicles, host cells, compositions and databases (all describe herein) containing 

20 these polynucleotides are also provided by this invention. 

The invention also encompasses polynucleotides which differ from that of the 
polynucleotides described above, but which produce the same phenotypic effect, e.g. 
allelic variants. These altered, but phenotypically equivalent polynucleotides are 
referred to "equivalent nucleic acids." This invention also encompasses polynucleotides 

25 characterized by changes in non-coding regions that do not alter the polypeptide 

produced therefrom when compared to the polynucleotide herein. This invention further 
encompasses polynucleotides, which hybridize to the polynucleotides of the subject 
invention under conditions of moderate or high stringency. Alternatively, the 
polynucleotides are at least 85%, or at least 90%, or more preferably, greater or equal to 

30 95% identical as determined by a sequence alignment program when run under default 
parameters. 
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Any marker or combination of markers listed in Tables 1-6, as well as any known 
markers in combination with the markers set forth in Tables 1-6, may be used in the 
compositions, kits, and methods of the present invention. In general, it is preferable to 
use markers for which the difference between the level of expression of the marker in 
5 breast cancer cells and the level of expression of the same marker in normal breast cells 
is as great as possible. Although this difference can be as small as the limit of detection 
of the method for assessing expression of the marker, it is preferred that the difference 
be at least greater than the standard error of the assessment method, and preferably a 
difference of at least 2-, 3-, 4-, 5-, 6-, 7-, 8-, 9-, 10-, 15-, 20-, 25-, 100-, 500-, 1000-fold 
10 or greater. 

It is recognized that certain markers correspond to proteins which are secreted 
from breast cells (z.e. one or both of normal and cancerous cells) to the extracellular 
space surrounding the cells. These markers are preferably used in certain embodiments 
of the compositions, kits, and methods of the invention, owing to the fact that the protein 

15 corresponding to each of these markers can be detected in an breast-associated body 
fluid sample, which may be more easily collected from a human patient than a tissue 
biopsy sample. In addition, preferred in vivo techniques for detection of a protein 
corresponding to a marker of the invention include introducing into a subject a labeled 
antibody directed against the protein. For example, the antibody can be labeled with a 

20 radioactive marker whose presence and location in a subject can be detected by standard 
imaging techniques. 

Although not every marker corresponding to a secreted protein is indicated as 
such herein, it is a simple matter for the skilled artisan to determine whether any 
particular marker corresponds to a secreted protein. In order to make this determination, 

25 the protein corresponding to a marker is expressed in a test cell (e.g. a cell of a breast 
cell line), extracellular fluid is collected, and the presence or absence of the protein in 
the extracellular fluid is assessed (e.g. using a labeled antibody which binds specifically 
with the protein). 

The following is an example of a method which can be used to detect secretion of 
30 a protein corresponding to a marker of the invention. About 8x10* 293T cells are 
incubated at 37°C in wells containing growth medium (Dulbecco's modified Eagle's 
medium {DMEM} supplemented with 10% fetal bovine serum) under a 5% (v/v) C0 2 , 
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95% air atmosphere to about 60-70% confluence. The cells are then transfected using a 
standard transfection mixture comprising 2 micrograms of DNA comprising an 
expression vector encoding the protein and 10 microliters of LipofectAMNE™ 
(GIBCO/BRL Catalog no. 18342-012) per well. The transfection mixture is maintained 
5 for about 5 hours, and then replaced with fresh growth medium and maintained in an air 
atmosphere. Each well is gently rinsed twice with DMEM which does not contain 
methionine or cysteine (DMEM-MC; ICN Catalog no. 16-424- 54). About 1 milliliter 
of DMEM-MC and about 50 microcuries of Trans- 35 S™ reagent (ICN Catalog no. 
51006) are added to each well. The wells are maintained under the 5% C0 9 atmosphere 
10 described above and incubated at 37°C for a selected period. Following incubation, 150 
microliters of conditioned medium is removed and centrifuged to remove floating cells 
and debris. The presence of the protein in the supernatant is an indication that the 
protein is secreted. 

Examples of breast-associated body fluids include blood fluids (e.g. whole blood, 

15 blood serum, blood having platelets removed therefrom, etc.), lymph, ascitic fluid, cystic 
fluid, urine and nipple aspirates. In these embodiments, the level of expression of the 
marker can be assessed by assessing the amount (e.g. absolute amount or concentration) 
of the marker in a breast-associated body fluid obtained from a patient. The fluid can, of 
course, be subjected to a variety of well-known post-collection preparative and storage 

20 techniques (e.g. storage, freezing, ultrafiltration, concentration, evaporation, 
centrifugation, etc.) prior to assessing the amount of the marker in the fluid. 

Many breast-associated body fluids (i.e. usually excluding urine) can have breast 
cells therein, particularly when the breast cells are cancerous, and, more particularly, 
when the breast cancer is metastasizing. Thus, the compositions, kits, and methods of 

25 the invention can be used to detect expression of markers corresponding to proteins 
having at least one portion which is displayed on the surface of cells which express it. 
It is a simple matter for the skilled artisan to determine whether the protein 
corresponding to any particular marker comprises a cell-surface protein. For example, 
immunological methods may be used to detect such proteins on whole cells, or well 

30 known computer-based sequence analysis methods (e.g. the SIGNALP program; Nielsen 
et al, 1997, Protein Engineering 10:1-6) may be used to predict the presence of at least 
one extracellular domain (i.e. including both secreted proteins and proteins having at 
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least one cell-surface domain). Expression of a marker corresponding to a protein 
having at least one portion which is displayed on the surface of a cell which expresses it 
may be detected without necessarily lysing the cell (e.g. using a labeled antibody which 
binds specifically with a cell-surface domain of the protein). 
5 Expression of a marker of the invention may be assessed by any of a wide variety 

of well known methods for detecting expression of a transcribed molecule or protein. 
Non-limiting examples of such methods include immunological methods for detection 
of secreted, cell-surface, cytoplasmic, or nuclear proteins, protein purification methods, 
protein function or activity assays, nucleic acid hybridization methods, nucleic acid 

10 reverse transcription methods, and nucleic acid amplification methods. 

In a preferred embodiment, expression of a marker is assessed using an antibody 
(e.g. a radio-labeled, chromophore-labeled, fluorophore-labeled, or enzyme-labeled 
antibody), an antibody derivative (e.g. an antibody conjugated with a substrate or with 
the protein or ligand of a protein-ligand pair {e.g. biotin-streptavidin} ), or an antibody 

1 5 fragment (e.g. a single-chain antibody, an isolated antibody hypervariable domain, etc.) 
which binds specifically with a protein or a fragment thereof, corresponding to the 
marker, such as the protein encoded by the open reading frame corresponding to the 
marker or such a protein which has undergone all or a portion of its normal post- 
translational modification. 

20 In another preferred embodiment, expression of a marker is assessed by preparing 

mRNA/cDNA (i.e. a transcribed polynucleotide) from cells in a patient sample, and by 
hybridizing the mRNA/cDNA with a reference polynucleotide which is a complement of 
a polynucleotide comprising the marker, and fragments thereof. cDNA can, optionally, 
be amplified using any of a variety of polymerase chain reaction methods prior to 

25 hybridization with the reference polynucleotide; preferably, it is not amplified. 

Expression of one or more markers can likewise be detected using quantitative PCR to 
assess the level of expression of the markers). Alternatively, any of the many known 
methods of detecting mutations or variants (e.g. single nucleotide polymorphisms, 
deletions, etc.) of a marker of the invention may be used to detect occurrence of a 

30 marker in a patient. 
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In a related embodiment, a mixture of transcribed polynucleotides obtained from 
the sample is contacted with a substrate having fixed thereto a polynucleotide 
complementary to or homologous with at least a portion (e.g. at least 7, 10, 15, 20, 25, 
30, 40, 50, 100, 500, or more nucleotide residues) of a marker of the invention. If 
5 polynucleotides complementary to or homologous with are differentially detectable on 
the substrate (e.g. detectable using different chromophores or fluorophores, or fixed to 
different selected positions), then the levels of expression of a plurality of markers can 
be assessed simultaneously using a single substrate (e.g. a "gene chip" microarray of 
polynucleotides fixed at selected positions). When a method of assessing marker 

10 expression is used which involves hybridization of one nucleic acid with another, it is 
preferred that the hybridization be performed under stringent hybridization conditions. 

Because the compositions, kits, and methods of the invention rely on detection of 
a difference in expression levels of one or more markers of the invention, it is preferable 
that the level of expression of the marker is significantly greater than the minimum 

15 detection limit of the method used to assess expression in at least one of normal breast 
cells and cancerous breast cells. 

It is understood that by routine screening of additional patient samples using one 
or more of the markers of the invention, it will be realized that certain of the markers are 
over- or under-expressed in cancers of various types, including specific breast cancers, 

20 as well as other cancers such as ovarian cancer, cervical cancer, etc. For example, it will 
be confirmed that some of the markers of the invention are over- or under-expressed in 
most (i.e. 50% or more) or substantially all (i.e. 80% or more) of breast cancer. 
Furthermore, it will be confirmed that certain of the markers of the invention are 
associated with breast cancer of various stages (i.e. stage 0, 1, II, II, and IV breast 

25 cancers, as well as subclassifications HA, IIB, IIIA, and IIIB, using the FIGO Stage 
Grouping system for primary carcinoma of the breast; (see Breast, In: American Joint 
Committee on Cancer: AJCC Cancer Staging Manual. Lippincott-Raven Publishers, 5th 
ed., 1997, pp. 171-180), of various histologic subtypes (e.g. serous, mucinous, 
endometroid, and clear cell subtypes, as well as subclassifications and alternate 

30 classifications adenocarcinoma, papillary adenocarcinoma, papillary 

cystadenocarcinoma, surface papillary carcinoma, malignant adenofibroma, 
cystadenofibroma, adenocarcinoma, cystadenocarcinoma, adenoacanthoma, 
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endometrioid stromal sarcoma, mesodermal (MuIIerian) mixed tumor, mesonephroid 
tumor, malignant carcinoma, Brenner tumor, mixed epithelial tumor, and 
undifferentiated carcinoma, using the WHO/FIGO system for classification of malignant 
breast tumors; Scully, Atlas of Tumor Pathology, 3d series, Washington DC), and 
5 various grades (i.e. grade I {well differentiated} , grade II {moderately well 

differentiated}, and grade III {poorly differentiated from surrounding normal tissue} )). 
In addition, as a greater number of patient samples are assessed for expression of the 
markers of the invention and the outcomes of the individual patients from whom the 
samples were obtained are correlated, it will also be confirmed that altered expression of 

1 0 certain of the markers of the invention are strongly correlated with malignant cancers 
and that altered expression of other markers of the invention are strongly correlated with 
benign tumors. The compositions, kits, and methods of the invention are thus useful for 
characterizing one or more of the stage, grade, histological type, and benign/malignant 
nature of breast cancer in patients. In addition, these compositions, kits, and methods 

1 5 can be used to detect and differentiate lobular and ductal carcinoma breast cancers. 
When the compositions, kits, and methods of the invention are used for 
characterizing one or more of the stage, grade, histological type, and benign/malignant 
nature of breast cancer in a patient, it is preferred that the marker or panel of markers of 
the invention is selected such that a positive result is obtained in at least about 20%, and 

20 preferably at least about 40%, 60%, or 80%, and more preferably in substantially all 
patients afflicted with an breast cancer of the corresponding stage, grade, histological 
type, or benign/malignant nature. Preferably, the marker or panel of markers of the 
invention is selected such that a PPV of greater than about 10% is obtained for the 
general population (more preferably coupled with an assay specificity greater than 

25 99.5%). 

When a plurality of markers of the invention are used in the compositions, kits, 
and methods of the invention, the level of expression of each marker in a patient sample 
can be compared with the normal level of expression of each of the plurality of markers 
in non-cancerous samples of the same type, either in a single reaction mixture {le. using 
30 reagents, such as different fluorescent probes, for each marker) or in individual reaction 
mixtures corresponding to one or more of the markers. In one embodiment, a 
significantly enhanced level of expression of more than one of the plurality of markers 
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in the sample, relative to the corresponding normal levels, is an indication that the 
patient is afflicted with breast cancer. In another embodiment, a significantly lower 
level of expression in the sample of each of the plurality of markers, relative to the 
corresponding normal levels, is an indication that the patient is afflicted with breast 
5 cancer. In yet another embodiment, a significantly enhanced level of expression of one 
or more markers and a significantly lower level of expression of one or more markers in 
a sample relative to the corresponding normal levels, is an indication that the patient is 
afflicted with breast cancer. When a plurality of markers is used, it is preferred that 2, 3, 
4, 5, 8, 10, 12, 15, 20, 30, or 50 or more individual markers be used, wherein fewer 

1 0 markers are preferred. 

In order to maximize the sensitivity of the compositions, kits, and methods of the 
invention (i.e. by interference attributable to cells of non-breast origin in a patient 
sample), it is preferable that the marker of the invention used therein be a marker which 
has a restricted tissue distribution, e.g., normally not expressed in a non-breast tissue. 

1 5 Only a small number of markers are known to be associated with breast cancers 

(e.g. BRCA1 and BRCA2). These markers are not, of course, included among the 
markers of the invention, although they may be used together with one or more markers 
of the invention in a panel of markers, for example. It is well known that certain types 
of genes, such as oncogenes, tumor suppressor genes, growth factor-like genes, 

20 protease-like genes, and protein kinase-like genes are often involved with development 
of cancers of various types. Thus, among the markers of the invention, use of those 
which correspond to proteins which resemble known proteins encoded by known 
oncogenes and tumor suppressor genes, and those which correspond to proteins which 
resemble growth factors, proteases, and protein kinases are preferred. 

25 Known oncogenes and tumor suppressor genes include, for example, abl, abr, 

akt2, ape, bcUa, bcttfi, bc!3, bcr, brcal, brca2, cbl, ccndl, cdc42, cdk4, crk- II, 
csflrlfins, dbl, dec, dpc4lsmad4, e-cad, e2fllrbap, egfrlerbb-U elkl, elk3, eph, erg, etsl, 
ets2,fer,fgr/src2,fliJ/ergb2,fos,^s/fes,fraJ,fra2,fi/n, hck, hek, her2lerbb- 2/neti, 
her3/erbb-3, her4/erbb-4, hrasl, hst2, hstfl, igfbp2, ink4a, ink4b, int2lfgf3,jun,junb, 

30 jund, kip2, kit, b*as2a, kras2b, lek, lyn, mas, max, mcc, mdm2 t met, mlhl, mmplO, mos, 
msh2, msh3, msh6> myb, myba, mybb, myc, mycll, myen, nfl, nf2, nme2, nras,p53, 
pdgfb,phb, piml,pmsl,pms2,ptc,pten, rafl, rapla, rbl, rel, ret, rosl, ski, srcl, tall, 
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tgfbr2, tgfb3, tgfbr3, thral, thrb, tiaml, timp3, tjpl, tp53, trk, vav, vhl, vil2, wafl, wntl, 
wnt2, wtl, and yesl (Hesketh, 1997, In: The Oncogene and Tumour Suppressor Gene 
Facts Book, 2nd Ed., Academic Press; Fishel et aL, 1994, Science 266:1403-1405). 
Known growth factors include platelet-derived growth factor alpha, platelet- 
5 derived growth factor beta (simian sarcoma viral {v-sis} oncogene homolog), 

thrombopoietin (myeloproliferative leukemia virus oncogene ligand, megakaryocyte 
growth and development factor), erythropoietin, B cell growth factor, macrophage 
stimulating factor 1 (hepatocyte growth factor-like protein), hepatocyte growth factor 
(hepapoietin A), insulin-like growth factor 1 (somatomedia C), hepatoma-derived 

1 0 growth factor, amphiregulin (schwannoma-derived growth factor), bone morphogenetic 
proteins 1, 2, 3, 3 beta, and 4, bone morphogenetic protein 7 (osteogenic protein 1), bone 
morphogenetic protein 8 (osteogenic protein 2), connective tissue growth factor, 
connective tissue activation peptide 3, epidermal growth factor (EGF), teratocarcinoma- 
derived growth factor 1, endothelin, endothelin 2, endothelin 3, stromal cell-derived 

1 5 factor 1 , vascular endothelial growth factor (VEGF), VEGF-B, VEGF-C, placental 

growth factor (vascular endothelial growth factor-related protein), transforming growth 
factor alpha, transforming growth factor beta 1 and its precursors, transforming growth 
factor beta 2 and its precursors, fibroblast growth factor 1 (acidic), fibroblast growth 
factor 2 (basic), fibroblast growth factor 5 and its precursors, fibroblast growth factor 6 

20 and its precursors, fibroblast growth factor 7 (keratinocyte growth factor), fibroblast 
growth factor 8 (androgen-induced), fibroblast growth factor 9 (glia-activating factor), 
pleiotrophin (heparin binding growth factor 8, neurite growth-promoting factor 1), 
brain-derived neurotrophic factor, and recombinant glial growth factor 2. 

Known proteases include interleukin-1 beta convertase and its precursors, Mch6 

25 and its precursors, Mch2 isoform alpha, Mch4, Cpp32 isoform alpha, Lice2 gamma 
cysteine protease, Ich-lS, Ich-IL, Ich-2 and its precursors, TY protease, matrix 
metalloproteinase 1 (interstitial collagenase), matrix metalloproteinase 2 (gelatinase A, 
72kD gelatinase, 72kD type IV collagenase), matrix metalloproteinase 7 (matrilysin), 
matrix metalloproteinase 8 (neutrophil collagenase), matrix metalloproteinase 12 

30 (macrophage elastase), matrix metalloproteinase 13 (collagenase 3), metallopeptidase 1, 
cysteine-rich metalloprotease (disintegrin) and its precursors, subtilisin-like protease Pc8 
and its precursors, chymotrypsin, snake venom-like protease, cathepsin 1, cathepsin D 
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(lysosomal aspartyl protease), stromelysin, aminopeptidase N, plasminogen, tissue 
plasminogen activator, plasminogen activator inhibitor type II, and urokinase-type 
plasminogen activator. 

Known protein kinases include DAP kinase, serine/threonine protein kinases 
5 NIK, PK428, Krs-2, SAK, and EMK, interferon-inducible double stranded RNA 
dependent protein kinase, FAST kinase, AIM1, IPLl-Iike midbody-associated protein 
kinase- 1, NIMA-like protein kinase 1 (NLK1), the cyclin-dependent kinases (cdkl-10), 
checlq>oint kinase Chkl, Nek3 protein kinase, BMK1 beta kinase, Clkl, Clk2, Clk3, 
extracellular signal-regulated kinases 1, 3, and 6, cdc28 protein kinase 1, cdc28 protein 

10 kinase 2, pLK, Mytl , c-Jun N-terminal kinase 2, Cam kinase 1 , the MAP kinases, 

insulin-stimulated protein kinase 1, beta-adrenergic receptor kinase 2, ribosomal protein 
S6 kinase, kinase suppressor of ras-1 (KSR1), putative serine/threonine protein kinase 
Prk, PkB kinase, cAMP-dependent protein kinase, cGMP-dependent protein kinase, type 
II cGMP-dependent protein kinase, protein kinases Dyrk2, Dyrk3, and Dyrk4, Rho- 

15 associated coiled-coil containing protein kinase pl60ROCK, protein tyrosine kinase t- 
Rorl, Ste20-related kinases, cell adhesion kinase beta, protein kinase 3, stress-activated 
protein kinase 4, protein kinase Zpk, serine kinase hPAK65, dual specificity mitogen- 
activated protein kinases 1 and 2, casein kinase I gamma 2, p21 -activated protein kinase 
Pakl, lipid-activated protein kinase PRK2, focal adhesion kinase, dual-specificity 

20 tyrosine-phosphorylation regulated kinase, myosin light chain kinase, serine kinases 
SRPK2, TESK1, and VRK2, B lymphocyte serine/threonine protein kinase, stress- 
activated protein kinases JNK1 and JNK2, phosphorylase kinase, protein tyrosine kinase 
Tec, Jak2 kinase, protein kinase Ndr, MEK kinase 3, SHB adaptor protein (a Src 
homology 2 protein), agammaglobulinaemia protein-tyrosine kinase (Atk), protein 

25 kinase ATR, guanylate kinase 1 , thrombopoeitin receptor and its precursors, DAG 
kinase epsilon, and kinases encoded by oncogenes or viral oncogenes such as v-fgr 
(Gardner-Rasheed), v-abl (Abelson murine leukemia viral oncogene homolog 1), v-arg 
(Abelson murine leukemia viral oncogene homolog, Abelson-related gene), v-fes and v- 
fps (feline sarcoma viral oncogene and Fujinami avian sarcoma viral oncogene 

30 homologs), proto-oncogene c-cof, oncogene pim-\ , and oncogene masl . 
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It is recognized that the compositions, kits, and methods of the invention will be 
of particular utility to patients having an enhanced risk of developing breast cancer and 
their medical advisors. Patients recognized as having an enhanced risk of developing 
breast cancer include, for example, patients having a familial history of breast cancer, 
5 patients identified as having a mutant oncogene (i.e. at least one allele), and patients of 
advancing age {i.e. women older than about 50 or 60 years). 

The level of expression of a marker in normal (i.e. rion-cancerous) human breast 
tissue can be assessed in a variety of ways. In one embodiment, this normal level of 
expression is assessed by assessing the level of expression of the marker in a portion of 

10 breast cells which appears to be non-cancerous and by comparing this normal level of 
expression with the level of expression in a portion of the breast cells which is suspected 
of being cancerous. For example, when mammogrophy or other medical procedure, 
reveals the presence of a lump in a patient's breast, the normal level of expression of a 
marker may be assessed using the non-affected breast tissue, and this normal level of 

1 5 expression may be compared with the level of expression of the same marker in an 
affected portion (i.e. the lump) of the affected breast. Alternately, and particularly as 
further information becomes available as a result of routine performance of the methods 
described herein, population-average values for normal expression of the markers of the 
invention may be used. In other embodiments, the 'normal 1 level of expression of a 

20 marker may be determined by assessing expression of the marker in a patient sample 
obtained from a non-cancer-afflicted patient, from a patient sample obtained from a 
patient before the suspected onset of breast cancer in the patient, from archived patient 
samples, and the like. 

The invention includes compositions, kits, and methods for assessing the presence 

25 of breast cancer cells in a sample (e.g. an archived tissue sample or a sample obtained 
from a patient). These compositions, kits, and methods are substantially the same as 
those described above, except that, where necessary, the compositions, kits, and 
methods are adapted for use with samples other than patient samples. For example, 
when the sample to be used is a parafinized, archived human tissue sample, it can be 

30 necessary to adjust the ratio of compounds in the compositions of the invention, in the 
kits of the invention, or the methods used to assess levels of marker expression in the 
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sample. Such methods are well known in the art and within the skill of the ordinary 
artisan. 

The invention includes a kit for assessing the presence of breast cancer cells (e.g. 
in a sample such as a patient sample). The kit comprises a plurality of reagents, each of 
5 which is capable of binding specifically with a nucleic acid or polypeptide 

corresponding to a marker of the invention. Suitable reagents for binding with a 
polypeptide corresponding to a marker of the invention include antibodies, antibody 
derivatives, antibody fragments, and the like. Suitable reagents for binding with a 
nucleic acid (e.g. a genomic DNA, an mRNA, a spliced mRNA, a cDNA, or the like) 

10 include complementary nucleic acids. For example, the nucleic acid reagents may 
include oligonucleotides (labeled or non-labeled) fixed to a substrate, labeled 
oligonucleotides not bound with a substrate, pairs of PCR primers, molecular beacon 
probes, and the like. 

The kit of the invention may optionally comprise additional components useful 

1 5 for performing the methods of the invention. By way of example, the kit may comprise 
fluids (e.g. SSC buffer) suitable for annealing complementary nucleic acids or for 
binding an antibody with a protein with which it specifically binds, one or more sample 
compartments, an instructional material which describes performance of a method of the 
invention, a sample of normal breast cells, a sample of breast cancer cells, and the like. 

20 The invention also includes a method of making an isolated hybridoma which 

produces an antibody useful for assessing whether patient is afflicted with breast cancer. 
In this method, a protein corresponding to a marker of the invention is isolated (e.g. by 
purification from a cell in which it is expressed or by transcription and translation of a 
nucleic acid encoding the protein in vivo or in vitro using known methods). A 

25 vertebrate, preferably a mammal such as a mouse, rat, rabbit, or sheep, is immunized 
using the isolated protein or protein fragment. The vertebrate may optionally (and 
preferably) be immunized at least one additional time with the isolated protein or protein 
fragment, so that the vertebrate exhibits a robust immune response to the protein or 
protein fragment. Splenocytes are isolated from the immunized vertebrate and fused 

30 with an immortalized cell line to form hybridomas, using any of a variety of methods 
well known in the art. Hybridomas formed in this manner are then screened using 
standard methods to identify one or more hybridomas which produce an antibody which 
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specifically binds with the protein or protein fragment. The invention also includes 
hybridomas made by this method and antibodies made using such hybridomas. 

The invention also includes a method of assessing the efficacy of a test compound 
for inhibiting breast cancer cells. As described above, differences in the level of 

5 expression of the markers of the invention correlate with the cancerous state of breast 
cells. Although it is recognized that changes in the levels of expression of certain of the 
markers of the invention likely result from the cancerous state of breast cells, it is 
likewise recognized that changes in the levels of expression of other of the markers of 
the invention induce, maintain, and promote the cancerous state of those cells. Thus, 

1 0 compounds which inhibit breast cancer in a patient will cause the level of expression of 
one or more of the markers of the invention to change to a level nearer the normal level 
of expression for that marker (ie. the level of expression for the marker in non- 
cancerous breast cells). 

This method thus comprises comparing expression of a marker in a first breast 

1 5 cell sample and maintained in the presence of the test compound and expression of the 
marker in a second breast cell sample and maintained in the absence of the test 
compound. A significant alteration in the level of expression of a marker listed in 
Tables 1-6, may be is an indication that the test compound inhibits breast cancer {e.g. , 
decreases in expression in those markers that are over-expressed in breast cancer cells or 

20 more aggressive breast cancer cells and breast cancer cells from patients with poor 
clinical outcome or increases expression in those markers that are under-expressed in 
breast cancer cells or in more aggressive breast cancer cells or breast cancer cells from 
patients with poor clinical outcome. The breast cell samples may, for example, be 
aliquots of a single sample of normal breast cells obtained from a patient, pooled 

25 samples of normal breast cells obtained from a patient, cells of a normal breast cell line, 
aliquots of a single sample of breast cancer cells obtained from a patient, pooled samples 
of breast cancer cells obtained from a patient, cells of a breast cancer cell line, or the 
like. In one embodiment, the samples are breast cancer cells obtained from a patient and 
a plurality of compounds known to be effective for inhibiting various breast cancers are 

30 tested in order to identify the compound which is likely to best inhibit the breast cancer 
in the patient 
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This method may likewise be used to assess the efficacy of a therapy for 
inhibiting breast cancer in a patient. In this method, the level of expression of one or 
more markers of the invention in a pair of samples (one subjected to the therapy, the 
other not subjected to the therapy) is assessed. As with the method of assessing the 
5 efficacy of test compounds, if the therapy induces a significant alteration in the level of 
expression of a marker listed in Tables 1-6, or blocks induction of a marker listed in 
Tables 1-6, then the therapy may be efficacious for inhibiting breast cancer. As above, if 
samples from a selected patient are used in this method, then alternative therapies can be 
assessed in vitro in order to select a therapy most likely to be efficacious for inhibiting 

1 0 breast cancer in the patient. 

As described herein, breast cancer in patients is associated with levels of 
expression of one or more markers listed in Tables 1-6. While, as discussed above, 
some of these changes in expression level result from occurrence of the breast cancer, 
others of these changes induce, maintain, and promote the cancerous state of breast 

1 5 cancer cells. Thus, breast cancer characterized by an alteration in the level of expression 
of one or more markers listed in Tables 1-6 can be inhibited by hampering or increasing 
expression of those markers. 

Expression of a marker listed in Tables 1-6 can be inhibited in a number of ways 
generally known in the art. For example, an antisense oligonucleotide can be provided 

20 to the breast cancer cells in order to inhibit transcription, translation, or both, of the 
marker(s). Alternately, a polynucleotide encoding an antibody, an antibody derivative, 
or an antibody fragment, and operably linked with an appropriate promoter/regulator 
region, can be provided to the cell in order to generate intracellular antibodies which 
will inhibit the function or activity of the protein corresponding to the marker(s). Using 

25 the methods described herein, a variety of molecules, particularly including molecules 
sufficiently small that they are able to cross the cell membrane, can be screened in order 
to identify molecules which inhibit expression of the markers). The compound so 
identified can be provided to the patient in order to inhibit expression of the marker(s) in 
the breast cancer cells of the patient. 

30 Expression of a marker listed within Tables 1-6 can be enhanced in number of 

ways generally known in the art. For example, a polynucleotide encoding the marker 
and operably linked with an appropriate promoter/regulator region can be provided to 
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breast cancer cells of the patient in order to induce enhanced expression of the protein 
(and mRNA) corresponding to the marker therein. Alternatively, if the protein is 
capable of crossing the cell membrane, inserting itself in the cell membrane, or is 
normally a secreted protein, then expression of the protein can be enhanced by providing 
5 the protein (e.g. directly or by way of the bloodstream or another breast-associated fluid) 
to breast cancer cells in the patient. 

As described above, the cancerous state of human breast cells is correlated with 
changes in the levels of expression of the markers of the invention. The invention thus 
includes a method for assessing the human breast cell carcinogenic potential of a test 

10 compound. This method comprises maintaining separate aliquots of human breast cells 
in the presence and absence of the test compound. Expression of a marker of the 
invention in each of the aliquots is compared. A significant alteration in the level of 
expression of a marker listed in Tables 1-6 in the aliquot maintained in the presence of 
the test compound (relative to the aliquot maintained in the absence of the test 

1 5 compound) may be an indication that the test compound possesses human breast cell 
carcinogenic potential. The relative carcinogenic potentials of various test compounds 
can be assessed by comparing the degree of enhancement or inhibition of the level of 
expression of the relevant markers, by comparing the number of markers for which the 
level of expression is enhanced or inhibited, or by comparing both. 

20 Various aspects of the invention are described in further detail in the following 

subsections. 

I. Isolated Nucleic Acid Molecules 

One aspect of the invention pertains to novel isolated nucleic acid molecules that 

25 correspond to a marker of the invention, including nucleic acids which encode a 
polypeptide corresponding to a marker of the invention or a portion of such a 
polypeptide. Isolated nucleic acids of the invention also include nucleic acid molecules 
sufficient for use as hybridization probes to identify nucleic acid molecules that 
correspond to a marker of the invention, including nucleic acids which encode a 

30 polypeptide corresponding to a marker of the invention, and fragments of such nucleic 
acid molecules, e.g., those suitable for use as PCR primers for the amplification or 
mutation of nucleic acid molecules. As used herein, the term "nucleic acid molecule" is 
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intended to include DNA molecules {e.g., cDNA or genomic DNA) and RNA molecules 
(e.g., mRNA) and analogs of the DNA or RNA generated using nucleotide analogs. The 
nucleic acid molecule can be single-stranded or double-stranded, but preferably is 
double-stranded DNA. 

5 An "isolated" nucleic acid molecule is one which is separated from other nucleic 

acid molecules which are present in the natural source of the nucleic acid molecule. 
Preferably, an "isolated" nucleic acid molecule is free of sequences (preferably protein- 
encoding sequences) which naturally flank the nucleic acid (i.e., sequences located at the 
5' and 3' ends of the nucleic acid) in the genomic DNA of the organism from which the 

10 nucleic acid is derived. For example, in various embodiments, the isolated nucleic acid 
molecule can contain less than about 5 kB, 4 kB, 3 kB, 2 kB, 1 kB, 0.5 kB or 0.1 kB of 
nucleotide sequences which naturally flank the nucleic acid molecule in genomic DNA 
of the cell from which the nucleic acid is derived. Moreover, an "isolated" nucleic acid 
molecule, such as a cDNA molecule, can be substantially free of other cellular material, 

1 5 or culture medium when produced by recombinant techniques, or substantially free of 
chemical precursors or other chemicals when chemically synthesized. 

A nucleic acid molecule of the present invention, e.g., a nucleic acid encoding a 
protein corresponding to a marker listed in Tables 1-6, can be isolated using standard 
molecular biology techniques and the sequence information in the database records 

20 described herein. Using all or a portion of such nucleic acid sequences, nucleic acid 
molecules of the invention can be isolated using standard hybridization and cloning 
techniques (e.g., as described in Sambrook etal, ed., Molecular Cloning: A Laboratory 
Manual 2nd ed. f Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 
1989). 

25 A process for identifying a larger fragment or the full-length coding sequence of a 

marker of the present invention is thus also provided. Any conventional recombinant 
DNA techniques applicable for isolating polynucleotides may also be employed. One 
such method involves the 5 ! -RACE-PCR technique, in which the poly-A mRNA that 
contains the coding sequence of particular interest is first reverse transcribed with a 3'- 

30 primer comprising a sequence disclosed herein. The newly synthesized cDNA strand is 
then tagged with an anchor primer with a known sequence, which preferably contains a 
convenient cloning restriction site attached at the 5 r end. The tagged cDNA is then 
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amplified with the 3 -primer, (or a nested primer sharing sequence homology to the 
internal sequences of the coding region) and the 5'-anchor primer. The amplification 
may be conducted under conditions of various levels of stringency to optimize the 
amplification specificity. 5'-RACE-PCR can be readily performed using commercial kits 
5 (available from, e.g., BRL Life Technologies Inc., Clontech) according to the 
manufacturer's instructions. 

Isolating the complete coding sequence of a gene can also be carried out in a 
hybridization assay using a suitable probe. The probe preferably comprises at least 10 
nucleotides, and more preferably exhibits sequence homology to the polynucleotides of 
1 0 the markers of the present invention. Other high throughput screens for cDNAs, such as 
those involving gene chip technology, can also be employed in obtaining the complete 
cDNA sequence. 

In addition, databases exist that reduce the complexity of ESTs by assembling 
contiguous EST sequences into tentative genes. For example, TIGR has assembled 

1 5 human ESTs into a datable called THC for tentative human consensus sequences. The 
THC database allows for a more definitive assignment compared to ESTs alone. 
Software programs exist (TIGR assembler and TIGEM EST assembly machine and 
contig assembly program (see Huang, X . , 1996, Genomes 33:21-23)) that allow for 
assembling ESTs into contiguous sequences from any organism. 

20 Alternatively, mRNA from a sample preparation is used to construct cDNA 

library in the ZAP Express vector following the procedure described in Velculescu et 
al, 1997, Science 270:484. The ZAP Express cDNA synthesis kit (Stratagene) is used 
accordingly to the manufacturer's protocol. Plates containing 250 to .2000 plaques are 
hybridized as described in Rupert et al 9 1988, Mol. Cell Bio. 8:3104 to oligonucleotide 

25 probes with the same conditions previously described for standard probes except that the 
hybridization temperature is reduced to a room temperature. Washes are performed in 
6X standard-saline-citrate 0.1% SDS for 30 minutes at room temperature. The probes 
are labeled with 32 P-ATP trough use of T4 polynucleotide kinase. 

A partial cDNA (3' fragment) can be isolated by 3 ! directed PCR reaction. This 

30 procedure is a modification of the protocol described in Polyak et aL 9 1997, Nature 

389:300. Briefly, the procedure uses SAGE tags in PCR reaction such that the resultant 
PCR product contains the SAGE tag of interest as well as additional cDNA, the length 
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of which is defined by the position of the tag with respect to the 3* end of the cDNA. 
The cDNA product derived from such a transcript driven PCR reaction can be used for 
many applications. 

RNA from a source to express the cDNA corresponding to a given tag is first 
5 converted to double-stranded cDNA using any standard cDNA protocol. Similar 
conditions used to generate cDNA for SAGE library construction can be employed 
except that a modified oligp-dT primer is used to derive the first strand synthesis. For 
example, the oligonucleotide of composition 5 r -B-TCC GGC GCG CCG TTT TCC 
CAG TCA CGA(30)- 3', contains a poly-T stretch at the 3* end for hybridization and 

1 0 priming from poly-A tails, an Ml 3 priming site for use in subsequent PCR steps, a 5* 
Biotin label (B) for capture to strepavidin-coated magnetic beads, and an AscI restriction 
. endonuclease site for releasing the cDNA from the strepavidin-coated magnetic beads. 
Theoretically, any sufficiently-sized DNA region capable of hybridizing to a PCR 
primer can be used as well as any other 8 base pair recognizing endonuclease. 

1 5 cDNA constructed utilizing this or similar modified oligo-dT primer is then 

processed exactly as described in U.S. Patent No. 5,695,937 up until adapter ligation 
where only one adapter is ligated to the cDNA pool. After Adapter ligation, the cDNA 
is released from the streptavidin-coated magnetic beads and is then used as a template 
for cDNA amplification. 

20 Various PCR protocols can be employed using PCR priming sites within the 3' 

modified oligo-dT primer and the SAGE tag. The SAGE tag-derived PCR primer 
employed can be of varying length dictated by 5' extension of the tag into the adaptor 
sequence. cDNA products are now available for a variety of applications. 

This technique can be further modified by: (1) altering the length and/or content 

25 of the modified oligo-dT primer; (2) ligating adaptors other than that previously 

employed within the SAGE protocol; (3) performing PCR from template retained on the 
streptavidin-coated magnetic beads; and (4) priming first strand cDNA synthesis with 
non-oligo-dT based primers. 

Gene trapper technology can also be used. The reagents and manufacturer's 

30 instructions for this technology are commercially available from Life Technologies, Inc., 
Gaithsburg, Maryland. Briefly, a complex population of single-stranded phagemid 
DNA containing directional cDNA inserts is enriched for the target sequence by 
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hybridization in solution to a biotinylated oligonucleotide probe complementary to the 
target sequence. The hybrids are captured on streptavidin-coated paramagnetic beads. 
A magnet retrieves the paramagnetic beads from the solution, leaving nonhybridized 
single-stranded DNAs behind. Subsequently, the captured single-stranded DNA target 
5 is released from the biotinylated oligonucleotide. After release, the cDNA clone is 
further enriched by using a nonbiotinylated target oligonucleotide to specifically prime 
conversion of the single-stranded DNA. Following transformation and plating, typically 
20% to 100% of the colonies represent the cDNA clone of interest. To identify the 
desired cDNA clone, the colonies may be screened by colony hybridization using the 

10 32 P-labeled oligonucleotide as described above for solution hybridization, or 
alternatively by DNA sequencing and alignment of all sequences obtained from 
numerous clones to determine a consensus sequence. 

A nucleic acid molecule of the invention can be amplified using cDNA, mRNA, 
or genomic DNA as a template and appropriate oligonucleotide primers according to 

15 standard PCR amplification techniques. The nucleic acid so amplified can be cloned 
into an appropriate vector and characterized by DNA sequence analysis. Furthermore, 
oligonucleotides corresponding to all or a portion of a nucleic acid molecule of the 
invention can be prepared by standard synthetic techniques, e.g., using an automated 
DNA synthesizer. 

20 In another preferred embodiment, an isolated nucleic acid molecule of the 

invention comprises a nucleic acid molecule which has a nucleotide sequence 
complementary to the nucleotide sequence of a nucleic acid corresponding to a marker 
of the invention or to the nucleotide sequence of a nucleic acid encoding a protein which 
corresponds to a marker of the invention. A nucleic acid molecule which is 

25 complementary to a given nucleotide sequence is one which is sufficiently 

complementary to the given nucleotide sequence that it can hybridize to the given 
nucleotide sequence thereby forming a stable duplex. 

Moreover, a nucleic acid molecule of the invention can comprise only a portion of 
a nucleic acid sequence, wherein the full length nucleic acid sequence comprises a 

30 marker of the invention or which encodes a polypeptide corresponding to a marker of 
the invention. Such nucleic acids can be used, for example, as a probe or primer. The 
probe/primer typically is used as one or more substantially purified oligonucleotides. 
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The oligonucleotide typically comprises a region of nucleotide sequence that hybridizes 
under stringent conditions to at least about 7, preferably about 15, more preferably about 
25, 50, 75, 100, 125, 150, 175, 200, 250, 300, 350, or 400 or more consecutive 
nucleotides of a nucleic acid of the invention. 
5 Probes based on the sequence of a nucleic acid molecule of the invention can be 

used to detect transcripts or genomic sequences corresponding to one or more markers 
of the invention. The probe comprises a label group attached thereto, e.g., a 
radioisotope, a fluorescent compound, an enzyme, or an enzyme co-factor. Such probes 
can be used as part of a diagnostic test kit for identifying cells or tissues which mis- 

10 express the protein, such as by measuring levels of a nucleic acid molecule encoding the 
protein in a sample of cells from a subject, e.g., detecting mRNA levels or determining 
whether a gene encoding the protein has been mutated or deleted. 

The invention further encompasses nucleic acid molecules that differ, due to 
degeneracy of the genetic code, from the nucleotide sequence of nucleic acids encoding 

15 a protein which corresponds to a marker of the invention, and thus encode the same 
protein. 

In addition to the nucleotide sequences described herein, it will be appreciated by 
those skilled in the art that DNA sequence polymorphisms that lead to changes in the 
amino acid sequence can exist within a population (e.g., the human population). Such 

20 genetic polymorphisms can exist among individuals within a population due to natural 
allelic variation. An allele is one of a group of genes which occur alternatively at a 
given genetic locus. In addition, it will be appreciated that DNA polymorphisms that 
affect RNA expression levels can also exist that may affect the overall expression level 
of that gene (e.g., by affecting regulation or degradation). 

25 As used herein, the phrase "allelic variant" refers to a nucleotide sequence which 

occurs at a given locus or to a polypeptide encoded by the nucleotide sequence. 

As used herein, the terms "gene" and "recombinant gene" refer to nucleic acid 
molecules comprising an open reading frame encoding a polypeptide corresponding to a 
marker of the invention. Such natural allelic variations can typically result in 1-5% 

30 variance in the nucleotide sequence of a given gene. Alternative alleles can be identified 
by sequencing the gene of interest in a number of different individuals. This can be 
readily carried out by using hybridization probes to identify the same genetic locus in a 
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variety of individuals. Any and all such nucleotide variations and resulting amino acid 
polymorphisms or variations that are the result of natural allelic variation and that do not 
alter the functional activity are intended to be within the scope of the invention. 

In another embodiment, an isolated nucleic acid molecule of the invention is at 
5 least 7, 15, 20, 25, 30, 40, 60, 80, 100, 150, 200, 250, 300, 350, 400, 450, 550, 650, 700, 
800, 900, 1000, 1200, 1400, 1600, 1800, 2000, 2200, 2400, 2600, 2800, 3000, 3500, 
4000, 4500, or more nucleotides in length and hybridizes under stringent conditions to a 
nucleic acid corresponding to a marker of the invention or to a nucleic acid encoding a 
protein corresponding to a marker of the invention. As used herein, the term "hybridizes 

1 0 under stringent conditions" is intended to describe conditions for hybridization and 
washing under which nucleotide sequences at least 75% (80%, 85%, preferably 90%) 
identical to each other typically remain hybridized to each other. Such stringent 
conditions are known to those skilled in the art and can be found in sections 6.3.1-6.3.6 
of Current Protocols in Molecular Biology, John Wiley & Sons, N. Y. (1 989). A 

1 5 preferred, non-limiting example of stringent hybridization conditions for annealing two 
single-stranded DNA each of which is at least about 100 bases in length and/or for 
annealing a single-stranded DNA and a single-stranded RNA each of which is at least 
about 100 bases in length, are hybridization in 6X sodium chloride/sodium citrate (SSC) 
at about 45°C, followed by one or more washes in 0.2X SSC, 0,1% SDS at 50-65°C. 

20 Further preferred hybridization conditions are taught in Lockhart, et al, Nature 

Biotechnology, Volume 14, 1996 August: 1675- 1680; Breslauer, et al, Proc. Natl. Acad. 
Sci. USA, Volume 83, 1986 June: 3746-3750; Van Ness, et al, Nucleic Acids Research, 
Volume 19, No. 19, 1991 September: 5143-5151; McGraw, etal, BioTechniques, 
Volume 8, No. 6 1990: 674-678; and Milner, et al, Nature Biotechnology, Volume 15, 

25 1997 June: 537-541, all expressly incorporated by reference. 

In addition to naturally-occurring allelic variants of a nucleic acid molecule of the 
invention that can exist in the population, the skilled artisan will further appreciate that 
sequence changes can be introduced by mutation thereby leading to changes in the 
amino acid sequence of the encoded protein, without altering the biological activity of 

30 the protein encoded thereby. For example, one can make nucleotide substitutions 
leading to amino acid substitutions at "non-essential" amino acid residues. A "non- 
essential" amino acid residue is a residue that can be altered from the wild-type 
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sequence without altering the biological activity, whereas an "essential" amino acid 
residue is required for biological activity. For example, amino acid residues that are not 
conserved or only semi-conserved among homologs of various species may be non- 
essential for activity and thus would be likely targets for alteration. Alternatively, amino 
5 acid residues that are conserved among the homologs of various species (e.g., murine 
and human) may be essential for activity and thus would not be likely targets for 
alteration. 

Accordingly, another aspect of the invention pertains to nucleic acid molecules 
encoding a polypeptide of the invention that contain changes in amino acid residues that 

10 are not essential for activity. Such polypeptides differ in amino acid sequence from the 
naturally-occurring proteins which correspond to the markers of the invention, yet retain 
biological activity. In one embodiment, such a protein has an amino acid sequence that 
is at least about 40% identical, 50%, 60%, 70%, 80%, 90%, 95%, or 98% identical to the 
amino acid sequence of one of the proteins which correspond to the markers of the 

15 invention. 

An isolated nucleic acid molecule encoding a variant protein can be created by 
introducing one or more nucleotide substitutions, additions or deletions into the 
nucleotide sequence of nucleic acids of the invention, such that one or more amino acid 
residue substitutions, additions, or deletions are introduced into the encoded protein. 

20 Mutations can be introduced by standard techniques, such as site-directed mutagenesis 
and PCR-mediated mutagenesis. Preferably, conservative amino acid substitutions are 
made at one or more predicted non-essential amino acid residues. A "conservative 
amino acid substitution" is one in which the amino acid residue is replaced with an 
amino acid residue having a similar side chain. Families of amino acid residues having 

25 similar side chains have been defined in the art. These families include amino acids 
with basic side chains (e.g., lysine, arginine, histidine), acidic side chains (e.g., aspartic 
acid, glutamic acid), uncharged polar side chains (e.g., glycine, asparagine, glutamine, 
serine, threonine, tyrosine, cysteine), non-polar side chains (e.g., alanine, valine, leucine, 
isoleucine, proline, phenylalanine, methionine, tryptophan), beta-branched side chains 

30 (e.g., threonine, valine, isoleucine) and aromatic side chains (e.g., tyrosine, 

phenylalanine, tryptophan, histidine). Alternatively, mutations can be introduced 
randomly along all or part of the coding sequence, such as by saturation mutagenesis, 
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and the resultant mutants can be screened for biological activity to identify mutants that 
retain activity. Following mutagenesis, the encoded protein can be expressed 
recombinantly and the activity of the protein can be determined. 

The present invention encompasses antisense nucleic acid molecules, i.e., 

5 molecules which are complementary to a sense nucleic acid of the invention, e.g., 
complementary to the coding strand of a double-stranded cDNA molecule 
corresponding to a marker of the invention or complementary to an mRNA sequence 
corresponding to a marker of the invention. Accordingly, an antisense nucleic acid of 
the invention can hydrogen bond to (i.e. anneal with) a sense nucleic acid of the 

10 invention. The antisense nucleic acid can be complementary to an entire coding strand, 
or to only a portion thereof, e.g., all or part of the protein coding region (or open reading 
frame). An antisense nucleic acid molecule can also be antisense to all or part of a non- 
coding region of the coding strand of a nucleotide sequence encoding a polypeptide of 
the invention. The non-coding regions ("5 1 and 3' untranslated regions") are the 5* and 3* 

1 5 sequences which flank the coding region and are not translated into amino acids. 

An antisense oligonucleotide can be, for example, about 5, 10, 15, 20, 25, 30, 35, 
40, 45, or 50 or more nucleotides in length. An antisense nucleic acid of the invention 
can be constructed using chemical synthesis and enzymatic ligation reactions using 
procedures known in the art. For example, an antisense nucleic acid (e.g., an antisense 

20 oligonucleotide) can be chemically synthesized using naturally occurring nucleotides or 
variously modified nucleotides designed to increase the biological stability of the 
molecules or to increase the physical stability of the duplex formed between the 
antisense and sense nucleic acids, e.g., phosphorothioate derivatives and acridine 
substituted nucleotides can be used. Examples of modified nucleotides which can be 

25 used to generate the antisense nucleic acid include 5-fluorouracil, 5-bromouracil, 5- 
chlorouracil, 5-iodouracil, hypoxanthine, xanthine, 4-acetylcytosine, 5- 
(carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyl-2-thiouridine, 5- 
carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, inosine, 
N6-isopentenyladenine, 1-methylguanine, 1-methylinosine, 2,2-dimethylguanine, 2- 

30 methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 7- 
methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, beta- 
D-mannosylqueosine, 5 r -methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio- 
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N6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, 
queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouraciI, 4-thiouracil, 5- 
methyluracil, uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl- 
2-thiouracil, 3-(3-amino-3-N-2-carboxypropyl) uracil, (acp3)w, and 2,6-diaminopurine. 
5 Alternatively, the antisense nucleic acid can be produced biologically using an 
expression vector into which a nucleic acid has been sub-cloned in an antisense 
orientation (i.e., RNA transcribed from the inserted nucleic acid will be of an antisense 
orientation to a target nucleic acid of interest, described further in the following 
subsection). 

10 The antisense nucleic acid molecules of the invention are typically administered 

to a subject or generated in situ such that they hybridize with or bind to cellular mRNA 
and/or genomic DNA encoding a polypeptide corresponding to a selected marker of the 
invention to thereby inhibit expression of the marker, e.g., by inhibiting transcription 
and/or translation. The hybridization can be by conventional nucleotide 

15 complementarity to form a stable duplex, or, for example, in the case of an antisense 
nucleic acid molecule which binds to DNA duplexes, through specific interactions in the 
major groove of the double helix. Examples of a route of administration of antisense 
nucleic acid molecules of the invention includes direct injection at a tissue site or 
infusion of the antisense nucleic acid into an breast-associated body fluid. Alternatively, 

20 antisense nucleic acid molecules can be modified to target selected cells and then 
administered systemically. For example, for systemic administration, antisense 
molecules can be modified such that they specifically bind to receptors or antigens 
expressed on a selected cell surface, e.g., by linking the antisense nucleic acid molecules 
to peptides or antibodies which bind to cell surface receptors or antigens. The antisense 

25 nucleic acid molecules can also be delivered to cells using the vectors described herein. 
To achieve sufficient intracellular concentrations of the antisense molecules, vector 
constructs in which the antisense nucleic acid molecule is placed under the control of a 
strong pol II or pol III promoter are preferred. 

An antisense nucleic acid molecule of the invention can be an a-anomeric nucleic 

30 acid molecule. An a-anomeric nucleic acid molecule forms specific double-stranded 
hybrids with complementary RNA in which, contrary to the usual a-units, the strands 
run parallel to each other (Gaultier et al, 1 987, Nucleic Acids Res. 1 5 :6625-664 1 ). The 
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antisense nucleic acid molecule can also comprise a 2 f -o-methylribonucleotide (Inoue et 
al, 1987, Nucleic Acids Res. 15:6131-6148) or a chimeric RNA-DNA analogue (Inoue 
etal, \m,FEBSLett. 215:327-330). 

The invention also encompasses ribozymes. Ribozymes are catalytic RNA 
5 molecules with ribonuclease activity which are capable of cleaving a single-stranded 
nucleic acid, such as an mRNA, to which they have a complementary region. Thus, 
ribozymes (eg., hammerhead ribozymes as described in Haselhoff and Gerlach, 1988, 
Nature 334:585-591) can be used to catalytically cleave mRNA transcripts to thereby 
inhibit translation of the protein encoded by the mRNA. A ribozyme having specificity 

10 for a nucleic acid molecule encoding a polypeptide corresponding to a marker of the 
invention can be designed based upon the nucleotide sequence of a cDNA 
corresponding to the marker. For example, a derivative of a Tetrahymena L-l 9 IVS 
RNA can be constructed in which the nucleotide sequence of the active site is 
complementary to the nucleotide sequence to be cleaved (see Cech et al U.S. Patent No. 

15 4,987,071; and Cech et al U.S. Patent No. 5,116,742). Alternatively, an mRNA 

encoding a polypeptide of the invention can be used to select a catalytic RNA having a 
specific ribonuclease activity from a pool of RNA molecules (see, e.g., Bartel and 
Szostak, 1993, Science 261:141 1-1418). 

The invention also encompasses nucleic acid molecules which form triple helical 

20 structures. For example, expression of a polypeptide of the invention can be inhibited 
by targeting nucleotide sequences complementary to the regulatory region of the gene 
encoding the polypeptide (e.g. , the promoter and/or enhancer) to form triple helical 
structures that prevent transcription of the gene in target cells. See generally Helene 
(1991) Anticancer Drug Des. 6(6):569-84; Helene (1992) Ann. NY. Acad. Sci. 660:27- 

25 36; and Maher (1 992) Bioassays 1 4(1 2):807-l 5. 

In various embodiments, the nucleic acid molecules of the invention can be 
modified at the base moiety, sugar moiety or phosphate backbone to improve, e.g., the 
stability, hybridization, or solubility of the molecule. For example, the deoxyribose 
phosphate backbone of the nucleic acids can be modified to generate peptide nucleic 

30 acids (see Hyrup et al, 1996, Bioorganic & Medicinal Chemistiy 4(1): 5-23). As used 
herein, the terms "peptide nucleic acids" or "PNAs" refer to nucleic acid mimics, e.g., 
DNA mimics, in which the deoxyribose phosphate backbone is replaced by a 
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pseudopeptide backbone and only the four natural nucleobases are retained. The neutral 
backbone of PNAs has been shown to allow for specific hybridization to DNA and RNA 
under conditions of low ionic strength; The synthesis of PNA oligomers can be 
performed using standard solid phase peptide synthesis protocols as described in Hyrup 
5 et al (1996), supra; Perry-O'Keefe et al (1996) Proc. Natl Acad. Sci. USA 93:14670- 
675. 

PNAs can be used in therapeutic and diagnostic applications. For example, PNAs 
can be used as antisense or antigene agents for sequence-specific modulation of gene 
expression by, e.g., inducing transcription or translation arrest or inhibiting replication. 

10 PNAs can also be used, e.g. , in the analysis of single base pair mutations in a gene by, 
e.g., PNA directed PCR clamping; as artificial restriction enzymes when used in 
combination with other enzymes, e.g., SI nucleases (Hyrup (1996), supra; or as probes 
or primers for DNA sequence and hybridization (Hyrup, 1996, supra; Perry-O'Keefe et 
al, 1996, Proc. Natl Acad. Sci. USA 93:14670-675). 

15 In another embodiment, PNAs can be modified, e.g. , to enhance their stability or 

cellular uptake, by attaching lipophilic or other helper groups to PNA, by the formation 
of PNA-DNA chimeras, or by the use of liposomes or other techniques of drug delivery 
known in the art. For example, PNA-DNA chimeras can be generated which can 
combine the advantageous properties of PNA and DNA. Such chimeras allow DNA 

20 recognition enzymes, e.g., RNASE H and DNA polymerases, to interact with the DNA 
portion while the PNA portion would provide high binding affinity and specificity. 
PNA-DNA chimeras can be linked using linkers of appropriate lengths selected in terms 
of base stacking, number of bonds between the nucleobases, and orientation (Hyrup, 
1996, supra). The synthesis of PNA-DNA chimeras can be performed as described in 

25 Hyrup (1996), supra t and Finn et al. (1996) Nucleic Acids Res. 24(17):3357-63. For 
example, a DNA chain can be synthesized on a solid support using standard 
phosphoramidite coupling chemistry and modified nucleoside analogs. Compounds 
such as 5 '-(4-methoxytrityl)amino-5 ! -deoxy-thymidine phosphoramidite can be used as a 
link between the PNA and the 5' end of DNA (Mag et al, 1989, Nucleic Acids Res. 

30 1 7:5973-88). PNA monomers are then coupled in a step-wise manner to produce a 
chimeric molecule with a 5* PNA segment and a 3' DNA segment (Finn et al, 1996, 
Nucleic Acids Res. 24(17):3357-63). Alternatively, chimeric molecules can be 
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synthesized with a 5 f DNA segment and a 3 1 PNA segment (Peterser et al., 1975, 
Bioorganic Med Chem. Lett. 5:1119-11 124). 

In other embodiments, the oligonucleotide can include other appended groups 
such as peptides (e.g., for targeting host cell receptors in vivo), or agents facilitating 
5 transport across the cell membrane (see, e.g., Letsinger et al, 1989, Proc. Natl. Acad. 
Set USA 86:6553-6556; Lemaitre et al, 1987, Proc. Natl Acad. Sci. USA 84:648-652; 
PCT Publication No. WO 88/09810) or the blood-brain barrier (see, e.g., PCT 
Publication No. WO 89/10134). In addition, oligonucleotides can be modified with 
hybridization-triggered cleavage agents (see, e.g., Krol et al, 1988, Bio/Techniques 
10 6:958-976) or intercalating agents (see, e.g., Zon, 1988, Pharm. Res. 5:539-549). To 
this end, the oligonucleotide can be conjugated to another molecule, e.g., a peptide, 
hybridization triggered cross-linking agent, transport agent, hybridization-triggered 
cleavage agent, etc. 

The invention also includes molecular beacon nucleic acids having at least one 
15 region which is complementary to a nucleic acid of the invention, such that the 
molecular beacon is useful for quantitating the presence of the nucleic acid of the 
invention in a sample. A "molecular beacon" nucleic acid is a nucleic acid comprising a 
pair of complementary regions and having a fluorophore and a fluorescent quencher 
associated therewith. The fluorophore and quencher are associated with different 
20 portions of the nucleic acid in such an orientation that when the complementary regions 
are annealed with one another, fluorescence of the fluorophore is quenched by the 
quencher. When the complementary regions of the nucleic acid are not annealed with 
one another, fluorescence of the fluorophore is quenched to a lesser degree. Molecular 
beacon nucleic acids are described, for example, in U.S. Patent 5,876,930. 

25 

II. Isolated Proteins and Antibodies 

One aspect of the invention pertains to isolated proteins which correspond to 
individual markers of the invention, and biologically active portions thereof, as well as 
polypeptide fragments suitable for use as immunogens to raise antibodies directed 
30 against a polypeptide corresponding to a marker of the invention. In one embodiment, 
the native polypeptide corresponding to a marker can be isolated from cells or tissue 
sources by an appropriate purification scheme using standard protein purification 
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techniques. In another embodiment, polypeptides corresponding to a marker of the 
invention are produced by recombinant DNA techniques. Alternative to recombinant 
expression, a polypeptide corresponding to a marker of the invention can be synthesized 
chemically using standard peptide synthesis techniques. 
5 An "isolated" or "purified" protein or biologically active portion thereof is 

substantially free of cellular material or other contaminating proteins from the cell or 
tissue source from which the protein is derived, or substantially free of chemical 
precursors or other chemicals when chemically synthesized. The language 
"substantially free of cellular material" includes preparations of protein in which the 

10 protein is separated from cellular components of the cells from which it is isolated or 
recombinantly produced. Thus, protein that is substantially free of cellular material 
includes preparations of protein having less than about 30%, 20%, 10%, or 5% (by dry 
weight) of heterologous protein (also referred to herein as a "contaminating protein"). 
When the protein or biologically active portion thereof is recombinantly produced, it is 

1 5 also preferably substantially free of culture medium, I e. , culture medium represents less 
than about 20%, 10%, or 5% of the volume of the protein preparation. When the protein 
is produced by chemical synthesis, it is preferably substantially free of chemical 
precursors or other chemicals, f.e., it is separated from chemical precursors or other 
chemicals which are involved in the synthesis of the protein. Accordingly such 

20 preparations of the protein have less than about 30%, 20%, 10%, 5% (by dry weight) of 
chemical precursors or compounds other than the polypeptide of interest. 

Biologically active portions of a polypeptide corresponding to a marker of the 
invention include polypeptides comprising amino acid sequences sufficiently identical to 
or derived from the amino acid sequence of the protein corresponding to the marker, 

25 which include fewer amino acids than the frill length protein, and exhibit at least one 
activity of the corresponding full-length protein. Typically, biologically active portions 
comprise a domain or motif with at least one activity of the corresponding protein. A 
biologically active portion of a protein of the invention can be a polypeptide which is, 
for example, 10, 25, 50, 100 or more amino acids in length. Moreover, other 

30 biologically active portions, in which other regions of the protein are deleted, can be 
prepared by recombinant techniques and evaluated for one or more of the fimctional 
activities of the native form of a polypeptide of the invention. 
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Preferred polypeptides have amino acid sequences encoded by the nucleic acid 
sequences described herein. Other useful proteins are substantially identical (e.g., at 
least about 40%, preferably 50%, 60%, 70%, 80%, 90%, 95%, or 99%) to one of these 
sequences and retain the functional activity of the protein of the corresponding 
5 naturally-occurring protein yet differ in amino acid sequence due to natural allelic 
variation or mutagenesis. 

To determine the percent identity of two amino acid sequences or of two nucleic 
acids, the sequences are aligned for optimal comparison purposes (e.g., gaps can be 
introduced in the sequence of a first amino acid or nucleic acid sequence for optimal 

10 alignment with a second amino or nucleic acid sequence). The amino acid residues or 
. nucleotides at corresponding amino acid positions or nucleotide positions are then 
compared. When a position in the first sequence is occupied by the same amino acid 
residue or nucleotide as the corresponding position in the second sequence, then the 
molecules are identical at that position. The percent identity between the two sequences 

15 is a function of the number of identical positions shared by the sequences (i.e., % 

identity = # of identical positions/total # of positions (e.g., overlapping positions) xlOO). 
In one embodiment the two sequences are the same length. 

The determination of percent identity between two sequences can be 
accomplished using a mathematical algorithm. A preferred, non-limiting example of a 

20 mathematical algorithm utilized for the comparison of two sequences is the algorithm of 
Karlin and Altschul (1990) Proc. Natl Acad. Set USA 87:2264-2268, modified as in 
Karlin and Altschul (1993) Proc. Natl Acad Set USA 90:5873-5877. Such an 
algorithm is incorporated into the NBLAST and XBLAST programs of Altschul, et al 
(1990) J. Mol Biol. 215:403-410. BLAST nucleotide searches can be performed with 

25 the NBLAST program, score = 100, wordlength = 12 to obtain nucleotide sequences 
homologous to a nucleic acid molecules of the invention. BLAST protein searches can 
be performed with the XBLAST program, score = 50, wordlength = 3 to obtain amino 
acid sequences homologous to a protein molecules of the invention. To obtain gapped 
alignments for comparison purposes, Gapped BLAST can be utilized as described in 

30 Altschul et al (1997) Nucleic Acids Res. 25:3389-3402. Alternatively, PSI-Blast can be 
used to perform an iterated search which detects distant relationships between 
molecules. When utilizing BLAST, Gapped BLAST, and PSI-Blast programs, the 
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default parameters of the respective programs (e.g., XBLAST and NBLAST) can be 
used. See http://www.ncbi.nlm.nih.gov. Another preferred, non-limiting example of a 
mathematical algorithm utilized for the comparison of sequences is the algorithm of 
Myers and Miller, (1988) CABIOS 4:1 1-17. Such an algorithm is incoiporated into the 

5 ALIGN program (version 2.0) which is part of the GCG sequence alignment software 
package. When utilizing the ALIGN program for comparing amino acid sequences, a 
PAM120 weight residue table, a gap length penalty of 12, and a gap penalty of 4 can be 
used. Yet another useful algorithm for identifying regions of local sequence similarity 
and alignment is the FASTA algorithm as described in Pearson and Lipman (1988) 

10 Proc. Natl Acad. Set USA 85:2444-2448. When using the FASTA algorithm for 

comparing nucleotide or amino acid sequences, a PAM120 weight residue table can, for 
example, be used with a /c-tuple value of 2. 

The percent identity between two sequences can be determined using techniques 
similar to those described above, with or without allowing gaps. In calculating percent 

1 5 identity, only exact matches are counted. 

The invention also provides chimeric or fusion proteins corresponding to a 
marker of the invention. As used herein, a "chimeric protein" or "fusion protein" 
comprises all or part (preferably a biologically active part) of a polypeptide 
corresponding to a marker of the invention operably linked to a heterologous 

20 polypeptide (i. e. , a polypeptide other than the polypeptide corresponding to the marker). 
Within the fusion protein, the term "operably linked" is intended to indicate that the 
polypeptide of the invention and the heterologous polypeptide are fused in-frame to each 
other. The heterologous polypeptide can be fused to the amino-terminus or the 
carboxyl-terminus of the polypeptide of the invention. 

25 One useful fusion protein is a GST fusion protein in which a polypeptide 

corresponding to a marker of the invention is fused to the carboxyl terminus of GST 
sequences. Such fusion proteins can facilitate the purification of a recombinant 
polypeptide of the invention. 

In another embodiment, the fusion protein contains a heterologous signal 

30 sequence at its amino terminus. For example, the native signal sequence of a 

polypeptide corresponding to a marker of the invention can be removed and replaced 
with a signal sequence from another protein. For example, the gp67 secretory sequence 
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of the baculovirus envelope protein can be used as a heterologous signal sequence 
(Ausubel et al 9 ed., Current Protocols in Molecular Biology, John Wiley & Sons, NY, 
1992). Other examples of eukaryotic heterologous signal sequences include the 
secretory sequences of melittin and human placental alkaline phosphatase (Stratagene; 

5 La Jolla, California). In yet another example, useful prokaryotic heterologous signal 
sequences include the phoA secretory signal (Sambrook et al, supra) and the protein A 
secretory signal (Pharmacia Biotech; Piscataway, New Jersey). 

In yet another embodiment, the fusion protein is an immunoglobulin fusion 
protein in which all or part of a polypeptide corresponding to a marker of the invention 

10 is fused to sequences derived from a member of the immunoglobulin protein family. 
The immunoglobulin fusion proteins of the invention can be incorporated into 
pharmaceutical compositions and administered to a subject to inhibit an interaction 
between a ligand (soluble or membrane-bound) and a protein on the surface of a cell 
(receptor), to thereby suppress signal transduction in vivo. The immunoglobulin fusion 

1 5 protein can be used to affect the bioavailability of a cognate ligand of a polypeptide of 
the invention. Inhibition of ligand/receptor interaction can be useful therapeutically, 
both for treating proliferative and differentiative disorders and for modulating (e.g. 
promoting or inhibiting) cell survival. Moreover, the immunoglobulin fusion proteins of 
the invention can be used as immunogens to produce antibodies directed against a 

20 polypeptide of the invention in a subject, to purify ligands and in screening assays to 
identify molecules which inhibit the interaction of receptors with ligands. 

Chimeric and fusion proteins of the invention can be produced by standard 
recombinant DNA techniques. In another embodiment, the fusion gene can be 
synthesized by conventional techniques including automated DNA synthesizers. 

25 Alternatively, PCR amplification of gene fragments can be carried out using anchor 
primers which give rise to complementary overhangs between two consecutive gene 
fragments which can subsequently be annealed and re-amplified to generate a chimeric 
gene sequence (see, e.g., Ausubel et al. f supra). Moreover, many expression vectors are 
commercially available that already encode a fusion moiety (e.g., a GST polypeptide). 

30 A nucleic acid encoding a polypeptide of the invention can be cloned into such an 

expression vector such that the fusion moiety is linked in-frame to the polypeptide of the 
invention. 
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A signal sequence can be used to facilitate secretion and isolation of the secreted 
protein or other proteins of interest. Signal sequences are typically characterized by a 
core of hydrophobic amino acids which are generally cleaved from the mature protein 
during secretion in one or more cleavage events. Such signal peptides contain 
5 processing sites that allow cleavage of the signal sequence from the mature proteins as 
they pass through the secretory pathway. Thus, the invention pertains to the described 
polypeptides having a signal sequence, as well as to polypeptides from which the signal 
sequence has been proteolytically cleaved (i.e., the cleavage products). In one 
embodiment, a nucleic acid sequence encoding a signal sequence can be operably linked 

10 in an expression vector to a protein of interest, such as a protein which is ordinarily not 
secreted or is otherwise difficult to isolate. The signal sequence directs secretion of the 
protein, such as from a eukaryotic host into which the .expression vector is transformed, 
and the signal sequence is subsequently or concurrently cleaved. The protein can then 
be readily purified from the extracellular medium by art recognized methods. 

1 5 Alternatively, the signal sequence can be linked to the protein of interest using a 
sequence which facilitates purification, such as with a GST domain. 

The present invention also pertains to variants of the polypeptides corresponding 
to individual markers of the invention. Such variants have an altered amino acid 
sequence which can function as either agonists (mimetics) or as antagonists. Variants 

20 can be generated by mutagenesis, e.g., discrete point mutation or truncation. An agonist 
can retain substantially the same, or a subset, of the biological activities of the naturally 
occurring form of the protein. An antagonist of a protein can inhibit one or more of the 
activities of the naturally occurring form of the protein by, for example, competitively 
binding to a downstream or upstream member of a cellular signaling cascade which 

25 includes the protein of interest. Thus, specific biological effects can be elicited by 
treatment with a variant of limited function. Treatment of a subject with a variant 
having a subset of the biological activities of the naturally occurring form of the protein 
can have fewer side effects in a subject relative to treatment with the naturally occurring 
form of the protein. 

30 Variants of a protein of the invention which function as either agonists (mimetics) 

or as antagonists can be identified by screening combinatorial libraries of mutants, e.g., 
truncation mutants, of the protein of the invention for agonist or antagonist activity. In 
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one embodiment, a variegated library of variants is generated by combinatorial 
mutagenesis at the nucleic acid level and is encoded by a variegated gene library. A 
variegated library of variants can be produced by, for example, enzymatically ligating a 
mixture of synthetic oligonucleotides into gene sequences such that a degenerate set of 
5 potential protein sequences is expressible as individual polypeptides, or alternatively, as 
a set of larger fusion proteins (e.g., for phage display). There are a variety of methods 
which can be used to produce libraries of potential variants of the polypeptides of the 
invention from a degenerate oligonucleotide sequence. Methods for synthesizing 
degenerate oligonucleotides are known in the art (see, e.g., Narang, 1983, Tetrahedron 

10 39:3; Itakura et al t 1984, Annu. Rev. Biochem. 53:323; Itakura et al, 1984, Science 
198:1056; Ike etal, 1983 Nucleic Acid Res. 11:477). 

In addition, libraries of fragments of the coding sequence of a polypeptide 
corresponding to a marker of the invention can be used to generate a variegated 
population of polypeptides for screening and subsequent selection of variants. For 

15 example, a library of coding sequence fragments can be generated by treating a double 
stranded PCR fragment of the coding sequence of interest with a nuclease under 
conditions wherein nicking occurs only about once per molecule, denaturing the double 
stranded DNA, renaturing the DNA to form double stranded DNA which can include 
sense/antisense pairs from different nicked products, removing single stranded portions 

20 from reformed duplexes by treatment with S 1 nuclease, and ligating the resulting 

fragment library into an expression vector. By this method, an expression library can be 
derived which encodes amino terminal and internal fragments of various sizes of the 
protein of interest. 

Several techniques are known in the art for screening gene products of 

25 combinatorial libraries made by point mutations or truncation, and for screening cDNA 
libraries for gene products having a selected property. The most widely used 
techniques, which are amenable to high through-put analysis, for screening large gene 
libraries typically include cloning the gene library into replicable expression vectors, 
transforming appropriate cells with the resulting library of vectors, and expressing the 

30 combinatorial genes under conditions in which detection of a desired activity facilitates 
isolation of the vector encoding the gene whose product was detected. Recursive 
ensemble mutagenesis (REM), a technique which enhances the frequency of functional 
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mutants in the libraries, can be used in combination with the screening assays to identify 
variants of a protein of the invention (Arkin and Yourvan, 1992, Proc. Natl Acad. Set 
USA 59:781 1-7815; Delgrave et aL, 1993, Protein Engineering 6(3):327- 331). 

An isolated polypeptide corresponding to a marker of the invention, or a fragment 
5 thereof, can be used as an immunogen to generate antibodies using standard techniques 
for polyclonal and monoclonal antibody preparation. The full-length polypeptide or 
protein can be used or, alternatively, the invention provides antigenic peptide fragments 
for use as immunogens. The antigenic peptide of a protein of the invention comprises at 
least 8 (preferably 1 0, 1 5, 20, or 30 or more) amino acid residues of the amino acid 

10 sequence of one of the polypeptides of the invention, and encompasses an epitope of the 
protein such that an antibody raised against the peptide forms a specific immune 
complex with a marker of the invention to which the protein corresponds. Preferred 
epitopes encompassed by the antigenic peptide are regions that are located on the 
surface of the protein, e.g., hydrophilic regions. Hydrophobicity sequence analysis, 

1 5 hydrophilicity sequence analysis, or similar analyses can be used to identify hydrophilic 
regions. 

An immunogen typically is used to prepare antibodies by immunizing a suitable 
(i.e. immunocompetent) subject such as a rabbit, goat, mouse, or other mammal or 
vertebrate. An appropriate immunogenic preparation can contain, for example, 

20 recombinantly-expressed or chemically-synthesized polypeptide. The preparation can 
further include an adjuvant, such as Freund's complete or incomplete adjuvant, or a 
similar immunostimulatory agent. 

Accordingly, another aspect of the invention pertains to antibodies directed 
against a polypeptide of the invention. The terms "antibody" and "antibody substance" 

25 as used interchangeably herein refer to immunoglobulin molecules and immunologically 
active portions of immunoglobulin molecules, i.e., molecules that contain an antigen 
binding site which specifically binds an antigen, such as a polypeptide of the invention, 
e.g., an epitope of a polypeptide of the invention. A molecule which specifically binds 
to a given polypeptide of the invention is a molecule which binds the polypeptide, but 

30 does not substantially bind other molecules in a sample, e.g. , a biological sample, which 
naturally contains the polypeptide. Examples of immunologically active portions of 
immunoglobulin molecules include F(ab) and F(ab') 2 fragments which can be generated 
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by treating the antibody with an enzyme such as pepsin. The invention provides 
polyclonal and monoclonal antibodies. The term "monoclonal antibody" or 
"monoclonal antibody composition", as used herein, refers to a population of antibody 
molecules that contain only one species of an antigen binding site capable of 
5 immunoreacting with a particular epitope. 

Polyclonal antibodies can be prepared as described above by immunizing a 
suitable subject with a polypeptide of the invention as an immunogen. Preferred 
polyclonal antibody compositions are ones that have been selected for antibodies 
directed against a polypeptide or polypeptides of the invention. Particularly preferred 

10 polyclonal antibody preparations are ones that contain only antibodies directed against a 
polypeptide or polypeptides of the invention. Particularly preferred immunogen 
compositions are those that contain no other human proteins such as, for example, 
immunogen compositions made using a non-human host cell for recombinant expression 
of a polypeptide of the invention. In such a manner, the only human epitope or epitopes 

15 recognized by the resulting antibody compositions raised against this immunogen will 
be present as part of a polypeptide or polypeptides of the invention. 

The antibody titer in the immunized subject can be monitored over time by 
standard techniques, such as with an enzyme linked immunosorbent assay (ELISA) 
using immobilized polypeptide. If desired, the antibody molecules can be harvested or 

20 isolated from the subject (e.g., from the blood or serum of the subject) and further 

purified by well-known techniques, such as protein A chromatography to obtain the IgG 
fraction. Alternatively, antibodies specific for a protein or polypeptide of the invention 
can be selected or (e.g., partially purified) or purified by, e.g., affinity chromatography. 
For example, a recombinantly expressed and purified (or partially purified) protein of 

25 the invention is produced as described herein, and covalently or non-covalently coupled 
to a solid support such as, for example, a chromatography column. The column can then 
be used to affinity purify antibodies specific for the proteins of the invention from a 
sample containing antibodies directed against a large number of different epitopes, 
thereby generating a substantially purified antibody composition, i.e., one that is 

30 substantially free of contaminating antibodies. By a substantially purified antibody 
composition is meant, in this context, that the antibody sample contains at most only 
30% (by dry weight) of contaminating antibodies directed against epitopes other than 



WO 01/51628 



PCTAJS01/00798 



- 48 - 

those of the desired protein or polypeptide of the invention, and preferably at most 20%, 
yet more preferably at most 10%, and most preferably at most 5% (by dry weight) of the 
sample is contaminating antibodies. A purified antibody composition means that at least 
99% of the antibodies in the composition are directed against the desired protein or 

5 polypeptide of the invention. 

At an appropriate time after immunization, e.g., when the specific antibody titers 
are highest, antibody-producing cells can be obtained from the subject and used to 
prepare monoclonal antibodies by standard techniques, such as the hybridoma technique 
originally described by Kohler and Milstein (1975) Nature 256:495-497, the human B 

10 cell hybridoma technique (see Kozbor et al, 1983, Immunol Today 4:72), the EBV- 
hybridoma technique (see Cole et al, pp. 77-96 In Monoclonal Antibodies and Cancer 
Therapy, Alan R. Liss, Inc., 1985) or trioma techniques. The technology for producing 
hybridomas is well known (see generally Current Protocols in Immunology, Coligan et 
al. ed., John Wiley & Sons, New York, 1994). Hybridoma cells producing a 

15 monoclonal antibody of the invention are detected by screening the hybridoma culture 
supernatants for antibodies that bind the polypeptide of interest, e.g., using a standard 
ELISA assay. 

Alternative to preparing monoclonal antibody-secreting hybridomas, a 
monoclonal antibody directed against a polypeptide of the invention can be identified 

20 and isolated by screening a recombinant combinatorial immunoglobulin library (e.g., an 
antibody phage display library) with the polypeptide of interest. Kits for generating and 
screening phage display libraries are commercially available (e.g., the Pharmacia 
Recombinant Phage Antibody System, Catalog No. 27-9400-01 ; and the Stratagene 
SurfZAP Phage Display Kit, Catalog No. 2406 1 2). Additionally, examples of methods 

25 and reagents particularly amenable for use in generating and screening antibody display 
library can be found in, for example, U.S. Patent No. 5,223,409; PCT Publication No. 
WO 92/18619; PCT Publication No. WO 91/17271 ; PCT Publication No. WO 
92/20791; PCT Publication No. WO 92/15679; PCT Publication No. WO 93/01288; 
PCT Publication No. WO 92/01047; PCT Publication No. WO 92/09690; PCT 

30 Publication No. WO 90/02809; Fuchs et al (1991) Bio/Technology 9:1370-1372; Hay et 
al (1992) Hum. Antibod. Hybridomas 3:81-85; Huse et al. (1989) Science 246:1275- 
1281; Griffiths etal. (1993)£AffiO J. 12:725-734. 
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Additionally, recombinant antibodies, such as chimeric and humanized 
monoclonal antibodies, comprising both human and non-human portions, which can be 
made using standard recombinant DNA techniques, are within the scope of the 
invention. A chimeric antibody is a molecule in which different portions are derived 
5 from different animal species, such as those having a variable region derived from a 
murine mAb and a human immunoglobulin constant region. (See, e.g., Cabilly et al., 
U.S. Patent No. 4,816,567; and Boss et al., U.S. Patent No. 4,816,397, which are 
incorporated herein by reference in their entirety.) Humanized antibodies are antibody 
molecules from non-human species having one or more complementarily determining 

10 regions (CDRs) from the non-human species and a framework region from a human 
immunoglobulin molecule. (See, e.g., Queen, U.S. Patent No. 5,585,089, which is 
incorporated herein by reference in its entirety.) Such chimeric and humanized 
monoclonal antibodies can be produced by recombinant DNA techniques known in the 
art, for example using methods described in PCT Publication No. WO 87/02671; 

1 5 European Patent Application 1 84,1 87; European Patent Application 1 7 1 ,496; European 
Patent Application 173,494; PCT Publication No. WO 86/01533; U.S. Patent No. 
4,816,567; European Patent Application 125,023; Better et al (1988) Science 240:1041- 
1043; Liu et al. (1987) Proc. Natl Acad Scl USA 84:3439-3443; Liu et al (1987) J. 
Immunol 139:3521-3526; Sunetal (1987) Proa Natl Acad ScL USA 84:214-218; 

20 Nishimura et al (1987) Cancer Res. 47:999-1005; Wood et al (1985) Nature 314:446- 
449; and Shaw et al (1988) Natl Cancer Inst. 80:1553-1559); Morrison (1985) 
Science 229:1202-1207; Oi et al (1986) Bio/Techniques 4:214; U.S. Patent 5,225,539; 
Jones et al (1986) Nature 321:552-525; Verhoeyan et al (1988) Science 239:1534; and 
Beidler etal (1988) J. Immunol 141:4053-4060. 

25 Antibodies of the invention may be used as therapeutic agents in treating cancers. 

In a preferred embodiment, completely human antibodies of the invention are used for 
therapeutic treatment of human cancer patients, particularly those having breast cancer. 
Such antibodies can be produced, for example, using transgenic mice which are 
incapable of expressing endogenous immunoglobulin heavy and light chains genes, but 

30 which can express human heavy and light chain genes. The transgenic mice are 
immunized in the normal fashion with a selected antigen, e.g., all or a portion of a 
polypeptide corresponding to a marker of the invention. Monoclonal antibodies directed 
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against the antigen can be obtained using conventional hybridoma technology. The 
human immunoglobulin transgenes harbored by the transgenic mice rearrange during B 
cell differentiation, and subsequently undergo class switching and somatic mutation. 
Thus, using such a technique, it is possible to produce therapeutically useful IgG, IgA 
5 and IgE antibodies. For an overview of this technology for producing human antibodies, 
see Lonberg and Huszar (1995) Int. Rev. Immunol 13:65-93). For a detailed discussion 
of this technology for producing human antibodies and human monoclonal antibodies 
and protocols for producing such antibodies, see, e.g., U.S. Patent 5,625,126; U.S. 
Patent 5,633,425; U.S. Patent 5,569,825; U.S. Patent 5,661,016; and U.S. Patent 

10 5,545,806. In addition, companies such as Abgenix, Inc. (Freemont, CA), can be 
engaged to provide human antibodies directed against a selected antigen using 
technology similar to that described above. 

Completely human antibodies which recognize a selected epitope can be 
generated using a technique referred to as "guided selection." In this approach a 

1 5 selected non-human monoclonal antibody, e.g. , a murine antibody, is used to guide the 
selection of a completely human antibody recognizing the same epitope (Jespers et al. y 
1994, Bio/technology 12:899-903). 

An antibody directed against a polypeptide corresponding to a marker of the 
invention (e.g.* a monoclonal antibody) can be used to isolate the polypeptide by 

20 standard techniques, such as affinity chromatography or immunoprecipitation. 

Moreover, such an antibody can be used to detect the marker (e.g., in a cellular lysate or 
cell supernatant) in order to evaluate the level and pattern of expression of the marker. 
The antibodies can also be used diagnostically to monitor protein levels in tissues or 
body fluids (e.g. in an ovary-associated body fluid) as part of a clinical testing 

25 procedure, e.g., to, for example, determine the efficacy of a given treatment regimen. 
Detection can be facilitated by coupling the antibody to a detectable substance. 
Examples of detectable substances include various enzymes, prosthetic groups, 
fluorescent materials, luminescent materials, bioluminescent materials, and radioactive 
materials. Examples of suitable enzymes include horseradish peroxidase, alkaline 

30 phosphatase, P-galactosidase, or acetylcholinesterase; examples of suitable prosthetic 
group complexes include streptavidin/biotin and avidin/biotin; examples of suitable 
fluorescent materials include umbelliferone, fluorescein, fluorescein isothiocyanate, 
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rhodamine, dichlorotriazinylamine fluorescein, dansyl chloride or phycoerythrin; an 
example of a luminescent material includes luminol; examples of bioluminescent 
materials include luciferase, luciferin, and aequorin, and examples of suitable 

• 1 . t ^ I25 X I3I T 35 ^ 3 TX 

radioactive material mclude I, I, S or H. 
5 Further, an antibody (or fragment thereof) can be conjugated to a 

therapeutic moiety such as a cytotoxin, a therapeutic agent or a radioactive metal ion. A 
cytotoxin or cytotoxic agent includes any agent that is detrimental to cells. Examples 
include taxol, cytochalasin B, gramicidin D, ethidium bromide, emetine, mitomycin, 
etoposide, tenoposide, vincristine, vinblastine, colchicin, doxorubicin, daunorubicin, 

1 0 dihydroxy anthracin dione, mitoxantrone, mithramycin, actinomycin D, 

1-dehydrotestosterone, glucocorticoids, procaine, tetracaine, lidocaine, propranolol, and 
puromycin and analogs or homologs thereof. Therapeutic agents include, but are not 
limited to, antimetabolites (e.g., methotrexate, 6-mercaptopurine, 6-thioguanine, 
cytarabine, 5-fluorouracil decarbazine), alkylating agents (e.g., mechlorethamine, 

15 thioepa chlorambucil, melphalan, carmustine (BSNU) and lomustine (CCNU), 
cyclothosphamide, busulfan, dibromomannitol, streptozotocin, mitomycin C, and 
cis-dichlorodiamine platinum (II) (DDP) cisplatin), anthracyclines (e.g., daunorubicin 
(formerly daunomycin) and doxorubicin), antibiotics (e.g., dactinomycin (formerly 
actinomycin), bleomycin, mithramycin, and anthramycin (AMC)), and anti-mitotic 

20 agents (e.g., vincristine and vinblastine). 

The conjugates of the invention can be used for modifying a given biological 
response, the drug moiety is not to be construed as limited to classical chemical 
therapeutic agents. For example, the drug moiety may be a protein or polypeptide 
possessing a desired biological activity. Such proteins may include, for example, a toxin 

25 such as abrin, ricin A, pseudomonas exotoxin, or diphtheria toxin; a protein such as 
tumor necrosis factor, .alpha.-interferon, .beta.-interferon, nerve growth factor, platelet 
derived growth factor, tissue plasminogen activator, or, biological response modifiers 
such as, for example, lymphokines, interleukin-1 ("IL-1"), interleukin-2 ("IL-2"), 
interleukin-6 ("IL-6"), granulocyte macrophase colony stimulating factor ("GM-CSF"), 

30 granulocyte colony stimulating factor ("G-CSF"), or other growth factors. 
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Techniques for conjugating such therapeutic moiety to antibodies are well 
known, see, e.g., Arnon et al., "Monoclonal Antibodies For Immunotargeting Of Drugs 
In Cancer Therapy", in Monoclonal Antibodies And Cancer Therapy, Reisfeld et al. 
(eds.), pp. 243-56 (Alan R. Liss, Inc. 1985); Hellstrom et al, "Antibodies For Drug 
5 Delivery", in Controlled Drug Delivery (2nd Ed.), Robinson et al. (eds.), pp. 623-53 
(Marcel Dekker, Inc. 1987); Thorpe, "Antibody Carriers Of Cytotoxic Agents In Cancer 
Therapy: A Review", in Monoclonal Antibodies '84: Biological And Clinical 
Applications, Pinchera et al. (eds.), pp. 475-506 (1985); "Analysis, Results, And Future 
Prospective Of The Therapeutic Use Of Radiolabeled Antibody In Cancer Therapy", in 

10 Monoclonal Antibodies For Cancer Detection And Therapy, Baldwin et al. (eds.), pp. 
303-16 (Academic Press 1985), and Thorpe et al., "The Preparation And Cytotoxic 
Properties Of Antibody-Toxin Conjugates", Immunol. Rev., 62:1 19-58 (1982). 

Alternatively, an antibody can be conjugated to a second antibody to form an 
antibody heteroconjugate as described by Segal in U.S. Patent No. 4,676,980. 

15 Accordingly, in one aspect, the invention provides substantially purified antibodies or 
fragments thereof, and non-human antibodies or fragments thereof, which antibodies or 
fragments specifically bind to a polypeptide comprising an amino acid sequence selected 
from the group consisting of the amino acid sequences of the present invention, an 
amino acid sequence encoded by the cDNA of the present invention, a fragment of at 

20 least 1 5 amino acid residues of an amino acid sequence of the present invention, an 
amino acid sequence which is at least 95% identical to the amino acid sequence of the 
present invention (wherein the percent identity is determined using the ALIGN program 
of the GCG software package with a PAM120 weight residue table, a gap length penalty 
of 12, and a gap penalty of 4) and an amino acid sequence which is encoded by a nucleic 

25 acid molecule which hybridizes to a nucleic acid molecule consisting of the nucleic acid 
molecules of the present invention, or a complement thereof, under conditions of 
hybridization of 6X SSC at 45°C and washing in 0.2 X SSC, 0.1% SDS at 65°C. In 
various embodiments, the substantially purified antibodies of the invention, or fragments 
thereof, can be human, non-human, chimeric and/or humanized antibodies. 

30 In another aspect, the invention provides non-human antibodies or fragments 

thereof, which antibodies or fragments specifically bind to a polypeptide comprising an 
amino acid sequence selected from the group consisting of: the amino acid sequence of 
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the present invention, an amino acid sequence encoded by the cDNA of the present 
invention, a fragment of at least 1 5 amino acid residues of the amino acid sequence of 
the present invention, an amino acid sequence which is at least 95% identical to the 
amino acid sequence of the present invention (wherein the percent identity is determined 
5 using the ALIGN program of the GCG software package with a PAM120 weight residue 
table, a gap length penalty of 12, and a gap penalty of 4) and an amino acid sequence 
which is encoded by a nucleic acid molecule which hybridizes to a nucleic acid 
molecule consisting of the nucleic acid molecules of the present invention, or a 
complement thereof, under conditions of hybridization of 6X SSC at 45 °C and washing 
10 in 0.2 X SSC, 0.1% SDS at 65°C. Such non-human antibodies can be goat, mouse, 

sheep, horse, chicken, rabbit, or rat antibodies. Alternatively, the non-human antibodies 
of the invention can be chimeric and/or humanized antibodies. In addition, the non- 
human antibodies of the invention can be polyclonal antibodies or monoclonal 
antibodies. 

15 In still a further aspect, the invention provides monoclonal antibodies or 

fragments thereof, which antibodies or fragments specifically bind to a polypeptide 
comprising an amino acid sequence selected from the group consisting of the amino acid 
sequences of the present invention, an amino acid sequence encoded by the cDNA of the 
present invention, a fragment of at least 1 5 amino acid residues of an amino acid 

20 sequence of the present invention, an amino acid sequence which is at least 95% 
identical to an amino acid sequence of the present invention (wherein the percent 
identity is determined using the ALIGN program of the GCG software package with a 
PAM120 weight residue table, a gap length penalty of 12, and a gap penalty of 4) and an 
amino acid sequence which is encoded by a nucleic acid molecule which hybridizes to a 

25 nucleic acid molecule consisting of the nucleic acid molecules of the present invention, 
or a complement thereof, under conditions of hybridization of 6X SSC at 45°C and 
washing in 0.2 X SSC, 0.1% SDS at 65°C. The monoclonal antibodies can be human, 
humanized, chimeric and/or non-human antibodies. 

The substantially purified antibodies or fragments thereof may specifically bind 

30 to a signal peptide, a secreted sequence, an extracellular domain, a transmembrane or a 
cytoplasmic domain or cytoplasmic membrane of a polypeptide of the invention. In a 
particularly preferred embodiment, the substantially purified antibodies or fragments 
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thereof, the non-human antibodies or fragments thereof, and/or the monoclonal 
antibodies or fragments thereof, of the invention specifically bind to a secreted sequence 
or an extrapellular domain of the amino acid sequences of the present invention. 

Any of the antibodies of the invention can be conjugated to a therapeutic moiety 
5 or to a detectable substance. Non-limiting examples of detectable substances that can be 
conjugated to the antibodies of the invention are an enzyme, a prosthetic group, a 
fluorescent material, a luminescent material, a bioluminescent material, and a 
radioactive material. 

The invention also provides a kit containing an antibody of the invention 

10 conjugated to a detectable substance, and instructions for use. Still another aspect of the 
invention is a pharmaceutical composition comprising an antibody of the invention and 
a pharmaceutical!/ acceptable carrier. In preferred embodiments, the pharmaceutical 
composition contains an antibody of the invention, a therapeutic moiety, and a 
pharmaceutically acceptable carrier. 

15 Still another aspect of the invention is a method of making an antibody that 

specifically recognizes a polypeptide of the present invention, the method comprising 
immunizing a mammal with a polypeptide. The polypeptide used as an immungen 
comprises an amino acid sequence selected from the group consisting of the amino acid 
sequence of the present invention, an amino acid sequence encoded by the cDNA of the 

20 nucleic acid molecules of the present invention, a fragment of at least 15 amino acid 
residues of the amino acid sequence of the present invention, an amino acid sequence 
which is at least 95% identical to the amino acid sequence of the present invention 
(wherein the percent identity is determined using the ALIGN program of the GCG 
software package with a PAM120 weight residue table, a gap length penalty of 12, and a 

25 gap penalty of 4) and an amino acid sequence which is encoded by a nucleic acid 
molecule which hybridizes to a nucleic acid molecule consisting of the nucleic acid 
molecules of the present invention, or a complement thereof, under conditions of 
hybridization of 6X SSC at 45°C and washing in 0.2 X SSC, 0.1% SDS at 65°C. 

After immunization, a sample is collected from the mammal that contains an 

30 antibody that specifically recognizes the polypeptide. Preferably, the polypeptide is 

recombinantly produced using a non-human host cell. Optionally, the antibodies can be 
further purified from the sample using techniques well known to those of skill in the art. 
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The method can further comprise producing a monoclonal antibody- producing cell from 
the cells of the mammal. Optionally, antibodies are collected from the 
antibody-producing cell. 

5 III Recombinant Expression Vectors and Host Cells 

Another aspect of the invention pertains to vectors, preferably expression vectors, 
containing a nucleic acid encoding a polypeptide corresponding to a marker of the 
invention (or a portion of such a polypeptide). As used herein, the term "vector" refers 
to a nucleic acid molecule capable of transporting another nucleic acid to which it has 

10 been linked. One type of vector is a "plasmid", which refers to a circular double 

stranded DNA loop into which additional DNA segments can be ligated. Another type 
of vector is a viral vector, wherein additional DNA segments can be ligated into the viral 
genome. Certain vectors are capable of autonomous replication in a host cell into which 
they are introduced {e.g., bacterial vectors having a bacterial origin of replication and 

15 episomal mammalian vectors). Other vectors {e.g. , non-episomal mammalian vectors) 
are integrated into the genome of a host cell upon introduction into the host cell, and 
thereby are replicated along with the host genome. Moreover, certain vectors, namely 
expression vectors, are capable of directing the expression of genes to which they are 
operably linked. In general, expression vectors of utility in recombinant DNA 

20 techniques are often in the form of plasmids (vectors). However, the invention is 

intended to include such other forms of expression vectors, such as viral vectors {e.g., 
replication defective retroviruses, a4enoviruses and adeno-associated viruses), which 
serve equivalent functions. 

The recombinant expression vectors of the invention comprise a nucleic acid of 

25 the invention in a form suitable for expression of the nucleic acid in a host cell. This 
means that the recombinant expression vectors include one or more regulatory 
sequences, selected on the basis of the host cells to be used for expression, which is 
operably linked to the nucleic acid sequence to be expressed. Within a recombinant 
expression vector, "operably linked" is intended to mean that the nucleotide sequence of 

30 interest is linked to the regulatory sequence(s) in a manner which allows for expression 
of the nucleotide sequence {e.g., in an in vitro transcription/translation system or in a 
host cell when the vector is introduced into the host cell). The term "regulatory 
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sequence" is intended to include promoters, enhancers and other expression control 
elements {e.g., polyadenylation signals). Such regulatory sequences are described, for 
example, in Goeddel, Methods in Enzymology: Gene Expression Technology vol. 185, 
Academic Press, San Diego, CA (1991). Regulatory sequences include those which 
5 direct constitutive expression of a nucleotide sequence in many types of host cell and 
those which direct expression of the nucleotide sequence only in certain host cells (e.g., 
tissue-specific regulatory sequences). It will be appreciated by those skilled in the art 
that the design of the expression vector can depend on such factors as the choice of the 
host cell to be transformed, the level of expression of protein desired, and the like. The 
10 expression vectors of the invention can be introduced into host cells to thereby produce 
proteins or peptides, including fusion proteins or peptides, encoded by nucleic acids as 
described herein. 

The recombinant expression vectors of the invention can be designed for 
expression of a polypeptide corresponding to a marker of the invention in prokaryotic 
1 5 (e.g., E. coli) or eukaryotic cells (e.g., insect cells {using baculovirus expression 

vectors}, yeast cells or mammalian cells). Suitable host cells are discussed further in 
Goeddel, supra. Alternatively, the recombinant expression vector can be transcribed 
and translated in vitro, for example using T7 promoter regulatory sequences and T7 
polymerase. 

20 Expression of proteins in prokaryotes is most often carried out in E. coli with 

vectors containing constitutive or inducible promoters directing the expression of either 
fusion or non-fiision proteins. Fusion vectors add a number of amino acids to a protein 
encoded therein, usually to the amino terminus of the recombinant protein. Such fusion 
vectors typically serve three purposes: 1) to increase expression of recombinant protein; 

25 2) to increase the solubility of the recombinant protein; and 3) to aid in the purification 
of the recombinant protein by acting as a ligand in affinity purification. Often, in fusion 
expression vectors, a proteolytic cleavage site is introduced at the junction of the fusion 
moiety and the recombinant protein to enable separation of the recombinant protein 
from the fusion moiety subsequent to purification of the fusion protein. Such enzymes, 

30 and their cognate recognition sequences, include Factor Xa, thrombin and enterokinase. 
Typical fusion expression vectors include pGEX (Pharmacia Biotech Inc; Smith and 
Johnson, 1988, Gene 67:31-40), pMAL (New England Biolabs, Beverly, MA) and 
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pRIT5 (Pharmacia, Piscataway, NJ) which fuse glutathione S-transferase (GST), 
maltose E binding protein, or protein A, respectively, to the target recombinant protein. 

Examples of suitable inducible non-fusion E. coli expression vectors include pTrc 
(Amann et aL, 1988, Gene 69:301-315) and pET lid (Studier et ai, p. 60-89, In Gene 
5 Expression Technology: Methods in Enzymology vol.l 85, Academic Press, San Diego, 
CA, 1 991). Target gene expression from the pTrc vector relies on host RNA 
polymerase transcription from a hybrid trp-lac fusion promoter. Target gene expression 
from the pET lid vector relies on transcription from a T7 gnlO-lac fusion promoter 
mediated by a co-expressed viral RNA polymerase (T7 gnl). This viral polymerase is 

10 supplied by host strains BL21(DE3) or HMS174(DE3) from a resident prophage 
harboring a T7 gnl gene under the transcriptional control of the lacUV 5 promoter. 

One strategy to maximize recombinant protein expression in E. coli is to express 
the protein in a host bacteria with an impaired capacity to proteolytically cleave the 
recombinant protein (Gottesman, p. 1 19-128, In Gene Expression Technology: Methods 

1 5 in Enzymology vol. 1 85, Academic Press, San Diego, CA, 1 990. Another strategy is to 
alter the nucleic acid sequence of the nucleic acid to be inserted into an expression 
vector so that the individual codons for each amino acid are those preferentially utilized 
in K coli (Wada et al> 1992, Nucleic Acids Res. 20:21 11-2118). Such alteration of 
nucleic acid sequences of the invention can be carried out by standard DNA synthesis 

20 techniques. 

In another embodiment, the expression vector is a yeast expression vector. 
Examples of vectors for expression in yeast S. cerevisiae include pYepSecl (Baldari et 
ai t 1987, EMBOJ. 6:229-234), pMFa (Kurjan and Herskowitz, 1982, Cell 30:933- 
943), pJRY88 (Schultz et aL, 1987, Gene 54:1 13-123), pYES2 (Invitrogen Corporation, 

25 San Diego, CA), and pPicZ (Invitrogen Corp, San Diego, CA). 

Alternatively, the expression vector is a baculovirus expression vector. 
Baculovirus vectors available for expression of proteins in cultured insect cells (e.g., Sf 
9 cells) include the pAc series (Smith et al, 1983, MoL Cell Biol 3:2156-2165) and the 
pVL series (Lucklow and Summers, 1989, Virology 170:31-39). 

30 In yet another embodiment, a nucleic acid of the invention is expressed in 

mammalian cells using a mammalian expression vector. Examples of mammalian 
expression vectors include pCDM8 (Seed, 1 987, Nature 329:840) and pMT2PC 
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(Kauftnan et al s 1987, EMBOJ. 6: 187- 195). When used in mammalian cells, the 
expression vector's control functions are often provided by viral regulatory elements. 
For example, commonly used promoters are derived from polyoma, Adenovirus 2, 
cytomegalovirus and Simian Virus 40. For other suitable expression systems for both 
5 prokaryotic and eukaryotic cells see chapters 16 and 17 of Sambrook et al, supra. 

In another embodiment, the recombinant mammalian expression vector is capable 
of directing expression of the nucleic acid preferentially in a particular cell type (e.g., 
tissue-specific regulatory elements are used to express the nucleic acid). Tissue-specific 
regulatory elements are known in the art. Non-limiting examples of suitable tissue- 

10 specific promoters include the albumin promoter (liver-specific; Pinkert et al., 1987, 
Genes Dev. 1:268-277), lymphoid-specific promoters (Calame and Eaton, 1988, A/v. 
Immunol 43:235-275), in particular promoters of T cell receptors (Winoto and 
Baltimore, 1989, EMBOJ. 8:729-733) and immunoglobulins (Banerji et al, 1983, Cell ' 
33:729-740; Queen and Baltimore, 1983, Cell 33:741-748), neuron-specific promoters 

15 (e.g., the neurofilament promoter; Byrne and Ruddle, 1 989, Proa Natl. Acad. Sci. USA 
86:5473-5477), pancreas-specific promoters (Edlund et al, 1985, Science 230:912-916), 
and mammary gland-specific promoters (e.g., milk whey promoter; U.S. Patent No. 
4,873,316 and European Application Publication No. 264,166). Developmentally- 
regulated promoters are also encompassed, for example the murine hox promoters 

20 (Kessel and Gruss, 1990, Science 249:374-379) and the a-fetoprotein promoter (Camper 
and Tilghman, 1 989, Genes Dev. 3 :537-546). 

The invention further provides a recombinant expression vector comprising a 
DNA molecule of the invention cloned into the expression vector in an antisense 
orientation. That is, the DNA molecule is operably linked to a regulatory sequence in a 

25 manner which allows for expression (by transcription of the DNA molecule) of an RNA 
molecule which is antisense to the mRNA encoding a polypeptide of the invention. 
Regulatory sequences operably linked to a nucleic acid cloned in the antisense 
orientation can be chosen which direct the continuous expression of the antisense RNA 
molecule in a variety of cell types, for instance viral promoters and/or enhancers, or 

30 regulatory sequences can be chosen which direct constitutive, tissue-specific or cell type 
specific expression of antisense RNA. The antisense expression vector can be in the 
form of a recombinant plasmid, phagemid, or attenuated virus in which antisense nucleic 
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acids are produced under the control of a high efficiency regulatory region, the activity 
of which can be determined by the cell type into which the vector is introduced. For a 
discussion of the regulation of gene expression using antisense genes see Weintraub et 
aL, 1986, Trends in Genetics, Vol. 1(1). 
5 Another aspect of the invention pertains to host cells into which a recombinant 

expression vector of the invention has been introduced. The terms "host cell" and 
"recombinant host cell" are used interchangeably herein. It is understood that such 
terms refer not only to the particular subject cell but to the progeny or potential progeny 
of such a cell. Because certain modifications may occur in succeeding generations due 
10 to either mutation or environmental influences, such progeny may not, in fact, be 
identical to the parent cell, but are still included within the scope of the term as used 
herein. 

A host cell can be any prokaryotic (e.g., E. coli) or eukaryotic cell (e.g., insect 
cells, yeast or mammalian cells). 

1 5 Vector DNA can be introduced into prokaryotic or eukaryotic cells via 

conventional transformation or transfection techniques. As used herein, the terms 
"transformation" and "transfection" are intended to refer to a variety of art-recognized 
techniques for introducing foreign nucleic acid into a host cell, including calcium 
phosphate or calcium chloride co-precipitation, DEAE-dextran-mediated transfection, 

20 lipofection, or electroporation. Suitable methods for transforming or transfecting host 
cells can be found in Sambrook, et al. (supra), and other laboratory manuals. 

For stable transfection of mammalian cells, it is known that, depending upon the 
expression vector and transfection technique used, only a small fraction of cells may 
integrate the foreign DNA into their genome. In order to identify and select these 

25 integrants, a gene that encodes a selectable marker (e.g., for resistance to antibiotics) is 
generally introduced into the host cells along with the gene of interest. Preferred 
selectable markers include those which confer resistance to drugs, such as G418, 
hygromycin and methotrexate. Cells stably transfected with the introduced nucleic acid 
can be identified by drug selection (e.g., cells that have incorporated the selectable 

30 marker gene will survive, while the other cells die). 
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A host cell of the invention, such as a prokaryotic or eukaryotic host cell in 
culture, can be used to produce a polypeptide corresponding to a marker of the 
invention. Accordingly, the invention further provides methods for producing a 
polypeptide corresponding to a marker of the invention using the host cells of the 
5 invention. In one embodiment, the method comprises culturing the host cell of 
invention (into which a recombinant expression vector encoding a polypeptide of the 
invention has been introduced) in a suitable medium such that the marker is produced. 
In another embodiment, the method further comprises isolating the marker polypeptide 
from the medium or the host cell. 

10 The host ceils of the invention can also be used to produce nonhuman transgenic 

animals. For example, in one embodiment, a host cell of the invention is a fertilized 
oocyte or an embryonic stem cell into which a sequences encoding a polypeptide 
corresponding to a marker of the invention have been introduced. Such host cells can 
then be used to create non-human transgenic animals in which exogenous sequences 

15 encoding a marker protein of the invention have been introduced into their genome or 
homologous recombinant animals in which endogenous gene(s) encoding a polypeptide 
corresponding to a marker of the invention sequences have been altered. Such animals 
are useful for studying the function and/or activity of the polypeptide corresponding to 
the marker and for identifying and/or evaluating modulators of polypeptide activity. As 

20 used herein, a "transgenic animal" is a non-human animal, preferably a mammal, more 
preferably a rodent such as a rat or mouse, in which one or more of the cells of the 
animal includes a transgene. Other examples of transgenic animals include non-human 
primates, sheep, dogs, cows, goats, chickens, amphibians, etc. A transgene is exogenous 
DNA which is integrated into the genome of a cell from which a transgenic animal 

25 develops and which remains in the genome of the mature animal, thereby directing the 
expression of an encoded gene product in one or more cell types or tissues of the 
transgenic animal. As used herein, an "homologous recombinant animal" is a non- 
human animal, preferably a mammal, more preferably a mouse, in which an endogenous 
gene has been altered by homologous recombination between the endogenous gene and 

30 an exogenous DNA molecule introduced into a cell of the animal, e.g., an embryonic 
cell of the animal, prior to development of the animal. 
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A transgenic animal of the invention can be created by introducing a nucleic acid 
encoding a polypeptide corresponding to a marker of the invention into the male 
pronuclei of a fertilized oocyte, e.g., by microinjection, retroviral infection, and allowing 
the oocyte to develop in a pseudopregnant female foster animal. Intronic sequences and 
5 polyadenylation signals can also be included in the transgene to increase the efficiency 
of expression of the transgene. A tissue-specific regulatory sequence(s) can be operably 
linked to the transgene to direct expression of the polypeptide of the invention to 
particular cells. Methods for generating transgenic animals via embryo manipulation 
and microinjection, particularly animals such as mice, have become conventional in the 

10 art and are described, for example, in U.S. Patent Nos. 4,736,866 and 4,870,009, U.S. 
Patent No. 4,873,191 and in Hogan, Manipulating the Mouse Embryo, Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1986. Similar methods are used for 
production of other transgenic animals. A transgenic founder animal can be identified 
based upon the presence of the transgene in its genome and/or expression of mRNA 

1 5 encoding the transgene in tissues or cells of the animals. A transgenic founder animal 
can then be used to breed additional animals carrying the transgene. Moreover, 
transgenic animals carrying the transgene can further be bred to other transgenic animals 
carrying other transgenes. 

To create an homologous recombinant animal, a vector is prepared which 

20 contains at least a portion of a gene encoding a polypeptide corresponding to a marker of 
the invention into which a deletion, addition or substitution has been introduced to 
thereby alter, e.g., functionally disrupt, the gene. In a preferred embodiment, the vector 
is designed such that, upon homologous recombination, the endogenous gene is 
functionally disrupted (/.£., no longer encodes a functional protein; also referred to as a 

25 "knock out" vector). Alternatively, the vector can be designed such that, upon 

homologous recombination, the endogenous gene is mutated or otherwise altered but 
still encodes functional protein (e.g., the upstream regulatory region can be altered to 
thereby alter the expression of the endogenous protein). In the homologous 
recombination vector, the altered portion of the gene is flanked at its 5' and 3 f ends by 

30 additional nucleic acid of the gene to allow for homologous recombination to occur 
between the exogenous gene carried by the vector and an endogenous gene in an 
embryonic stem cell. The additional flanking nucleic acid sequences are of sufficient 
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length for successful homologous recombination with the endogenous gene. Typically, 
several kilobases of flanking DNA (both at the 5 ! and 3 T ends) are included in the vector 
(see, e.g., Thomas and Capecchi, 1987, Cell 51:503 for a description of homologous 
recombination vectors). The vector is introduced into an embryonic stem cell line (e.g. , 
5 by electroporation) and cells in which the introduced gene has homologously 
recombined with the endogenous gene are selected (see, e.g., Li et al, 1992, Cell 
69:915). The selected cells are then injected into a blastocyst of an animal (e.g., a 
mouse) to form aggregation chimeras (see, e.g., Bradley, Teratocarcinomas and 
Embryonic Stem Cells: A Practical Approach, Robertson, Ed., IRL, Oxford, 1987, pp. 

10 1 13-152). A chimeric embryo can then be implanted into a suitable pseudopregnant 
female foster animal and the embryo brought to term. Progeny harboring the 
homologously recombined DNA in their germ cells can be used to breed animals in 
which all cells of the animal contain the homologously recombined DNA by germline 
transmission of the transgene. Methods for constructing homologous recombination 

15 vectors and homologous recombinant animals are described further in Bradley (1991) 
Current Opinion in Bio/Technology 2:823-829 and in PCT Publication NOS. WO 
90/1 1354, WO 91/01 140, WO 92/0968, and WO 93/04169. 

In another embodiment, transgenic non-human animals can be produced which 
contain selected systems which allow for regulated expression of the transgene. One 

20 example of such a system is the cre/loxP recombinase system of bacteriophage PI. For 
a description of the cre/loxP recombinase system, see, e.g., Lakso et al. (1992) Proc. 
Natl Acad. Set USA 89:6232-6236. Another example of a recombinase system is the 
FLP recombinase system of Saccharomyces cerevisiae (O'Gorman etal, 1991, Science 
251:1351-13 55). If a cre/loxP recombinase system is used to regulate expression of the 

25 transgene, animals containing transgenes encoding both the Cre recombinase and a 

selected protein are required. Such animals can be provided through the construction of 
"double" transgenic animals, e.g., by mating two transgenic animals, one containing a 
transgene encoding a selected protein and the other containing a transgene encoding a 
recombinase. 

30 Clones of the non-human transgenic animals described herein can also be 

produced according to the methods described in Wilmut et al. (1997) Nature 385:810- 
813 and PCT Publication NOS. WO 97/07668 and WO 97/07669. 
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IV. Pharmaceutical Compositions 

The nucleic acid molecules, polypeptides, and antibodies (also referred to herein 
as "active compounds") corresponding to a marker of the invention can be incorporated 

5 into pharmaceutical compositions suitable for administration. Such compositions 
typically comprise the nucleic acid molecule, protein, or antibody and a 
pharmaceutically acceptable carrier. As used herein the language "pharmaceutically 
acceptable carrier" is intended to include any and all solvents, dispersion media, 
coatings, antibacterial and antifungal agents, isotonic and absorption delaying agents, 

10 and the like, compatible with pharmaceutical administration. The use of such media and 
agents for pharmaceutically active substances is well known in the art. Except insofar as 
any conventional media or agent is incompatible with the active compound, use thereof 
in the compositions is contemplated. Supplementary active compounds can also be 
incorporated into the compositions. 

1 5 The invention includes methods for preparing pharmaceutical compositions for 

modulating the expression or activity of a polypeptide or nucleic acid corresponding to a 
marker of the invention. Such methods comprise formulating a pharmaceutically 
acceptable carrier with an agent which modulates expression or activity of a polypeptide 
or nucleic acid corresponding to a marker of the invention. Such compositions can 

20 further include additional active agents. Thus, the invention further includes methods 
for preparing a pharmaceutical composition by formulating a pharmaceutically 
acceptable carrier with an agent which modulates expression or activity of a polypeptide 
or nucleic acid corresponding to a marker of the invention and one or more additional 
active compounds. 

25 The invention also provides methods (also referred to herein as "screening 

assays") for identifying modulators, i.e., candidate or test compounds or agents (e.g., 
peptides, peptidomimetics, peptoids, small molecules or other drugs) which (a) bind to 
the marker, or (b) have a modulatory (e.g., stimulatory or inhibitory) effect on the 
activity of the marker or, more specifically, (c) have a modulatory effect on the 

30 interactions of the marker with one or more of its natural substrates (e.g., peptide, 
protein, hormone, co-factor, or nucleic acid), or (d) have a modulatory effect on the 
expression of the marker. Such assays typically comprise a reaction between the marker 
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and one or more assay components. The other components may be either the test 
compound itself, or a combination of test compound and a natural binding partner of the 
marker. 

The test compounds of the present invention may be obtained from any available 
5 source, including systematic libraries of natural and/or synthetic compounds. Test 
compounds may also be obtained by any of the numerous approaches in combinatorial 
library methods known in the art, including: biological libraries; peptoid libraries 
(libraries of molecules having the functionalities of peptides, but with a novel, non- 
peptide backbone which are resistant to enzymatic degradation but which nevertheless 

10 remain bioactive; see, e.g., Zuckermann et al, 1994, J. Med. Chem. 37:2678-85); 
spatially addressable parallel solid phase or solution phase libraries; synthetic library 
methods requiring deconvolution; the 'one-bead one-compound* library method; and 
synthetic library methods using affinity chromatography selection. The biological 
library and peptoid library approaches are limited to peptide libraries, while the other 

15 four approaches are applicable to peptide, non-peptide oligomer or small molecule 
libraries of compounds (Lam, 1997 \ Anticancer Drug Des. 12:145). 

Examples of methods for the synthesis of molecular libraries can be found in the 
art, for example in: DeWitt et al (1993) Proa Natl Acad ScL U.S.A. 90:6909; Erb et 
al. (1994) Prop. Natl Acad ScL USA 91:11422; Zuckermann etal (1994). J. Med. 

20 Chem. 37:2678; Cho et al (1993) Science 261 :1303; Carrell et al (1994) Angew. Chem. 
Int, Ed Engl 33:2059; Carell et al (1994) Angew. Chem. Int. Ed. Engl. 33:2061; and in 
Gallop et al (1994) J. Med Chem. 37: 1233. 

Libraries of compounds may be presented in solution (e.g., Houghten, 1992, 
Biotechniques 13:412-421), or on beads (Lam, 1991, Nature 354:82-84), chips (Fodor, 

25 1 993, Nature 364:555-556), bacteria and/or spores, (Ladner, USP 5,223,409), plasmids 
(Cull et al, 1992, Proc Natl Acad Sci USA 89:1865-1869) or on phage (Scott and Smith, 
1990, Science 249:386-390; Devlin, 1*990, Science 249:404-406; Cwirla et al, 1 990, 
Proc. Natl Acad ScL 87:6378-6382; Felici, 1991,7. Mol Biol 222:301-310; Ladner, 
supra.). 

30 In one embodiment, the invention provides assays for screening candidate or test 

compounds which are substrates of a marker or biologically active portion thereof. In 
another embodiment, the invention provides assays for screening candidate or test 
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compounds which bind to a marker or biologically active portion thereof. Determining 
the ability of the test compound to directly bind to a marker can be accomplished, for 
example, by coupling the compound with a radioisotope or enzymatic label such that 
binding of the compound to the marker can be determined by detecting the labeled 
5 marker compound in a complex. For example, compounds (e.g., marker substrates) can 
be labeled with I, S, C, or H, either directly or indirectly, and the radioisotope 
detected by direct counting of radioemission or by scintillation counting. Alternatively, 
assay components can be enzymatically labeled with, for example, horseradish 
peroxidase, alkaline phosphatase, or luciferase, and the enzymatic label detected by 

10 determination of conversion of an appropriate substrate to product. 

In another embodiment, the invention provides assays for screening candidate or 
test compounds which modulate the activity of a marker or a biologically active portion 
thereof. In all likelihood, the marker can, in vivo, interact with one or more molecules, 
such as but not limited to, peptides, proteins, hormones, cofactors and nucleic acids. For 

15 the purposes of this discussion, such cellular and extracellular molecules are referred to 
herein as "binding partners" or marker "substrate". 

One necessary embodiment of the invention in order to facilitate such screening 
is the use of the marker to identify its natural in vivo binding partners. There are many 
ways to accomplish this which are known to one skilled in the art. One example is the 

20 use of the marker protein as "bait protein" in a two-hybrid assay or three-hybrid assay 
(see, e.g., U.S. Patent No. 5,283,317; Zervos et al 9 1993, Cell 72:223-232; Madura et al, 
1993, X Biol Chem. 268:12046-12054; Bartel e/*/.,1993, Biotechniques 14:920-924; 
Iwabuchi et al 9 1993 Oncogene 8:1693-1696; Brent WO94/10300) in order to identify 
other proteins which bind to or interact with the marker (binding partners) and, 

25 therefore, are possibly involved in the natural function of the marker. Such marker 
binding partners are also likely to be involved in the propagation of signals by the 
marker or downstream elements of a marker-mediated signaling pathway. Alternatively, 
such marker binding partners may also be found to be inhibitors of the marker. 

The two-hybrid system is based on the modular nature of most transcription 

30 factors, which consist of separable DNA-binding and activation domains. Briefly, the 
assay utilizes two different DNA constructs. In one construct, the gene that encodes a 
marker protein fused to a gene encoding the DNA binding domain of a known 
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transcription factor (e.g., GAL-4). In the other construct, a DNA sequence, from a 
library of DNA sequences, that encodes an unidentified protein ("prey" or "sample") is 
fused to a gene that codes for the activation domain of the known transcription factor. If 
the "bait" and the "prey" proteins are able to interact, in vivo, forming a marker- 
5 dependent complex, the DNA-binding and activation domains of the transcription factor 
are brought into close proximity. This proximity allows transcription of a reporter gene 
(e.g., LacZ) which is operably linked to a transcriptional regulatory site responsive to 
the transcription factor. Expression of the reporter gene can be readily detected and cell 
colonies containing the functional transcription factor can be isolated and used to obtain 

1 0 the cloned gene which encodes the protein which interacts with the marker protein. 

In a further embodiment, assays may be devised through the use of the invention 
for the purpose of identifying compounds which modulate (e.g., affect either positively 
or negatively) interactions between a marker and its substrates and/or binding partners. 
Such compounds can include, but are not limited to, molecules such as antibodies, 

15 peptides, hormones, oligonucleotides, nucleic acids, and analogs thereof. Such 
compounds may also be obtained from any available source, including systematic 
libraries of natural and/or synthetic compounds. The preferred assay components for use 
in this embodiment is an breast cancer marker identified herein, the known binding 
partner and/or substrate of same, and the test compound. Test compounds can be 

20 supplied from any source. 

The basic principle of the assay systems used to identify compounds that interfere 
with the interaction between the marker and its binding partner involves preparing a 
reaction mixture containing the marker and its binding partner under conditions and for 
a time sufficient to allow the two products to interact and bind, thus forming a complex. 

25 In order to test an agent for inhibitory activity, the reaction mixture is prepared in the 
presence and absence of the test compound. The test compound can be initially included 
in the reaction mixture, or can be added at a time subsequent to the addition of the 
marker and its binding partner. Control reaction mixtures are incubated without the test 
compound or with a placebo. The formation of any complexes between the marker and 

30 its binding partner is then detected. The formation of a complex in the % control reaction, 
but less or no such formation in the reaction mixture containing the test compound, 
indicates that the compound interferes with the interaction of the marker and its binding 
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partner. Conversely, the formation of more complex in the presence of compound than 
in the control reaction indicates that the compound may enhance interaction of the 
marker and its binding partner. 

The assay for compounds that interfere with the interaction of the marker with its 
5 binding partner may be conducted in a heterogeneous or homogeneous format. 

Heterogeneous assays involve anchoring either the marker or its binding partner onto a 
solid phase and detecting complexes anchored to the solid phase at the end of the 
reaction. In homogeneous assays, the entire reaction is carried out in a liquid phase. In 
either approach, the order of addition of reactants can be varied to obtain different 

10 information about the compounds being tested. For example, test compounds that 
interfere with the interaction between the markers and the binding partners (e.g., by 
competition) can be identified by conducting the reaction in the presence of the test 
substance, i.e., by adding the test substance to the reaction mixture prior to or 
simultaneously with the marker and its interactive binding partner. Alternatively, test 

1 5 compounds that disrupt preformed complexes, e.g., compounds with higher binding 

constants that displace one of the components from the complex, can be tested by adding 
the test compound to the reaction mixture after complexes have been formed. The 
various formats are briefly described below. 

In a heterogeneous assay system, either the marker or its binding partner is 

20 anchored onto a solid surface or matrix, while the other corresponding non-anchored 
component may be labeled, either directly or indirectly. In practice, microtitre plates are 
often utilized for this approach. The anchored species can be immobilized by a number 
of methods, either non-covalent or covalent, that are typically well known to one who 
practices the art. Non-covalent attachment can often be accomplished simply by coating 

25 the solid surface with a solution of the marker or its binding partner and drying. 

Alternatively, an immobilized antibody specific for the assay component to be anchored 
can be used for this purpose. Such surfaces can often be prepared in advance and stored. 

In related embodiments, a fusion protein can be provided which adds a domain 
that allows one or both of the assay components to be anchored to a matrix. For 

30 example, glutathione-S-transferase/marker fusion proteins or glutathione-S- 

transferase/binding partner can be adsorbed onto glutathione sepharose beads (Sigma 
Chemical, St. Louis, MO) or glutathione derivatized microtiter plates, which are then 
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combined with the test compound or the test compound and either the non-adsorbed 
marker or its binding partner, and the mixture incubated under conditions conducive to 
complex formation (e.g., physiological conditions). Following incubation, the beads or 
microtiter plate wells are washed to remove any unbound assay components, the 
5 immobilized complex assessed either directly or indirectly, for example, as described 
above. Alternatively, the complexes can be dissociated from the matrix, and the level of 
marker binding or activity determined using standard techniques. 

Other techniques for immobilizing proteins on matrices can also be used in the 
screening assays of the invention. For example, either a marker or a marker binding 

10 partner can be immobilized utilizing conjugation of biotin and streptavidin. Biotinylated 
marker protein or target molecules can be prepared from biotin-NHS (N-hydroxy- 
succinimide) using techniques known in the art (e.g., biotinylation kit, Pierce Chemicals, 
Rockford, IL), and immobilized in the wells of streptavidin-coated 96 well plates (Pierce 
Chemical). In certain embodiments, the protein-immobilized surfaces can be prepared in 

1 5 advance and stored. 

In order to conduct the assay, the corresponding partner of the immobilized assay 
component is exposed to the coated surface with or without the test compound. After 
the reaction is complete, unreacted assay components are removed (e.g., by washing) 
and any complexes formed will remain immobilized on the solid surface. The detection 

20 of complexes anchored on the solid surface can be accomplished in a number of ways. 
Where the non-immobilized component is pre-labeled, the detection of label 
immobilized on the surface indicates that complexes were formed. Where the non- 
immobilized component is not pre-labeled, an indirect label can be used to detect 
complexes anchored on the surface; e.g., using a labeled antibody specific for the 

25 initially non-immobilized species (the antibody, in turn, can be directly labeled or 
indirectly labeled with, e.g., a labeled anti-Ig antibody). Depending upon the order of 
addition of reaction components, test compounds which modulate (inhibit or enhance) 
complex formation or which disrupt preformed complexes can be detected. 

In an alternate embodiment of the invention, a homogeneous assay may be used. 

30 This is typically a reaction, analogous to those mentioned above, which is conducted in 
a liquid phase in the presence or absence of the test compound. The formed complexes 
are then separated from unreacted components, and the amount of complex formed is 
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determined. As mentioned for heterogeneous assay systems, the order of addition of 
reactants to the liquid phase can yield information about which test compounds 
modulate (inhibit or enhance) complex formation and which disrupt preformed 
complexes. 

5 In such a homogeneous assay, the reaction products may be separated from 

unreacted assay components by any of a number of standard techniques, including but 
not limited to: differential centrifugation, chromatography, electrophoresis and 
immunoprecipitation. In differential centrifugation, complexes of molecules may be 
separated from uncomplexed molecules through a series of centrifugal steps, due to the 

1 0 different sedimentation equilibria of complexes based on their different sizes and 
densities (see, for example, Rivas, G., and Minton, A.P., Trends Biochem Sci 1993 
Aug;18(8):284-7). Standard chromatographic techniques may also be utilized to separate 
complexed molecules from uncomplexed ones. For example, gel filtration 
chromatography separates molecules based on size, and through the utilization of an 

1 5 appropriate gel filtration resin in a column format, for example, the relatively larger 
complex may be separated from the relatively smaller uncomplexed components. 
Similarly, the relatively different charge properties of the complex as compared to the 
uncomplexed molecules may be exploited to differentially separate the complex from 
the remaining individual reactants, for example through the use of ion-exchange 

20 chromatography resins. Such resins and chromatographic techniques are well known to 
one skilled in the art (see, e.g., Heegaard, 1998, JMol Recognit 1 1:141-148; Hage and 
Tweed, 1997, J. Chromatogr. B. Biomed. Sci AppL, 699:499-525). Gel electrophoresis 
may also be employed to separate complexed molecules from unbound species (see, 
e.g., Ausubel et al (eds.), In: Current Protocols in Molecular Biology, J. Wiley & Sons, 

25 New York. 1 999). In this technique, protein or nucleic acid complexes are separated 

based on size or charge, for example. In order to maintain the binding interaction during . 
the electrophoretic process, non-denaturing gels in the absence of reducing agent are 
typically preferred, but conditions appropriate to the particular interactants will be well 
known to one skilled in the art. Immunoprecipitation is another common technique 

30 utilized for the isolation of a protein-protein complex from solution (see, e.g., Ausubel et 
al (eds.), In: Current Protocols in Molecular Biology, J. Wiley & Sons, New York. 
1 999). In this technique, all proteins binding to an antibody specific to one of the 
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binding molecules are precipitated from solution by conjugating the antibody to a 
polymer bead that may be readily collected by centrifugation. The bound assay 
components are released from the beads (through a specific proteolysis event or other 
technique well known in the art which will not disturb the protein-protein interaction in 
5 the complex), and a second immunoprecipitation step is performed, this time utilizing 
antibodies specific for the correspondingly different interacting assay component. In this 
manner, only formed complexes should remain attached to the beads. Variations in 
complex formation in both the presence and the absence of a test compound can be 
compared, thus offering information about the ability of the compound to modulate 

1 0 interactions between the marker and its binding partner. 

Also within the scope of the present invention are methods for direct detection of 
interactions between the marker and its natural binding partner and/or a test compound 
in a homogeneous or heterogeneous assay system without further sample manipulation. 
For example, the technique of fluorescence energy transfer may be utilized (see, e.g., 

15 Lakowicz et al y U.S. Patent No. 5,631,169; Stavrianopoulos etal, U.S. Patent No. 

4,868,103). Generally, this technique involves the addition of a fluorophore label on a 
first 'donor' molecule (e.g., marker or test compound) such that its emitted fluorescent 
energy will be absorbed by a fluorescent label on a second, 'acceptor' molecule (e.g., 
marker or test compound), which in turn is able to fluoresce due to the absorbed energy. 

20 Alternately, the "donor 5 protein molecule may simply utilize the natural fluorescent 
energy of tryptophan residues. Labels are chosen that emit different wavelengths of 
light, such that the 'acceptor* molecule label may be differentiated from that of the 
'donor'. Since the efficiency of energy transfer between the labels is related to the 
distance separating the molecules, spatial relationships between the molecules can be 

25 assessed. In a situation in which binding occurs between the molecules, the fluorescent 
emission of the 'acceptor' molecule label in the assay should be maximal. An FET 
binding event can be conveniently measured through standard fluorometric detection 
means well known in the art (e.g., using a fluorimeter). A test substance which either 
enhances or hinders participation of one of the species in the preformed complex will 

30 result in the generation of a signal variant to that of background. In this way, test 

substances that modulate interactions between a marker and its binding partner can be 
identified in controlled assays. 
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In another embodiment, modulators of marker expression are identified in a 
method wherein a cell is contacted with a candidate compound and the expression of 
mRNA or protein, corresponding to a marker in the cell, is determined. The level of 
expression of mRNA or protein in the presence of the candidate compound is compared 
5 to the level of expression of mRNA or protein in the absence of the candidate 

compound. The candidate compound can then be identified as a modulator of marker 
expression based on this comparison. For example, when expression, of marker mRNA 
or protein is greater (statistically significantly greater) in the presence of the candidate 
compound than in its absence, the candidate compound is identified as a stimulator of 

10 marker mRNA or protein expression. Conversely, when expression of marker mRNA 
or protein is less (statistically significantly less) in the presence of the candidate 
compound than in its absence, the candidate compound is identified as an inhibitor of 
. marker mRNA or protein expression. The level of marker mRNA or protein expression 
in the cells can be determined by methods described herein for detecting marker mRNA 

15 or protein. 

In another aspect, the invention pertains to a combination of two or more of the 
assays described herein. For example, a modulating agent can be identified using a cell- 
based or a cell free assay, and the ability of the agent to modulate the activity of a 
marker protein can be further confirmed in vivo, e.g., in a whole animal model for 

20 cellular transformation and/or tumorigenesis. 

This invention further pertains to novel agents identified by the above-described 
screening assays. Accordingly, it is within the scope of this invention to further use an 
agent identified as described herein in an appropriate animal model. For example, an 
agent identified as described herein (e.g., an marker modulating agent, an antisense 

25 marker nucleic acid molecule, an marker-specific antibody, or an marker-binding 
partner) can be used in an animal model to determine the efficacy, toxicity, or side 
effects of treatment with such an agent. Alternatively, an agent identified as described 
herein can be used in an animal model to determine the mechanism of action of such an 
agent. Furthermore, this invention pertains to uses of novel agents identified by the 

30 above-described screening assays for treatments as described herein. 
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It is understood that appropriate doses of small molecule agents and protein or 
polypeptide agents depends upon a number of factors within the knowledge of the 
ordinarily skilled physician, veterinarian, or researcher. The dose(s) of these agents will 
vary, for example, depending upon the identity, size, and condition of the subject or 

5 sample being treated, further depending upon the route by which the composition is to 
be administered, if applicable, and the effect which the practitioner desires the agent to ' 
have upon the nucleic acid or polypeptide of the invention. Exemplary doses of a small 
molecule include milligram or microgram amounts per kilogram of subject or sample 
weight (e.g. about 1 microgram per kilogram to about 500 milligrams per kilogram, 

10 about 100 micrograms per kilogram to about 5 milligrams per kilogram, or about 1 
microgram per kilogram to about 50 micrograms per kilogram). Exemplary doses of a 
protein or polypeptide include gram, milligram or microgram amounts per kilogram of 
subject or sample weight (e.g. about 1 microgram per kilogram to about 5 grams per 
kilogram, about 100 micrograms per kilogram to about 500 milligrams per kilogram, or 

15 about 1 milligram per kilogram to about 50 milligrams per kilogram). It is furthermore 
understood that appropriate doses of one of these agents depend upon the potency of the 
agent with respect to the expression or activity to be modulated. Such appropriate doses 
can be determined using the assays described herein. When one or more of these agents 
is to be administered to an animal (e.g. a human) in order to modulate expression or 

20 activity of a polypeptide or nucleic acid of the invention, a physician, veterinarian, or 
researcher can, for example, prescribe a relatively low dose at first, subsequently 
increasing the dose until an appropriate response is obtained. In addition, it is 
understood that the specific dose level for any particular animal subject will depend 
upon a variety of factors including the activity of the specific agent employed, the age, 

25 body weight, general health, gender, and diet of the subject, the time of administration, 
the route of administration, the rate of excretion, any drug combination, and the degree 
of expression or activity to be modulated. 

A pharmaceutical composition of the invention is formulated to be compatible 
with its intended route of administration. Examples of routes of administration include 

30 parenteral, e.g., intravenous, intradermal, subcutaneous, oral (e.g., inhalation), 

transdermal (topical), transmucosal, and rectal administration. Solutions or suspensions 
used for parenteral, intradermal, or subcutaneous application can include the following 
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components: a sterile diluent such as water for injection, saline solution, fixed oils, 
polyethylene glycols, glycerine, propylene glycol or other synthetic solvents; 
antibacterial agents such as benzyl alcohol or methyl parabens; antioxidants such as 
ascorbic acid or sodium bisulfite; chelating agents such as ethylenediamine-tetraacetic 
5 acid; buffers such as acetates, citrates or phosphates and agents for the adjustment of 
tonicity such as sodium chloride or dextrose. pH can be adjusted with acids or bases, 
such as hydrochloric acid or sodium hydroxide. The parenteral preparation can be 
enclosed in ampules, disposable syringes or multiple dose vials made of glass or plastic. 
Pharmaceutical compositions suitable for injectable use include sterile aqueous 

10 solutions (where water soluble) or dispersions and sterile powders for the 

extemporaneous preparation of sterile injectable solutions or dispersions. For 
intravenous administration, suitable carriers include physiological saline, bacteriostatic 
water, Cremophor EL (BASF; Parsippany, NJ) or phosphate buffered saline (PBS). In 
all cases, the composition must be sterile and should be fluid to the extent that easy 

1 5 syringability exists. It must be stable under the conditions of manufacture and storage 
and must be preserved against the contaminating action of microorganisms such as 
bacteria and fungi. The carrier can be a solvent or dispersion medium containing, for 
example, water, ethanol, polyol (for example, glycerol, propylene glycol, and liquid 
polyethylene glycol, and the like), and suitable mixtures thereof. The proper fluidity can 

20 be maintained, for example, by the use of a coating such as lecithin, by the maintenance 
of the required particle size in the case of dispersion and by the use of surfactants. 
Prevention of the action of microorganisms can be achieved by various antibacterial and 
antifungal agents, for example, parabens, chlorobutanol, phenol, ascorbic acid, 
thimerosal, and the like. In many cases, it will be preferable to include isotonic agents, 

25 for example, sugars, polyalcohols such as mannitol, sorbitol, or sodium chloride in the 
composition. Prolonged absorption of the injectable compositions can be brought about 
by including in the composition an agent which delays absorption, for example, 
aluminum monostearate and gelatin. 

Sterile injectable solutions can be prepared by incorporating the active compound 

30 (e.g., a polypeptide or antibody) in the required amount in an appropriate solvent with 
one or a combination of ingredients enumerated above, as required, followed by filtered 
sterilization. Generally, dispersions are prepared by incorporating the active compound 
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into a sterile vehicle which contains a basic dispersion medium, and then incorporating 
the required other ingredients from those enumerated above. In the case of sterile 
powders for the preparation of sterile injectable solutions, the preferred methods of 
preparation are vacuum drying and freeze-drying which yields a powder of the active 
5 ingredient plus any additional desired ingredient from a previously sterile-filtered 
solution thereof. 

Oral compositions generally include an inert diluent or an edible carrier. They 
can be enclosed in gelatin capsules or compressed into tablets. For the purpose of oral 
therapeutic administration, the active compound can be incorporated with excipients and 

10 used in the form of tablets, troches, or capsules. Oral compositions can also be prepared 
using a fluid carrier for use as a mouthwash, wherein the compound in the fluid carrier is 
applied orally and swished and expectorated or swallowed. 

Phannaceutically compatible binding agents, and/or adjuvant materials can be 
included as part of the composition. The tablets, pills, capsules, troches, and the like can 

1 5 contain any of the following ingredients, or compounds of a similar nature: a binder 

such as microcrystalline cellulose, gum tragacanth or gelatin; an excipient such as starch 
or lactose, a disintegrating agent such as alginic acid, Primogel, or corn starch; a 
lubricant such as magnesium stearate or Sterotes; a glidant such as colloidal silicon 
dioxide; a sweetening agent such as sucrose or saccharin; or a flavoring agent such as 

20 peppermint, methyl salicylate, or orange flavoring. 

For administration by inhalation, the compounds are delivered in the form of an 
aerosol spray from a pressurized container or dispenser which contains a suitable 
propellant, e.g., a gas such as carbon dioxide, or a nebulizer. 

Systemic administration can also be by transmucosal or transdermal means. For 

25 transmucosal or transdermal administration, penetrants appropriate to the barrier to be 
permeated are used in the formulation. Such penetrants are generally known in the art, 
and include, for example, for transmucosal administration, detergents, bile salts, and 
fusidic acid derivatives. Transmucosal administration can be accomplished through the 
use of nasal sprays or suppositories. For transdermal administration, the active 

30 compounds are formulated into ointments, salves, gels, or creams as generally known in 
the art 
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The compounds can also be prepared in the form of suppositories (e.g., with 
conventional suppository bases such as cocoa butter and other glycerides) or retention 
enemas for rectal delivery. 

In one embodiment, the active compounds are prepared with carriers that will 
5 protect the compound against rapid elimination from the body, such as a controlled 
release formulation, including implants and microencapsulated delivery systems. 
Biodegradable, biocompatible polymers can be used, such as ethylene vinyl acetate, 
polyanhydrides, polyglycolic acid, collagen, polyorthoesters, and polylactic acid. 
Methods for preparation of such formulations will be apparent to those skilled in the art. 

10 The materials can also be obtained commercially from Alza Corporation and Nova 

Pharmaceuticals, Inc. Liposomal suspensions (including liposomes having monoclonal 
antibodies incorporated therein or thereon) can also be used as pharmaceutically 
acceptable carriers. These can be prepared according to methods known to those skilled 
in the art, for example, as described in U.S^ Patent No. 4,522,81 1 . 

15 It is especially advantageous to formulate oral or parenteral compositions in 

dosage unit form for ease of administration and uniformity of dosage. Dosage unit form 
as used herein refers to physically discrete units suited as unitary dosages for the subject 
to be treated; each unit containing a predetermined quantity of active compound 
calculated to produce the desired therapeutic effect in association with the required 

20 pharmaceutical carrier. The specification for the dosage unit forms of the invention are 
dictated by and directly dependent on the unique characteristics of the active compound 
and the particular therapeutic effect to be achieved, and the limitations inherent in the art 
of compounding such an active compound for the treatment of individuals. 

For antibodies, the preferred dosage is 0.1 mg/kg to 100 mg/kg of body weight 

25 (generally 10 mg/kg to 20 mg/kg). If the antibody is to act in the brain, a dosage of 50 
mg/kg to 100 mg/kg is usually appropriate. Generally, partially human antibodies and 
fully human antibodies have a longer half-life within the human body than other 
antibodies. Accordingly, lower dosages and less frequent administration is often 
possible. Modifications such as lipidation can be used to stabilize antibodies and to 

30 enhance uptake and tissue penetration (e.g. , into the breast epithelium). A method for 
lipidation of antibodies is described by Cruikshank et al. (1997) J. Acquired Immune 
Deficiency Syndromes and Human Retrovirology 1 4: 1 93. 
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The nucleic acid molecules corresponding to a marker of the invention can be 
inserted into vectors and used as gene therapy vectors. Gene therapy vectors can be 
delivered to a subject by, for example, intravenous injection, local administration (U.S. 
Patent 5,328,470), or by stereotactic injection (see, e.g., Chen et ai, 1994, Proc. Natl 
5 Acad Sci. USA 91 :3054-3O57). The pharmaceutical preparation of the gene therapy 
vector can include the gene therapy vector in an acceptable diluent, or can comprise a 
slow release matrix in which the gene delivery vehicle is imbedded. Alternatively, 
where the complete gene delivery vector can be produced intact from recombinant cells, 
e.g. retroviral vectors, the pharmaceutical preparation can include one or more cells 
1 0 which produce the gene delivery system. 

The pharmaceutical compositions can be included in a container, pack, or 
dispenser together with instructions for administration. 

V. Electronic Apparatus Readable Media and Arrays 

1 5 Electronic apparatus readable media comprising a breast cancer marker of the 

present invention is also provided. As used herein, "electronic apparatus readable 
media" refers to any suitable medium for storing, holding or containing data or 
information that can be read and accessed directly by an electronic apparatus. Such 
media can include, but are not limited to: magnetic storage media, such as floppy discs, 

20 hard disc storage medium, and magnetic tape; optical storage media such as compact 
disc; electronic storage media such as RAM, ROM, EPROM, EEPROM and the like; 
general hard disks and hybrids of these categories such as magnetic/optical storage 
media. The medium is adapted or configured for having recorded thereon a marker of 
the present invention. 

25 As used herein, the term "electronic apparatus" is intended to include any suitable 

computing or processing apparatus or other device configured or adapted for storing 
data or information. Examples of electronic apparatus suitable for use with the present 
invention include stand-alone computing apparatus; networks, including a local area 
network (LAN), a wide area network (WAN) Internet, Intranet, and Extranet; electronic 

30 appliances such as a personal digital assistants (PDAs), cellular phone, pager and the 
like; and local and distributed processing systems. 
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As used herein, "recorded" refers to aprocess for storing or encoding information 
on the electronic apparatus readable medium. Those skilled in the art can readily adopt 
any of the presently known methods for recording information on known media to 
generate manufactures comprising the markers of the present invention. 

5 A variety of software programs and formats can be used to store the marker 

information of the present invention on the electronic apparatus readable medium. For 
example, the nucleic acid sequence corresponding to the markers can be represented in a 
word processing text file, formatted in commercially-available software such as 
WordPerfect and MicroSoft Word, or represented in the form of an ASCII file, stored in 

1 0 a database application, such as DB2, Sybase, Oracle, or the like, as well as in other 

forms. Any number of dataprocessor structuring formats (e.g., text file or database) may 
be employed in order to obtain or create a medium having recorded thereon the markers 
of the present invention. 

By providing the markers of the invention in readable form, one can routinely 

1 5 access the marker sequence information for a variety of purposes. For example, one 
skilled in the art can use the nucleotide or amino acid sequences of the present invention 
in readable form to compare a target sequence or target structural motif with the 
sequence information stored within the data storage means. Search means are used to 
identify fragments or regions of the sequences of the invention which match a particular 

20 target sequence or target motif. 

The present invention therefore provides a medium for holding instructions for 
performing a method for determining whether a subject has breast cancer or a pre- 
disposition to breast cancer, wherein the method comprises the steps of determining the 
presence or absence of a breast cancer marker and based on the presence or absence of 

25 the breast cancer marker, determining whether the subject has breast cancer or a pre- 
disposition to breast cancer and/or recommending a particular treatment for the breast 
cancer or pre- breast cancer condition. 

The present invention further provides in an electronic system and/or in a 
network, a method for determining whether a subject has breast cancer or a pre- 

30 disposition to breast cancer associated with a breast cancer marker wherein the method 
comprises the steps of determining the presence or absence of the breast cancer marker, 
and based on the presence or absence of the breast cancer marker, determining whether 



WO 01/51628 PCT/US01/00798 

-78- 

the subject has breast cancer or a pre-disposition to breast cancer, and/or recommending 
a particular treatment for the breast cancer or pre- breast cancer condition. The method 
may further comprise the step of receiving phenotypic information associated with the 
subject and/or acquiring from a network phenotypic information associated with the 
5 subject 

The present invention also provides in a network, a method for determining 
whether a subject has breast cancer or a pre-disposition to breast cancer associated with 
a breast cancer marker, said method comprising the steps of receiving information 
associated with the breast cancer marker receiving phenotypic information associated 

10 with the subject, acquiring information from the network corresponding to the breast 
cancer marker and/or breast cancer, and based on one or more of the phenotypic 
information, the breast cancer marker, and the acquired information, determining 
whether the subject has breast cancer or a pre-disposition to breast cancer. The method 
may further comprise the step of recommending a particular treatment for the breast 

1 5 cancer or pre- breast cancer condition. 

The present invention also provides a business method for determining whether a 
subject has breast cancer or a pre-disposition to breast cancer, said method comprising 
the steps of receiving information associated with the breast cancer marker, receiving 
phenotypic information associated with the subject, acquiring information from the 

20 network corresponding to the breast cancer marker and/or breast cancer, and based on 
one or more of the phenotypic information, the breast cancer marker, and the acquired 
information, determining whether the subject has breast cancer or a pre-disposition to 
breast cancer. The method may further comprise the step of recommending a particular 
treatment for the breast cancer or pre- breast cancer condition. 

25 The invention also includes an array comprising a breast cancer marker of the 

present invention. The array can be used to assay expression of one or more genes in 
. the array. In one embodiment, the array can be used to assay gene expression in a tissue 
to ascertain tissue specificity of genes in the array. In this manner, up to about 7600 
genes can be simultaneously assayed for expression. This allows a profile to be 

30 developed showing a battery of genes specifically expressed in one or more tissues. 
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In addition to such qualitative determination, the invention allows the quantitation 
of gene expression. Thus, not only tissue specificity, but also the level of expression of 
a battery of genes in the tissue is ascertainable. Thus, genes can be grouped on the basis 
of their tissue expression per se and level of expression in that tissue. This is useful, for 
5 example, in ascertaining the relationship of gene expression between or among tissues. 
Thus, one tissue can be perturbed and the effect on gene expression in a second tissue 
can be determined. In this context, the effect of one cell type on another cell type in . 
response to a biological stimulus can be determined. Such a determination is useful, for 
example, to know the effect of cell-cell interaction at the level of gene expression. If an 

10 agent is administered therapeutically to treat one cell type but has an undesirable effect 
on another cell type, the invention provides an assay to determine the molecular basis of 
the undesirable effect and thus provides the opportunity to co-administer a counteracting 
agent or otherwise treat the undesired effect. Similarly, even within a single cell type, 
undesirable biological effects can be determined at the molecular level. Thus, the 

1 5 effects of an agent on expression of other than the target gene can be ascertained and 
counteracted. 

In another embodiment, the array can be used to monitor the time course of 
expression of one or more genes in the array. This can occur in various biological 
contexts, as disclosed herein, for example development of breast cancer, progression of 
20 breast cancer, and processes, such a cellular transformation associated with breast 
cancer. 

The array is also useful for ascertaining the effect of the expression of a gene on 
the expression of other genes in the same cell or in different cells. This provides, for 
example, for a selection of alternate molecular targets for therapeutic intervention if the 
25 ultimate or downstream target cannot be regulated. 

The array is also useful for ascertaining differential expression patterns of one or 
more genes in normal and abnormal cells. This provides a battery of genes that could 
serve as a molecular target for diagnosis or therapeutic intervention. 



30 



VI. Predictive Medicine 

The present invention pertains to the field of predictive medicine in which 
diagnostic assays, prognostic assays, pharmacogenomics, and monitoring clinical trails 
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are used for prognostic (predictive) purposes to thereby treat an individual 
prophylactically. Accordingly, one aspect of the present invention relates to diagnostic 
assays for determining the level of expression of polypeptides or nucleic acids 
corresponding to one or more markers of the invention, in order to determine whether 
5 an individual is at risk of developing breast cancer. Such assays can be used for 

prognostic or predictive purposes to thereby prophylactically treat an individual prior to 
the onset of the cancer. 

Yet another aspect of the invention pertains to monitoring the influence of agents 
(e.g., drugs or other compounds administered either to inhibit breast cancer or to treat or 
10 prevent any other disorder {i.e. in order to understand any breast carcinogenic effects 
that such treatment may have} ) on the expression or activity of a marker of the 
invention in clinical trials. These and other agents are described in further detail in the 
following sections. 

15 A. Diagnostic Assays 

An exemplary method for detecting the presence or absence of a polypeptide or 
nucleic acid corresponding to a marker of the invention in a biological sample involves 
obtaining a biological sample (e.g. a breast-associated body fluid) from a test subject 
and contacting the biological sample with a compound or an agent capable of detecting 

20 the polypeptide or nucleic acid (e.g., mRNA, genomic DNA, or cDNA). The detection 
methods of the invention can thus be used to detect mRNA, protein, cDNA, or genomic 
DNA, for example, in a biological sample in vitro as well as in vivo. For example, in 
vitro techniques for detection of mRNA include Northern hybridizations and in situ 
hybridizations. In vitro techniques for detection of a polypeptide corresponding to a 

25 marker of the invention include enzyme linked immunosorbent assays (ELISAs), 

Western blots, immunoprecipitations and immunofluorescence. In vitro techniques for 
detection of genomic DNA include Southern hybridizations. Furthermore, in vivo 
techniques for detection of a polypeptide corresponding to a marker of the invention 
include introducing into a subject a labeled antibody directed against the polypeptide. 

30 For example, the antibody can be labeled with a radioactive marker whose presence and 
location in a subject can be detected by standard imaging techniques. 
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A general principle of such diagnostic and prognostic assays involves preparing 
a sample or reaction mixture that may contain a marker, and a probe, under appropriate 
conditions and for a time sufficient to allow the marker and probe to interact and bind, 
thus forming a complex that can be removed and/or detected in the reaction mixture. 
5 These assays can be conducted in a variety of ways. 

For example, one method to conduct such an assay would involve anchoring the 
marker or probe onto a solid phase support, also referred to as a substrate, and detecting 
target marker/probe complexes anchored on the solid phase at the end of the reaction. 
In one embodiment of such a method, a sample from a subject, which is to be assayed 

10 for presence and/or concentration of marker, can be anchored onto a carrier or solid 
phase support. In another embodiment, the reverse situation is possible, in which the 
probe can be anchored to a solid phase and a sample from a subject can be allowed to 
react as an unanchored component of the assay. 

There are many established methods for anchoring assay components to a solid 

1 5 phase. These include, without limitation, marker or probe molecules which are 

immobilized through conjugation of biotin and streptavidin. Such biotinylated assay 
components can be prepared from biotin-NHS (N-hydroxy-succinimide) using 
techniques known in the art (e.g. , biotinylation kit, Pierce Chemicals, Rockford, IL), and 
immobilized in the wells of streptavidin-coated 96 well plates (Pierce Chemical). In 

20 certain embodiments, the surfaces with immobilized assay components can be prepared 
in advance and stored. 

Other suitable carriers or solid phase supports for such assays include any 
material capable of binding the class of molecule to which the marker or probe belongs. 
Well-known supports or carriers include, but are not limited to, glass, polystyrene, 

25 nylon, polypropylene, nylon, polyethylene, dextran, amylases, natural and modified 
celluloses, polyacrylamides, gabbros, and magnetite. 

In order to conduct assays with the above mentioned approaches, the non- 
immobilized component is added to the solid phase upon which the second component 
is anchored. After the reaction is complete, uncomplexed components may be removed 

30 (e.g. s by washing) under conditions such that any complexes formed will remain 

immobilized upon the solid phase. The detection of marker/probe complexes anchored 
to the solid phase can be accomplished in a number of methods outlined herein. 
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In a preferred embodiment, the probe, when it is the unanchored assay 
component, can be labeled for the purpose of detection and readout of the assay, either 
directly or indirectly, with detectable labels discussed herein and which are well-known 
to one skilled in the art. 
5 It is also possible to directly detect marker/probe complex formation without 

further manipulation or labeling of either component (marker or probe), for example by 
utilizing the technique of fluorescence energy transfer (see, for example, Lakowicz et 
al, U.S. Patent No. 5,631,169; Stavrianopoulos, et ai, U.S. Patent No. 4,868,103). A 
fluorophore label on the first, ' donor' molecule is selected such that, upon excitation 

1 0 with incident light of appropriate wavelength, its emitted fluorescent energy will be 

absorbed by a fluorescent label on a second 'acceptor' molecule, which in turn is able to 
fluoresce due to the absorbed energy. Alternately, the 'donor' protein molecule may 
simply utilize the natural fluorescent energy of tryptophan residues. Labels are chosen 
that emit different wavelengths of light, such that the 'acceptor' molecule label may be 

1 5 differentiated from that of the 'donor'. Since the efficiency of energy transfer between 
the labels is related to the distance separating the molecules, spatial relationships 
between the molecules can be assessed. In a situation in which binding occurs between 
the molecules, the fluorescent emission of the 'acceptor' molecule label in the assay 
should be maximal. An FET binding event can be conveniently measured through 

20 standard fluorometric detection means well known in the art (e.g. , using a fluorimeter). 

In another embodiment, determination of the ability of a probe to recognize a 
marker can be accomplished without labeling either assay component (probe or marker) 
by utilizing a technology such as real-time Biomolecular Interaction Analysis (BIA) 
(see, e.g., Sjolander, S. and Urbaniczky, C, 1991, Anal Chem. 63:2338-2345 and 

25 Szabo et ai, 1995, Curr. Opin. Struct Biol 5:699-705). As used herein, "BIA" or 

"surface plasmon resonance" is a technology for studying biospecific interactions in real 
time, without labeling any of the interactants (e.g., BIAcore). Changes in the mass at the 
binding surface (indicative of a binding event) result in alterations of the refractive index 
of light near the surface (the optical phenomenon of surface plasmon resonance (SPR)), 

30 resulting in a detectable signal which can be used as an indication of real-time reactions 
between biological molecules. 
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Alternatively, in another embodiment, analogous diagnostic and prognostic 
assays can be conducted with marker and probe as solutes in a liquid phase. In such an 
assay, the complexed marker and probe are separated from uncomplexed components by 
any of a number of standard techniques, including but not limited to: differential 
5 centrifugation, chromatography, electrophoresis and immunoprecipitation. In 
differential centrifugation, marker/probe complexes may be separated from 
uncomplexed assay components through a series of centrifugal steps, due to the different 
sedimentation equilibria of complexes based on their different sizes and densities (see, 
for example, Rivas, G., and Minton, A.P., 1993, Trends Biochem Set 1 8(8):284-7). 

10 Standard chromatographic techniques may also be utilized to separate complexed 
molecules from uncomplexed ones. For example, gel filtration chromatography 
separates molecules based on size, and through the utilization of an appropriate gel 
filtration resin in a column format, for example, the relatively larger complex may be 
separated from the relatively smaller uncomplexed components. Similarly, the 

1 5 relatively different charge properties of the marker/probe complex as compared to the 
uncomplexed components may be exploited to differentiate the complex from 
uncomplexed components, for example through the utilization of ion-exchange 
chromatography resins. Such resins and chromatographic techniques are well known to 
one skilled in the art (see, e.g., Heegaard, N.H., 1998, J. Mol Recognit. Winter 1 1(1- 

20 6):141-8;Hage,D.S., and Tweed, S.A. J Chromatogr B Biomed Sci Appl 1 997 Oct 
10;699(l-2):499-525). Gel electrophoresis may also be employed to separate 
complexed assay components from unbound components (see, e.g., Ausubel et al, ed., 
Current Protocols in Molecular Biology, John Wiley & Sons, New York, 1987-1999). 
In this technique, protein or nucleic acid complexes are separated based on size or 

25 charge, for example. In order to maintain the binding interaction during the 

electrophoretic process, non-denaturing gel matrix materials and conditions in the 
absence of reducing agent are typically preferred. Appropriate conditions to the 
particular assay and components thereof will be well known to one skilled in the art. 
In a particular embodiment, the level of mRNA corresponding to the marker can 

30 be determined both by in situ and by in vitro formats in a biological sample using 

methods known in the art. The term "biological sample" is intended to include tissues, 
cells, biological fluids and isolates thereof, isolated from a subject, as well as tissues, 
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cells and fluids present within a subject. Many expression detection methods use 
isolated RNA. For in vitro methods, any RNA isolation technique that does not select 
against the isolation of mRNA can be utilized for the purification of RNA from breast 
cells (see, e.g., Ausubel et al, ed., Current Protocols in Molecular Biology, John Wiley 
5 & Sons, New York 1987-1999). Additionally, large numbers of tissue samples can 
readily be processed using techniques well known to those of skill in the art, such as, for 
example, the single-step RNA isolation process of Chomczynski (1989, U.S. Patent No. 
4,843,155). 

The isolated mRNA can be used in hybridization or amplification assays that 

10 include, but are not limited to, Southern or Northern analyses, polymerase chain reaction 
analyses and probe arrays. One preferred diagnostic method for the detection of mRNA 
levels involves contacting the isolated mRNA with a nucleic acid molecule (probe) that 
can hybridize to the mRNA encoded by the gene being detected. The nucleic acid probe 
can be, for example, a full-length cDNA, or a portion thereof, such as an oligonucleotide 

15 of at least 7, 15, 30, 50, 100, 250 or 500 nucleotides in length and sufficient to 

specifically hybridize under stringent conditions to a mRNA or genomic DNA encoding 
a marker of the present invention. Other suitable probes for use in the diagnostic assays 
of the invention are described herein. Hybridization of an mRNA with the probe 
indicates that the marker in question is being expressed. 

20 In one format, the mRNA is immobilized on a solid surface and contacted with a 

probe, for example by running the isolated mRNA on an agarose gel and transferring the 
mRNA from the gel to a membrane, such as nitrocellulose. In an alternative format, the 
probe(s) are immobilized on a solid surface and the mRNA is contacted with the 
probe(s), for example, in an Affymetrix gene chip array. A skilled artisan can readily 

25 adapt known mRNA detection methods for use in detecting the level of mRNA encoded 
by the markers of the present invention. 

An alternative method for determining the level of mRNA corresponding to a 
marker of the present invention in a sample involves the process of nucleic acid 
amplification, e.g., by rtPCR (the experimental embodiment set forth in Mullis, 1987, 

30 U.S. Patent No. 4,683,202), ligase chain reaction (Barany, 1991, Proc. Natl Acad. Set 
USA, 88:189-193), self sustained sequence replication (Guatelli et al, 1990, Proc. Natl 
Acad. Sci. USA 87:1874-1878), transcriptional amplification system (Kwoh etal, 1989, 
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Proc. Natl Acad ScL USA 86:1173-1177), Q-Beta Replicase (Lizardi etal, 1988, 
Bio/Technology 6:1 197), rolling circle replication (Lizardi et al, U.S. Patent No. 
5,854,033) or any other nucleic acid amplification method, followed by the detection of 
the amplified molecules using techniques well known to those of skill in the art. These 
5 detection schemes are especially useful for the detection of nucleic acid molecules if 
such molecules are present in very low numbers. As used herein, amplification primers 
are defined as being a pair of nucleic acid molecules that can anneal to 5 f or 3' regions of 
a gene (plus and minus strands, respectively, or vice- versa) and contain a short region in 
between. In general, amplification primers are from about 10 to 30 nucleotides in length 

10 and flank a region from about 50 to 200 nucleotides in length. Under appropriate 
conditions and with appropriate reagents, such primers permit the amplification of a 
nucleic acid molecule comprising the nucleotide sequence flanked by the primers. 

For in situ methods, mRNA does not need to be isolated from the breast cells 
prior to detection. In such methods, a cell or tissue sample is prepared/processed using 

1 5 known histological methods. The sample is then immobilized on a support, typically a 
glass slide, and then contacted with a probe that can hybridize to mRNA that encodes 
the marker. 

■# 

As an alternative to making determinations based on the absolute expression level 
of the marker, determinations may be based on the normalized expression level of the 

20 marker. Expression levels are normalized by correcting the absolute expression level of 
a marker by comparing its expression to the expression of a gene that is not a marker, 
e.g., a housekeeping gene that is constitutively expressed. Suitable genes for 
normalization include housekeeping genes such as the actin gene, or epithelial cell- 
specific genes. This normalization allows the comparison of the expression level in one 

25 sample, e.g., a patient sample, to another sample, e.g. , a non-breast cancer sample, or 
between samples from different sources. 

Alternatively, the expression level can be provided as a relative expression level. 
To determine a relative expression level of a marker, the level of expression of the 
marker is determined for 10 or more samples of normal versus cancer cell isolates, 

30 preferably 50 or more samples, prior to the determination of the expression level for the 
sample in question. The mean expression level of each of the genes assayed in the 
larger number of samples is determined and this is used as a baseline expression level 
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for the marker. The expression level of the marker determined for the test sample 
(absolute level of expression) is then divided by the mean expression value obtained for 
that marker. This provides a relative expression level. 

Preferably, the samples used in the baseline determination will be from breast 
5 cancer or from non-breast cancer cells of breast tissue. The choice of the cell source is 
dependent on the use of the relative expression level. Using expression found in normal 
tissues as a mean expression score aids in validating whether the marker assayed is 
breast specific (versus normal cells). In addition, as more data is accumulated, the mean 
expression value can be revised, providing improved relative expression values based on 

10 accumulated data. Expression data from breast cells provides a means for grading the 
severity of the breast cancer state. 

In another embodiment of the present invention, a polypeptide corresponding to a 
marker is detected. A preferred agent for detecting a polypeptide of the invention is an 
antibody capable of binding to a polypeptide corresponding to a marker of the invention, 

1 5 preferably an antibody with a detectable label. Antibodies can be polyclonal, or more 
preferably, monoclonal. An intact antibody, or a fragment thereof (e.g., Fab or F(ab f ) 2 ) 
can be used. The term "labeled", with regard to the probe or antibody, is intended to 
encompass direct labeling of the probe or antibody by coupling (i.e., physically linking) 
a detectable substance to the probe or antibody, as well as indirect labeling of the probe 

20 or antibody by reactivity with another reagent that is directly labeled. Examples of 
indirect labeling include detection of a primary antibody using a fluorescently labeled 
secondary antibody and end-labeling of a DNA probe with biotin such that it can be 
detected with fluorescently labeled streptavidin. 

Proteins from breast cells can be isolated using techniques that are well known to 

25 those of skill in the art. The protein isolation methods employed can, for example, be 
such as those described in Harlow and Lane (Harlow and Lane, 1 988, Antibodies: A 
Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New 
York). 

A variety of formats can be employed to determine whether a sample contains a 
30 protein that binds to a given antibody. Examples of such formats include, but are not 
limited to, enzyme immunoassay (EIA), radioimmunoassay (RIA), Western blot 
analysis and enzyme linked immunoabsorbant assay (ELISA). A skilled artisan can 
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readily adapt known protein/antibody detection methods for use in determining whether 
breast cells express a marker of the present invention. 

In one format, antibodies, or antibody fragments, can be used in methods such as 
Western blots or immunofluorescence techniques to detect the expressed proteins. In 
5 such uses, it is generally preferable to immobilize either the antibody or proteins on a 
solid support. Suitable solid phase supports or carriers include any support capable of 
binding an antigen or an antibody. Well-known supports or carriers include glass, 
polystyrene, polypropylene, polyethylene, dextran, nylon, amylases, natural and 
modified celluloses, polyacrylamides, gabbros, and magnetite. 

10 One skilled in the art will know many other suitable carriers for binding antibody 

or antigen, and will be able to adapt such support for use with the present invention. For 
example, protein isolated from breast cells can be run on a polyacrylamide gel 
electrophoresis and immobilized onto a solid phase support such as nitrocellulose. The 
support can then be washed with suitable buffers followed by treatment with the 

1 5 detectably labeled antibody. The solid phase support can then be washed with the buffer 
a second time to remove unbound antibody. The amount of bound label on the solid 
support can then be detected by conventional means. 

The invention also encompasses kits for detecting the presence of a polypeptide 
or nucleic acid corresponding to a marker of the invention in a biological sample (e.g. an 

20 breast-associated body fluid). Such kits can be used to determine if a subject is 

suffering from or is at increased risk of developing breast cancer. For example, the kit 
can comprise a labeled compound or agent capable of detecting a polypeptide or an 
mRNA encoding a polypeptide corresponding to a marker of the invention in a 
biological sample and means for determining the amount of the polypeptide or mRNA in 

25 the sample (e.g., an antibody which binds the polypeptide or an oligonucleotide probe 
which binds to DNA or mRNA encoding the polypeptide). Kits can also include 
instructions for interpreting the results obtained using the kit. 

For antibody-based kits, the kit can comprise, for example: (1) a first antibody 
(e.g., attached to a solid support) which binds to a polypeptide corresponding to a 

30 marker of the invention; and, optionally, (2) a second, different antibody which binds to 
either the polypeptide or the first antibody and is conjugated to a detectable label. 
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For oligonucleotide-based kits, the kit can comprise, for example: (1) an 
oligonucleotide, e.g., a detectably labeled oligonucleotide, which hybridizes to a nucleic 
acid sequence encoding a polypeptide corresponding to a marker of the invention or (2) 
a pair of primers useful for amplifying a nucleic acid molecule corresponding to a 
5 marker of the invention. The kit can also comprise, e.g., a buffering agent, a 

preservative, or a protein stabilizing agent. The kit can further comprise components 
necessary for detecting the detectable label (e.g. , an enzyme or a substrate). The kit can 
also contain a control sample or a series of control samples which can be assayed and 
compared to the test sample. Each component of the kit can be enclosed within an 
10 individual container and all of the various containers can be within a single package, 
along with instructions for interpreting the results of the assays performed using the kit. 

B. Pharmacogenomics 

Agents or modulators which have a stimulatory or inhibitory effect on expression 

15 of a marker of the invention can be administered to individuals to treat (prophylactically 
or therapeutically) breast cancer in the patient. In conjunction with such treatment, the 
pharmacogenomics (i.e., the study of the relationship between an individual's genotype 
and that individual's response to a foreign compound or drug) of the individual may be 
considered. Differences in metabolism of therapeutics can lead to severe toxicity or 

20 therapeutic failure by altering the relation between dose and blood concentration of the 
pharmacologically active drug. Thus, the pharmacogenomics of the individual permits 
the selection of effective agents (e.g., drugs) for prophylactic or therapeutic treatments 
based on a consideration of the individual's genotype. Such pharmacogenomics can 
further be used to determine appropriate dosages and therapeutic regimens. 

25 Accordingly, the level of expression of a marker of the invention in an individual can be 
determined to thereby select appropriate agent(s) for therapeutic or prophylactic 
treatment of the individual. 

Pharmacogenomics deals with clinically significant variations in the response to 
drugs due to altered drug disposition and abnormal action in affected persons. See, e.g. , 

30 Linder (1997) Clin. Chem. 43(2):254-266. In general, two types of pharmacogenetic 
conditions can be differentiated. Genetic conditions transmitted as a single factor 
altering the way drugs act on the body are referred to as "altered drug action." Genetic 
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conditions transmitted as single factors altering the way the body acts on drugs are 
referred to as "altered drug metabolism". These pharmacogenetic conditions can occur 
either as rare defects or as polymorphisms. For example, glucose-6-phosphate 
dehydrogenase (G6PD) deficiency is a common inherited enzymopathy in which the 
5 main clinical complication is hemolysis after ingestion of oxidant drugs (anti-malarials, 
sulfonamides, analgesics, nitrofurans) and consumption of fava beans. 

As an illustrative embodiment, the activity of drug metabolizing enzymes is a 
major determinant of both the intensity and duration of drug action. The discovery of 
genetic polymorphisms of drug metabolizing enzymes {e.g., N-acetyltransferase 2 (NAT 

10 2) and cytochrome P450 enzymes CYP2D6 and CYP2C1 9) has provided an explanation 
as to why some patients do not obtain the expected drug effects or show exaggerated 
drug response and serious toxicity after taking the standard and safe dose of a drug. 
These polymorphisms are expressed in two phenotypes in the population, the extensive 
metabolizer (EM) and poor metabolizer (PM). The prevalence of PM is different among 

1 5 different populations. For example, the gene coding for CYP2D6 is highly polymorphic 
and several mutations have been identified in PM, which all lead to the absence of 
functional CYP2D6. Poor metabolizers of CYP2D6 and CYP2C19 quite frequently 
experience exaggerated drug response and side effects when they receive standard 
doses. If a metabolite is the active therapeutic moiety, a PM will show no therapeutic 

20 response, as demonstrated for the analgesic effect of codeine mediated by its CYP2D6- 
formed metabolite morphine. The other extreme are the so called ultra-rapid 
metabolizers who do not respond to standard doses. Recently, the molecular basis of 
ultra-rapid metabolism has been identified to be due to CYP2D6 gene amplification. 
Thus, the level of expression of a marker of the invention in an individual can be 

25 determined to thereby select appropriate agent(s) for therapeutic or prophylactic 

treatment of the individual. In addition, pharmacogenetic studies can be used to apply 
genotyping of polymorphic alleles encoding drug-metabolizing enzymes to the 
identification of an individual's drug responsiveness phenotype. This knowledge, when 
applied to dosing or drug selection, can avoid adverse reactions or therapeutic failure 

30 and thus enhance therapeutic or prophylactic efficiency when treating a subject with a 
modulator of expression of a marker of the invention. 
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This invention also provides a process for preparing a database comprising at 
least one of the markers set forth in Tables 1-6. For example, the polynucleotide 
sequences are stored in a digital storage medium such that a data processing system for 
standardized representation of the genes that identify a breast cancer cell is compiled. 
5 The data processing system is useful to analyze gene expression between two cells by 
first selecting a cell suspected of being of a neoplastic phenotype or genotype and then 
isolating polynucleotides from the cell. The isolated polynucleotides are sequenced. 
The sequences from the sample are compared with the sequence(s) present in the 
database using homology search techniques. Greater than 90%, more preferably greater 

10 than 95% and more preferably, greater than or equal to 97% sequence identity between 
the test sequence and the polynucleotides of the present invention is a positive indication 
that the polynucleotide has been isolated from a breast cancer cell as defined above. 

In an alternative embodiment, the polynucleotides of this invention are sequenced 
and the information regarding sequence and in some embodiments, relative expression, 

15 is stored in any functionally relevant program, e.g., in Compare Report using the SAGE 
software (available though Dr. Ken Kinzler at John Hopkins University). The Compare 
Report provides a tabulation of the polynucleotide sequences and their abundance for 
the samples normalized to a defined number of polynucleotides per library (say 25,000). 
This is then imported into MS-ACCESS either directly or via copying the data into an 

20 Excel spreadsheet first and then from there into MS-ACCESS for additional 

manipulations. Other programs such as SYBASE or Oracle that permit the comparison 
of polynucleotide numbers could be used as alternatives to MS-ACCESS. 
Enhancements to the software can be designed to incorporate these additional functions. 
These functions consist in standard Boolean, algebraic, and text search operations, 

25 applied in various combinations to reduce a large input set of polynucleotides to a 
manageable subset of a polynucleotide of specifically defined interest. 

One skilled in the art may create groups containing one or more project(s) by 
combining the counts of specific polynucleotides within a group (e.g., GroupNormal = 
Normall + Normal2, GroupTumorl + TumorCellLine). Additional characteristic values * 

30 are also calculated for each tag in the group (e.g., average count, minimum count, 
maximum count). One skilled in the art may calculate individual tag count ratios 
between groups, for example the ratio of the average GroupNormal count to the average 
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GroupTumor count for each polynucleotide. A statistical measure of the significance of 
observed differences in tag counts between groups may be calculated. 

C. Monitoring Clinical Trials 

5 Monitoring the influence of agents (e.g. , drug compounds) on the level of 

expression of a marker of the invention can be applied not only in basic drug screening, 
but also in clinical trials. For example, the effectiveness of an agent to affect marker 
expression can be monitored in clinical trials of subjects receiving treatment for breast 
cancer. In a preferred embodiment, the present invention provides a method for 

1 0 monitoring the effectiveness of treatment of a subject with an agent (e.g. , an agonist, 
antagonist, peptidomimetic, protein, peptide, nucleic acid, small molecule, or other drug 
candidate) comprising the steps of (i) obtaining a pre-administration sample from a 
subject prior to administration of the agent; (ii) detecting the level of expression of one 
or more selected markers of the invention in the pre-administration sample; (iii) 

1 5 obtaining one or more post-administration samples from the subject; (iv) detecting the 
level of expression of the marker(s) in the post-administration samples; (v) comparing 
the level of expression of the marker(s) in the pre-administration sample with the level 
of expression of the marker(s) in the post-administration sample or samples; and (vi) 
altering the administration of the agent to the subject accordingly. For example, 

20 increased administration of the agent can be desirable to increase expression of the 
marker(s) to higher levels than detected, i.e., to increase the effectiveness of the agent. 
Alternatively, decreased administration of the agent can be desirable to decrease 
expression of the markers) to lower levels than detected, i.e., to decrease the. 
effectiveness of the agent. 

25 

D. Surrogate Markers 

The markers of the invention may serve as surrogate markers for one or more 
disorders or disease states or for conditions leading up to disease states, and in 
particular, breast cancer. As used herein, a "surrogate marker" is an objective 
30 biochemical marker which correlates with the absence or presence of a disease or 
disorder, or with the progression of a disease or disorder (e.g., with the presence or 
absence of a tumor). The presence or quantity of such markers is independent of the 
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disease. Therefore, these markers may serve to indicate whether a particular course of 
treatment is effective in lessening a disease state or disorder. Surrogate markers are of 
particular use when the presence or extent of a disease state or disorder is difficult to 
assess through standard methodologies (e.g., early stage tumors), or when an assessment 

5 of disease progression is desired before a potentially dangerous clinical endpoint is 
reached (e.g., an assessment of cardiovascular disease may be made using cholesterol 
levels as a surrogate marker, and an analysis of HIV infection may be made using HIV 
RNA levels as a surrogate marker, well in advance of the undesirable clinical outcomes 
of myocardial infarction or fully-developed AIDS). Examples of the use of surrogate 

10 markers in the art include: Koomen et al. (2000) J. Mass. Spectrom. 35: 258-264; and 
James (1994) AIDS Treatment News Archive 209. 

The markers of the invention are also useful as pharmacodynamic markers. As 
used herein, a "pharmacodynamic marker" is an objective biochemical marker which 
correlates specifically with drug effects. The presence or quantity of a 

1 5 pharmacodynamic marker is not related to the disease state or disorder for which the 
drug is being administered; therefore, the presence or quantity of the marker is 
indicative of the presence or activity of the drug in a subject. For example, a 
pharmacodynamic marker may be indicative of the concentration of the drug in a 
biological tissue, in that the marker is either expressed or transcribed or not expressed or 

20 transcribed in that tissue in relationship to the level of the drug. In this fashion, the 
distribution or uptake of the drug may be monitored by the pharmacodynamic marker. 
Similarly, the presence or quantity of the pharmacodynamic marker may be related to 
the presence or quantity of the metabolic product of a drug, such that the presence or 
quantity of the marker is indicative of the relative breakdown rate of the drug in vivo. 

25 Pharmacodynamic markers are of particular use in increasing the sensitivity of detection 
of drug effects, particularly when the drug is administered in low doses. Since even a 
small amount of a drug may be sufficient to activate multiple rounds of marker 
transcription or expression, the amplified marker may be in a quantity which is more 
readily detectable than the drug itself. Also, the marker may be more easily detected 

30 due to the nature of the marker itself; for example, using the methods described herein, 
antibodies may be employed in an immune-based detection system for a protein marker, 
or marker-specific radiolabeled probes may be used to detect a mRNA marker. 
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Furthermore, the use of a pharmacodynamic marker may offer mechanism-based 
prediction of risk due to drug treatment beyond the range of possible direct 
observations. Examples of the use of pharmacodynamic markers in the art include: 
Matsuda et al US 6,033,862; Hattis et al (1991) Env. Health Perspect. 90: 229-238; 
5 Schentag (1999) Am. J. HealthSyst. Pharm. 56 Suppl. 3: S21-S24; and Nicolau (1999) 
Am, 1 Health-Syst. Pharm. 56 Suppl. 3: S16-S20. 

The markers of the invention are also useful as pharmacogenomic markers. As 
used herein, a "pharmacogenomic marker" is an objective biochemical marker which 
correlates with a specific clinical drug response or susceptibility in a subject (see, e.g., 

10 McLeod et al (1999) Eur. J. Cancer 35(12): 1650-1652). The presence or quantity of 
the pharmacogenomic marker is related to the predicted response of the subject to a 
specific drug or class of drugs prior to administration of the drug. By assessing the 
presence or quantity of one or more pharmacogenomic markers in a subject, a drug 
therapy which is most appropriate for the subject, or which is predicted to have a greater 

15 degree of success, may be selected. For example, based on the presence or quantity of 
RNA or protein for specific tumor markers in a subject, a drug or course of treatment 
may be selected that is optimized for the treatment of the specific tumor likely to be 
present in the subject. Similarly, the presence or absence of a specific sequence 
mutation in marker DNA may correlate with drug response. The use of 

20 pharmacogenomic markers therefore permits the application of the most appropriate 
treatment for each subject without having to administer the therapy. 

VH Experimental Protocol 

25 A. Subtracted Libraries and Transcript Profiling 

Subtracted libraries are generated using a PCR based method that allows the 
isolation of clones expressed at higher levels in one population of mRNA (tester) 
compared to another population (driver). Both tester and driver mRNA populations are 
converted into cDNA by reverse transcription, and then PCR amplified using the 

30 SMART PGR kit from Clontech. Tester and driver cDNAs are then hybridized using 
the PCR-Select cDNA subtraction kit from Clontech. This technique results in both 
subtraction and normalization, which is an equalization of copy number of low- 
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abundance and high-abundance sequences. After generation of the subtractive libraries, 
a group of 96 or more clones from each library is tested to confirm differential 
expression by reverse Southern hybridization. 

5 B. Proteomics 

Proteins that are secreted by normal 1 and transformed cells in culture are analyzed 
to identify those proteins that are likely to be secreted by cancerous cells into body 
fluids. Supernatants are isolated and MWT-CO filters are used to simplify the mixture 
of proteins. The proteins are then digested with trypsin. The tryptic peptides are loaded 

10 onto a microcapillary HPLC column where they are separated, and eluted directly into 
an ion trap mass spectrometer, through a custom-made electrospray ionization source. 
Throughout the gradient, sequence data is acquired through fragmentation of the four 
most intense ions (peptides) that elute off the column, while dynamically excluding 
those that have already been fragmented. In this way, approximately 2000 scans worth 

1 5 of sequence data are obtained, corresponding to approximately 50 to 200 different 
proteins in the sample. These data are searched against databases using correlation 
analysis tools, such as MS-Tag, to identify the proteins in the supernatants. 

In addition, protein profiling experiments are undertaken to assess whether the 
proteins associated with the expression of individual markers of the invention are 

20 secreted. Transcriptional profiling experiments are performed on fractions of RNA that 
are obtained from either (a) endoplasmic reticulum-associated (ER-associated) 
ribosomes, or (b) free ribosomes. Eukaryotic RNA which is isolated from ER-associated 
ribosomes tends to encode secreted and membrane bound proteins rather than 
intracellular proteins. Accordingly, markers of the invention which exhibit significantly 

25 enhanced expression in fractions of RNA from ER-associated ribosomes (in comparison 
with RNA from free ribosomes) are predicted to be associated with secreted proteins. 

Vni Summary Of The Data Provided In The Tables 

Table 1 shows 4068 novel nucleotide sequences identified through subtracted 
30 library experiments. The sequences of Table 1 were reinterpreted and those sequences 
are set forth in Tables 3 and 5. These sequences were determined to be novel through 
various BLAST searches of the available databases. 
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The library source for SEQ ID NOS: 1-675 was breast cancer cell cultures 
(ascites and pleural fluid cultures) versus normal (i.e., non-cancerous) human epithelial 
mammary cell lines (HMEC). SEQ ID NOS: 1-470 are preferred and SEQ ID NOS: 1- 
315 are most preferred. 
5 The library source for SEQ ID NOS: 676-1644 was cancer tissue samples 

(clinical invasive lobular carcinomas (ILC)) versus normal breast tissue samples. SEQ 
ID NOS: 676-890 and 1056-1363 are preferred and SEQ ID NOS: 676-792 and 1056- 
1 254 are most preferred. 

The library source for SEQ ID NOS: 1645-2941 was cancer tissue samples 

10 (clinical invasive ductal carcinomas (IDC)) versus normal breast tissue samples. SEQ 
ID NOS: 1645-2454 are preferred and SEQ ID NOS: 1645-2124 are most preferred. 

The library source for SEQ ID NOS: 2942-4068 was cancer tissue samples 
(clinical ductal carcinomas in situ (DOS)) versus normal breast tissue samples. SEQ ID 
NOS: 2942-3626 are preferred and SEQ ID NOS: 2942-3351 are most preferred. 

15 Table 2 shows 4843 novel nucleotide sequences identified through subtracted 

library experiments. The sequences of Table 2 were reinterpreted and those sequences 
are set forth in Tables 4 and 5. These sequences were determined to be novel through 
various BLAST searches of the available databases. 

The tester source for SEQ ID NOS: 1-64, 1960-1976 and 3038-3080 was . 

20 aggressive breast tumor cell lines and the driver source was indolent breast tumor cell 
lines (detects markers upregulated in more aggressive tumors). 

The tester source for SEQ ID NOS: 65-72, 1879, 1977-2004 and 3081-3127 was 
indolent breast tumor cell lines and the driver source was aggressive breast tumor cell 
lines (detects markers upregulated in more indolent tumors). 

25 The tester source for SEQ ID NOS: 73-629, 1880-1894, 2005-2296 and 3128- 

3471 was poor clinical outcome breast tumors and the driver source was good clinical 
outcome breast tumors (detects markers upregulated in more aggressive tumors). "Poor 
clinical outcome" is defined as the patient suffering disease recurrence following 
surgery within a period of less than five years. "Good clinical outcome" is defined as 

30 the patient remaining disease free for at least five years or more following surgery. 
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The tester source for SEQ ID NOS: 630-862, 1895-1900, 2297-2385 and 3472- 
3602 was good clinical outcome breast tumors and the driver source was poor clinical 
outcome breast tumors (detects markers upregulated in more indolent tumors). 

The tester source for SEQ ID NOS: 863-1262, 1901-1910, 2386-2567 and 3602- 
5 3988 was breast tumor lymph node metastasis and the driver source was indolent 
(colloid and tubular) breast tumor samples (detects markers upregulated in more 
aggressive tumors). 

The tester source for SEQ ID NOS: 1263-1392, 1911-1916, 2568-2735 and 3989- 
43 19 was indolent (colloid and tubular) breast tumor samples and the driver source was 
1 0 breast tumor lymph node metastasis (detects markers upregulated in more indolent 
tumors). 

The tester source for SEQ ID NOS: 1393-1638, 1917-1943, 2736-2940 and 4320- 
4604 was T1N1 breast tumors (tumors 2.0 cm or less in greatest dimension with regional 
lymph node metastasis) and the driver source was T1N0 breast tumors (tumors 2.0 cm or 

1 5 less in greatest dimension with no regional lymph node metastasis), good clinical 
outcome (detects markers upregulated in more aggressive tumors. 

The tester source for SEQ ID NOS: 1639-1878, 1944-1959, 2941-3037 and 4605- 
4843 was T1N0 breast tumors with good clinical outcome and the driver source was 
T1N1 breast tumors (detects markers upregulated in more indolent tumors). 

20 Table 6 shows novel nucleotide sequences shown to be associated with breast 

cancer. 

The contents of all references, patents, published patent applications, and 
database records cited throughout this application are hereby incorporated by reference. 

25 Other. Embodiments 

Those skilled in the art will recognize, or be able to ascertain using no more than 
routine experimentation, many equivalents to the specific embodiments of the invention 
described herein. Such equivalents are intended to be encompassed by the following 
claims. 
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What is claimed is: 

Claims 

5 

1 . An isolated nucleic acid molecule selected from the group consisting of: 
a) a nucleic acid molecule comprising a nucleotide sequence which is at 
least 90% homologous to a nucleotide sequence of Tables 1-6, or a complement 
thereof; 

10 b) a nucleic acid molecule comprising a fragment of a nucleic acid 

comprising a nucleotide sequence of Tables 1-6, or a complement thereof; and 

c) a nucleic acid molecule comprising a nucleotide sequence of 
Tables 1-6, or a complement thereof. 

15 2. A vector which contains a nucleic acid molecule of claim 1 . 

3. A host cell which contains a nucleic acid molecule of claim 1 . 

4. An isolated polypeptide which is encoded by a nucleic acid molecule 
20 comprising a nucleotide sequence which is at least 90% homologous to a nucleic 

acid comprising a nucleotide sequence of Tables 1-6. 

5. An antibody which selectively binds to a polypeptide of claim 4. 

25 6. A method for producing a polypeptide comprising culturing the host cell 

of claim 3 under conditions in which the nucleic acid molecule is expressed. 
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7. A method for detecting the presence of a polypeptide of claim 4 in a 
sample comprising: 

a) contacting the sample with a compound which selectively binds to the 
polypeptide; and 

5 b) determining whether the compound binds to the polypeptide in the 

sample to thereby detect the presence of a polypeptide of claim 4 in the sample. 

8. A kit comprising a compound which selectively binds to the polypeptide 
of claim 4. 

10 

9. A method for detecting the presence of a nucleic acid molecule of claim 1 
in a sample comprising: 

a) contacting the sample with a nucleic acid probe or primer which 
selectively hybridizes to the nucleic acid molecule; and 
1 5 b) determining whether the nucleic acid probe or primer binds to a nucleic 

acid molecule in the sample to thereby detect the presence of a nucleic acid molecule of 
claim 1 in the sample. 

10. The method of claim 9, wherein the sample comprises mRNA molecules 
20 and is contacted with a nucleic acid probe. 

1 1 . The method of claim 9, wherein the sample is isolated from breast tissue. 

12. The method of claim 9, wherein the sample is a tumor sample. 

25 

13. A kit comprising a compound which selectively hybridizes to a nucleic 
acid molecule of claim 1 . 
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14. A method of assessing whether a patient is afflicted with breast cancer, 
the method comprising comparing: 

a) the level of expression of a marker in a patient sample, wherein the 
marker is selected from the group consisting of the markers listed in Tables 1-6, and 
5 b) the normal level of expression of the marker in a control non-breast 

cancer sample, 

wherein a significant difference between the level of expression of the 
marker in the patient sample and the normal level is an indication that the patient is 
afflicted with breast cancer. 

10 

15. The method of claim 14, wherein the marker corresponds to a secreted 

protein. 

16. The method of claim 14, wherein the marker corresponds to a transcribed 
1 5 polynucleotide or portion thereof, wherein the polynucleotide comprises the marker. 

17. The method of claim 14, wherein the sample comprises cells obtained 
from the patient. 

20 1 8. The method of claim 17, wherein the sample is a breast tissue. 

1 9. The method of claim 1 7, wherein the cells are in a fluid selected from the 
group consisting of blood fluid, lymph, ascitic fluid, cystic fluid, urine, a breast exudate 
and a nipple aspirate. 

25 

20. The method of claim 1 4, wherein the level of expression of the marker in 
the sample is assessed by detecting the presence in the sample of a protein 
corresponding to the marker. 



30 



21 . The method of claim 15, wherein the presence of the protein is detected 
using a reagent which specifically binds with the protein. 
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22. The method of claim 21 , wherein the reagent is selected from the group 
consisting of an antibody, an antibody derivative, and an antibody fragment. 

23. The method of claim 14, wherein the level of expression of the marker in 
5 the sample is assessed by detecting the presence in the sample of a transcribed 

polynucleotide or portion thereof, wherein the transcribed polynucleotide comprises the 
marker. 

24. The method of claim 23, wherein the transcribed polynucleotide is an 
10 mRNA. 

25 . The method of claim 23, wherein the transcribed polynucleotide is a 

cDNA. 

1 5 26. The method of claim 23, wherein the step of detecting further comprises 

amplifying the transcribed polynucleotide. 

27. The method of claim 14, wherein the level of expression of the marker in 
the sample is assessed by detecting the presence in the sample of a transcribed 

20 polynucleotide which anneals with the marker or anneals with a portion of a 

polynucleotide wherein the polynucleotide comprises the marker, under stringent 
hybridization conditions. 

28. The method of claim 1 4, wherein the level of expression of the marker in 
25 the sample differs from the normal level of expression of the marker in a patient not 

afflicted with breast cancer by a factor of at least about 2. 



30 



29. The method of claim 1 4, wherein the level of expression of the marker in 
the sample differs from the normal level of expression of the marker in a patient not 
afflicted with breast cancer by a factor of at least about 5. 
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30. The method of claim 14, comprising comparing: 

a) the level of expression in the sample of each of a plurality of markers 
independently selected from the markers listed in Tables 1-6, and 

b) the normal level of expression of each of the plurality of markers in 

5 samples of the same type obtained from control humans not afflicted with breast cancer, 
wherein the level of expression of more than one of the markers is 
significantly altered, relative to the corresponding normal levels of expression of the 
markers, is an indication that the patient is afflicted with breast cancer. 



10 31. The method of claim 30, wherein the level of expression of each of the 

markers is significantly altered, relative to the corresponding normal levels of 
expression of the markers, is an indication that the patient is afflicted with breast cancer. 

32. The method of claim 30, wherein the plurality comprises at least three of 
15 the markers. 

33. The method of claim 30, wherein the plurality comprises at least five of 
the markers. 

20 34. A method for monitoring the progression of breast cancer in a patient, the 

method comprising: 

a) detecting in a patient sample at a first point in time, the expression of a 
marker, wherein the marker is selected from the group consisting of the markers listed in 
Tables 1-6; 

25 b) repeating step a) at a subsequent point in time; and 

c) comparing the level of expression detected in steps a) and b), and 
therefrom monitoring the progression of breast cancer in the patient. 



30 



35. 
protein. 



The method of claim 34, wherein the marker corresponds to a secreted 
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36. The method of claim 34, wherein marker corresponds to a transcribed 
polynucleotide or portion thereof, wherein the polynucleotide comprises the marker. 

37. The method of claim 34, wherein the sample comprises cells obtained 
5 from the patient. 

38. The method of claim 34, wherein the patient sample is a breast tissue. 

39. The method of claim 34, wherein between the first point in time and the 
10 subsequent point in time, the patient has undergone surgery to remove a tumor. 

40. A method of assessing the efficacy of a test compound for inhibiting 
breast cancer in a patient, the method comprising comparing: 

a) expression of a marker in a first sample obtained from the patient and 
15 exposed to the test compound, wherein the marker is selected from the group consisting 

of the markers listed in Tables 1-6, and 

b) expression of the marker in a second sample obtained from the 
patient, wherein the sample is not exposed to the test compound, 

wherein a significantly lower level of expression of the marker in the first 
20 sample, relative to the second sample, is an indication that the test compound is 
efficacious for inhibiting breast cancer in the patient. 

41 . The method of claim 40, wherein the first and second samples are 
portions of a single sample obtained from the patient. 

25 

42. The method of claim 40, wherein the first and second samples are 
portions of pooled samples obtained from the patient. 
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43. A method of assessing the efficacy of a therapy for inhibiting breast 
cancer in a patient, the method comprising comparing: 

a) expression of a marker in the first sample obtained from the patient 
prior to providing at least a portion of the therapy to the patient, wherein the marker is 

5 selected from the group consisting of the markers listed in Tables 1-6, and 

b) expression of the marker in a second sample obtained from the patient 
following provision of the portion of the therapy, 

wherein a significantly lower level of expression of the marker in the 
second sample, relative to the first sample, is an indication that the therapy is efficacious 
1 0 for inhibiting breast cancer in the patient. 

44. A method of selecting a composition for inhibiting breast cancer in a 
patient, the method comprising: 

a) obtaining a sample comprising cancer cells from the patient; 
15 b) separately exposing aliquots of the sample in the presence of a 

plurality of test compositions; 

c) comparing expression of a marker in each of the aliquots, wherein the 
marker is selected from the group consisting of the markers listed in Tables 1-6; and 

d) selecting one of the test compositions which induces a lower level of 
20 expression of the marker in the aliquot containing that test composition, relative to other 

test compositions. 

45. A method of inhibiting breast cancer in a patient, the method comprising: 
a) obtaining a sample comprising cancer cells from the patient; 

25 b) separately maintaining aliquots of the sample in the presence of a 

plurality of test compositions; 

c) comparing expression of a marker in each of the aliquots, wherein the 
marker is selected from the group consisting of the markers listed in Tables 1-6; and 

d) administering to the patient at least one of the test compositions which 
30 induces a lower level of expression of the marker in the aliquot containing that test 

composition, relative to other test compositions. 
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46. A kit for assessing whether a patient is afflicted with breast cancer, the 
kit comprising reagents for assessing expression of a marker selected from the group 
consisting of the markers listed in Tables 1-6. 



5 47. A kit for assessing the presence of breast cancer cells, the kit comprising 

a nucleic acid probe wherein the probe specifically binds with a transcribed 
polynucleotide corresponding to a marker selected from the group consisting of the 
markers listed in Tables 1-6. 



10 48. A kit for assessing the suitability of each of a plurality of compounds for 

inhibiting breast cancer in a patient, the kit comprising: 

a) the plurality of compounds; and 

b) a reagent for assessing expression of a marker selected from the group 
consisting of the markers listed in Tables 1-6. 

15 

49. A method of making an isolated hybridoma which produces an antibody 
useful for assessing whether a patient is afflicted with breast cancer, the method 
comprising: 

isolating a protein or protein fragment corresponding to a marker selected 
20 from the group consisting of the markers listed in Tables 1-6; 

immunizing a mammal using the isolated protein or protein fragment; 
isolating splenocytes from the immunized mammal; 
fusing the isolated splenocytes with an immortalized cell line to form 
hybridomas; and 

25 screening individual hybridomas for production of an antibody which 

specifically binds with the protein or protein fragment to isolate the hybridoma. 



50. An antibody produced by a hybridoma made by the method of claim 49. 



wo 
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51. A kit for assessing the presence of human breast cancer cells, the kit 
comprising an antibody, wherein the antibody specifically binds with a protein 
corresponding to a marker selected from the group consisting of the markers listed in 
Tables 1-6. 

5. 

52. A method of assessing the breast cell carcinogenic potential of a test 
compound, the method comprising: 

a) maintaining separate aliquots of breast cells in the presence and 
absence of the test compound; and 
10 b) comparing expression of a marker in each of the aliquots, wherein the 

marker is selected from the group consisting of the markers listed in Tables 1-6, 

wherein a significantly enhanced level of expression of the marker in the 
aliquot maintained in the presence of the test compound, relative to the aliquot 
maintained in the absence of the test compound, is an indication that the test compound 
1 5 possesses human breast cell carcinogenic potential. 



53 . A kit for assessing the breast cell carcinogenic potential of a test 
compound, the kit comprising breast cells and a reagent for assessing expression of a 
marker, wherein the marker is selected from the group consisting of the markers listed in 
20 Tables 1-6. 



54. A method of treating a patient afflicted with breast cancer, the method 
comprising providing to cells of the patient an antisense oligonucleotide complementary 
to a polynucleotide corresponding to a marker selected from the markers listed in Tables 

25 1-6. 

55. A method of inhibiting breast cancer in a patient at risk for developing 
breast cancer, the method comprising inhibiting expression of a gene corresponding to a 
marker selected from the markers listed in Tables 1-6. 



30 
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Sequence 1 

CCGCGGTGGCGGCCGCCCGGGCAGGTACACTATCCCTCATGATGACTCCTTAAGTGGNTC 

ATCGTCTGCATCTTCGTGTGAACCAGTGAGTGATTTTCCAGCATCTTTCCGAAAATCTAC 

CTACTGGATGAAGATGAGAAGAATCAAGCCAGCTGCTACTTCTNATGTCGAAGGGTCAGG 

TGGAGTATCANCCAAGGGGAAAAGGAAACCCAGGCAGGAAGAAGATGAAGACTATNGAGA 

ATTTCCTNAGAANAAGCATAAGCTTTATGGGAGGAAGCAACGGCCTAAAACTCAGCCCAA 

TCCCAAATCCCAGGCCCGTCGTATTCGGMGGAACCACCAGTTTATGCAGCAGGCAGTTT 

GGAGGAGCAATGGTACCTN 

Sequence 2 

TAGGGCGAA TTGG AGCTCCCCGCGGTGGCGGCCGAGGTACTTTTCAAACTCCAGATCTAA 

ATATGTTTACTTTTCA TGTGG ATTCGGAACATCCCAATGTAGTGCTAAATTCATCCTATG 

TTTGTGTCGAATGCAATTTTCTTACCAAAAGGTATGATGCACTTTCTGAGCATAATCTGA 

AATATCACCCAGGAGAAGAGAATTTTAAGTTGACTATGGTGAAACGTAATAACCAGACAA 

TCTTTGAACAAACMTAMTGATCTGACTTTTGATGGTAGTTTTGTTAAAGAGGAGAATG 

CAGAGCAAGCAGAATCTACAGAAGTTTCTTCTTCGGGAATATCTATCAGTAAAACTCCTA 

TCATGAAAATGATGAAAAATAAAGTGGAAMTAAACGGATTGCAGTTCATCATAACTCAG 

TTGGGGACGTTCCTGAAGAGAAAGAGAATGAAATCAAACCAGACCGTGAAGAAATTGTAG 

AAAATCCAAGTTCTTCAGCTTCTGAATCTAATACAAGTACCTGCCC 

Sequence 3 

AGGTACCTGCCCATCCACTGCCTTTTCCATGTATCCTGGAACTGAGCATAGACCTCTTCC 
CAGGCAGAGCTGACAGCAAGTAAAGGAGATCATAATCAGGGGACCAAACAACTTTGTCTA 
AAGTGTGAATGTCACCTAAGGAGAAGCTGTGAGATCAGAAGGGTGGGGCAGAGGAGCAGA . 
CACCATGAGGGAGAGTCCTTGGGGGTACCTGCCCG 
Sequence 4 

CCGGGCAGGTGCTGATNTTNCNCGGTNAAGCCCNCGGNNGGACCCATNCCTGCGGNCACC 

TTGACCGAANCCTGTGAGANCTCCTGAAATANAGGANCCAGCNNTTNCTCTGNTANACAN 

ANACNCTGGNGCCNTTGNNGCTGGNCTGNNTGAANGCNCACNTGAACTNCGGATTCACAG 

NAANTNNTNGGNTNATCNACNGTGGANGAAGCCANANNCCANNNTATNAGNNTNNTCAAN 

CNNNTTNANGNAGATGTATGTATGGAGAGAAACTGAGGCCTCCNNGCCAACAGCCAGCAC 

TAACTTGGCAAGCATGTTTGAGAGCCACCTGGNGAAGTGGAGCCTTCAGCCCCANTTTAA 

AGCCTTCANATGAGACTGCAAGTTCCTGGGCCACCATTCTGGGACTGCAACTTTCACAAA 

GGAGCCTCCTAAANCCAAGAAGNCCCATGCCAGATGGGAATTCTTTGGGCCCCCCAAGAA 

AATTTGTGCC CACTTACATTTTTTGGGGAAAATTTTATTN NTGCAAGCNCATTAAG ATTT 

ACCAATTACCACTTCACTTATTTTCN I NIMH I NTGGAATGCCCATACTTAAAAAAAA 

AAAAAAAAAAA 

Sequence 5 

GGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGAAAAGGATGAAAAAGGTGGTGT 

C ATGTTTTG GGGAGAATCTTACTTCTCAAATGGAAATTGCACTTTTTGCTGAATCCTTTG 

CATTTTTTTGGTAGTAAGCAGTTCATTGAGTATCAGGTCCTCAAAGGAATGAGTTGGCCC 

GGCTAGGGTGGGCCCTNTTGACCTAACTTCAGAGGGGGCCTTGGCTCAGTAGGTGTGAAT 

CAGGGAAGCCACATTGTCCTCAGGGTGCTGTATGAAGCTGGGTGTGGGCGGATTCCTCCC 

ACACCTTCACACTGGCCTGCCTCCAACTCATACAAATCTCGGAGCGGTCGGTACCT 

Sequence 6 

CNAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCTTCACTGGAGCACTTGG 

AAAACAGAAGAAATCAAAGGGCAGTCAGGACTCTCCCAGCAGGCAGGGACCAGGTCAGAG 

TCAGGATGAAGACAAAGCACCCTGAGCAAGAAGCAATGGAAGAGATGCTTGTGACCCGAG 

CTGATGGGGAAGGTTGAGGGCAACGCANAGACCCCACTGCCGACTTGCCAAGAAAACGTG 

CAGGAAATGAGGGAAGGCTCCATTTTCCTGAGAAAAGGGATGGAGGAACATAAGCAGCAT 

CAAGCTCCAGCTACTGGCTCTTTGAGAATGTAGTCGGGACTTGCCCCAGTCACCCTTGCC 

CTGAGAACTCA 

Sequence 7 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGAAAGGATGAAAAGGTGGTGTC 

A TGTTTTG GGGAGAATCTTACTTCTCAMTGGAAATTGCACTTTTNGCTGAATCCTTTGC 

ATTTTmGGTAGTAAGCAGTTCATTGAGTATCAGGTCCTCAAAGGAATGAGTTGGCCCG 

GCTAGGGTGGGCCCTCTTGACCTAACTTCAGAGGGGGCCTTGGCTCAGTAGGTGTGAATC 

AGGGAAGCCACATTGTCCTCAGGGTGCTGTATGAAGCTGGGTGTGGGCGGATTCCTCCA 
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Sequence 8 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATGTATTTGTCACTTAAAGGT 
TCTTTCTGTAAACTGCTTCAGATTCTTTTAC 

AAGAA CGTT AATA7TCCTCTTTATAATCAATCTTTCCCAGTTAGCCTTAAAAATGTATTC 

CCTACTTTTGCTTCAGGAGATCATTATTTGCAAATGCAAAGAI MM I ACTTAGACTTTT 

GAAATCACTCTTAGTAACTTTAACATTGTTTTTAGGTATGAAATTGAAGGCTGTGAAGTC 

ATTTGGGCCATTAAAGATAAAGCTATAGGGAATACTTTTCTTCGATGCAGGAGCAGCTTG 

AATTCTTTGACTTCAAAGCTCATTGCTGGAAAAATCAAGAGGGCTTAAATTTGCCCCTTA 

AAAGGAACCNGGATTTTCCCACCTGGANGGAAGGG 

Sequence 9 

GATGCAGAGTCGGAAGGGAGCAGACTGCCCTGCTGGTGAGGGAGGTGATTACCCAGGGCG 

AGTGTGCAGCCACTCAATGGCAGGGATTCTTTTCTATTTCCAGCCGGGGTGGAAGAGAAC 

CCCTTTGTCCCTCAGACAAGTGGCCCCAGTTGAGCCCAGCGCTGAAACCGCCAGTCATCC 

GCCTCTGCAGGAAGTGGCCGCCCTGACACATAGCCCAGGGCTGGATTCTTGTGGAAGAGT 

CAGTGGAGTGAGATGGTGATTGAAAAAGGAAGAAGGCCCCTCCTCAGATCCAGGGCTTTA 

ACCCCAGCGCACTCCCCGATGTCTCACCCTTTGAGCTCCATGACTGCATTCCCACTTCAA 

GGTGCAGGCACGGACGCTAACGGCCCANTGACTTGGCCAAGGCCACCCAGTAATGCAGCA 

GCTGAATTGGGGGCATTTAC 

Sequence 10 

CCGCGGTGGCGGCCGAGGTACATTTTTGAAAAGCACAATAACTTGTATTAATTGCACTTA 

ACACAATGAACCTTTAGTTTCCAACCAGTmCATTCTCTGCAGACCCGGGCrrTCT^ 

TATAAAAACTGCTTTCAAAAGGCATAGAGACACCACACATGGTCCACAGTAAATTCAAAT 

AGAGAGGTGCAATAGTTGCAGTGGTAAACACACAAAAAAATACATTTTTTTGGACTAAAA 

ATCTGGTCACGGATAAAAGCATGTGCCTTTTCATTCTTCTCTGGGATGTTACAACAGCAA 

CACGCTCTAAAACAATTAAGTTACATGCATAATGCTAAAAGAATGTGAGCAATCCTNTAA 

CCAGCTTTTAAGCCCATCTGCTTG ATTTC NTTTTTTT 

Sequence 11 

CCGGGCAAGGTACACACCAGGGATTGGGGGCCCTGCAGCTTCTACGCCCAGGGACATTTG 

CTTCGGGGACTGGAGTCCTTGCTGTGGCGTGAGGCTGTGTCTGGCGTCTGGGAGGAGGAC 

TGTGTGGGGTCTGGGTTCCCCAGCCCTAATGACCCAGCTGGTTGAAGGCAGCAGATGAAA 

GGAGACAAATGACCACGCAAACCCTGGGTGGCCTCATGAACAGGTGGCGGACAAGGGTGA 

TCCCTGTTGAGCAAGGTCCTGCACTGCGGTGGGTCCAAGGACTAAGCTGCCAGC 

Sequence 12 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAAACTTGGGA 

TCAAATGGAATCTTGATTCACTAACCAATTTAAGATCTGACTTCTAATTTTAGGAACTTT 

GGGTTATGAACGCTTCCATTTTATACCTGTGTCTAGTTAGTTTCTGCCTATCTATCCAAG 

AAGCTTTTATCAAGGGTCCACCATGTGCCAGCCACTGAAGTAGATATAAATACAAGGATG 

TGTAAGGTATGGATGATGGTATACGAACTGTCATCTTACTGGATTTGTCCGCTCTGTTAA 

AGATACGGTTCCGAAAACTTTTTAAAGCCCTAGAGAGGGCTTTAAG 

Sequence 13 

TTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAAACTTGGGATC 

AAATGGAATCTTGATTCACTAACCMTTTAAGATCTGACTTCTAATTTTAGGAACTTTGG 

GTTATGMCGCTTCCATTTTATACCTGTGTCTAGTTAGTTTCTGCCTATCTATCCAAGAA 

GCTTTTATCAAGGGTCCACCATGTGCCAGCCACTGAAGTAGATATAAATACAAGGATGTG 

TAAGGTATGGATGATGGTATACGAACTGTCATCTTACTGGATTTGTCCGCTCTGTTAAAG 

ATACCGGTTCCGAAAACTTTTTAAAGCCCTAGAGAGGGCTTTAAGG 

Sequence 14 

TNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATCATATGCCTGCTG 

AAGTGCTCTGACTTTAGGATGAGAAACTCTAACATAGGCCGGAAGACAAATAAACCATAA 

ACTGTAACAATGACTAAACAGACACTTGGCCCACTGTGGTGGATTTGTATAACATCTCTT 

CGCCAATTTATGAGCTGTTTTTATTTCCTGTTTAGTTCTCTTAGCCATGAGAGGTGGACT 

CTTTGACCTGCCCG 

Sequence 15 

CGGCCGCCCGGGCNNGTACCAAAAGGNTNAANACCCANAGGGNAAACCCACCGCGGGNNG 
AGCAACAAAGNAGGCACACANGGGGAAAGACCCCANANACGGGGGGAAANGCGGGAAAGC 
CCCCCGAGGAAGCACACACCNCANACAACACCAAAAAANACACACGGGCGAGAAGNCGAN 
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GCCCAGAANAGNGAGGGGAAAAAGANGCAGACGGAGCCCAACCCCCANGGAACAAAAAGA 

GGACCCANACCGGGGAAAGACCNAACCAGNNCAAAGAGGGGGGGAAAAGGGNCAGGCAGC 

CNAGCAACCANAACCGCGNGCAGAGAAAGCACACCNGGAGACAAGCCCNACNAANGGNGA 

AGAAAGGGGGAAAANGCCGCCAAGGAGAAAGCCAGGGCNGAAACNCACGAGAAAAACCAC 

ACCAGGGGGNNAAGACCNANGCNANACAACGAAAACCAAAGGGAAAGGGNNCGACCCAAA 

NGGGCCGGCCCGGGNAAACCANCAAGGGAAGGCCACCCCNAGGGNGAAAACCNAAAAAAG 

NGGAGGGAAAGGGGGGAAACCC 

Sequence 16 

AGGTGNGAATCAACGCAGGTCAAAATGAAATTTACACTGAAGGCTTCCAAACCAAAGGGA 

AGGACAGGATGTGTCATCAMTATGTTTNGTCACCTTGTATTATACAAAANGCTATTTTC 

TAANGAGTCAGAGA AANT NTGTGAANCTTATTGTGCGGCCCCCTTGTAATNAAATGTTAA 

CTCCCTTGTATTTAATTTTCAACACTACATTAAGAATTAAGTGGTTTNAGTTTGNGGACA 

TAAGCANCACTTATTAATATCAAGGTGTTTAAGAACTCAGGAG 

Sequence 17 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTTNTGCTTCCAGTTTATATTTCACA 

TCCTGTTCAGCTTNATTAATATGCCATGGGTCACAAAACTCAGTGCAATAAAATGTGTAT 

GAAAGAAACACCCTTCAGAAAAGATGAGACTCTTTCAAGTGTAAATACTCTAAACTAATA 

TAAGTCAAAATATATTTTTTGTGCCCAGNGATTmAAAMTTACC^ 

CTCAATAATTCAAATACTCAAGTGTCCATTTATATTTTTGGAATAAGCGAGAGTGATCGT 

AGTACCTGCCCG 

Sequence 18 

CNATTGGAGCTCCCCGCG GTGGC GGCCGAGGTACGCGGGGATTGTGTGCAAAATCACGAG 

GGGGGTGCAAGATCCTGATTTTTNAGGAGTTCAAGCGACAATGGCAGNCCAATACGGCAG 

TATGAGCTTNAACCCCAGCACACCAGGGGCCAGTTATGGGCCTGGAAGGCAAGAGCCCAG 

AAATTCCCAATTGAGAATTGTGTTAGTGGGTAAAACCGGAGCAGGAAAAAGTGCAACAGG 

AAACAGCATCCTTGGCCGGAAAGTGTTTCATTCTGGCACTGCAGCAAAATCCATTACCAA 

GAAGTGTGAGAAACGCAGCAGNTNATGGAAGGAAACAGAACTTGT 

Sequence 19 

CCGCGGT GGCGG CCGAGGTACCTCACGCGCATAAATTTGCTGCTCCTA I iTTTTTl F CTG 

TTTATGTGTTTTTATGGATCTAAGTTAAATCTTTTGGCAATATATAAAAATGTAAATA 

AAACTTTATTTATTAAGAATGTCATCTTTTTTAATT^ 

ATTTATTTTTTCTATACAGT TrTAA ATACTCAGACATATTTTGCTGTTCATGA^ 

ATCCTGTTCTCATGGATTTGTTTTCCCATACTGTTTTCTCTGATCTCAATTACAGGTTGG 

ATCTCACAAATAATAATGTCAGAGACAGAAATATTTTGCCACTGTTGATTACTATACTTT 

AAAGTTCTATATTATGAAAATATATAATAGCTTGTACCTGCCCG 

Sequence 20 

CTACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACGTCTTTCC 
GTGTGTGTATGCTGGTGAACAGTCAGATTTATTTATATTTTTTTGCAAGCATTGAATAAT 
CTAAGTTTTAAATATTATTTATCCCCATCCGTTCGTATTTATATTAAAGAATTCTGTACC 
T 

Sequence 21 

CGACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAGA 

ATAAGGTTCACAGCTGAATTGGCTTGGTGTCCTAATTCCCTATTCCAGTATTCTCAGAAG 

GATCCCATCTATGATATATGCAGAAATCACAGCCACATTTGAATGGTTCAATCTTGATTC 

ATACTGAA CTCC CTTAAGCCCAGCAGTTTCTCATTCTTAAAAGGTAGGTTATTAGAGCCC 

CAAAGCAATTTTCATAACAAATCTACCTTTGTATATTTAAGAATATGCATTAAGGCTAGG 

AGTGGTGGCTCACGCCAGTAATCCTAGCACTTTGGGAGGCCGAGGTGGGCAGATCACAAG 

GTCAGATTGAGACCATNCTGGCCAACACGGTGAAACCCCATCTCTACTAAAAATACAAAA 

ATTTGCCCGGGGCGGTGGCTCACGCCTATAATCCCAGCACTTTTGGGAGG 

Sequence 22 

CCGCGGTGGCGGCCGAGGTACTCTATGTCCCTCCCAGTTGCAACAATGTTGTTTCCAGCC 

TGGTTCAGCAGCTTGCTGGTGGTTTACTAGGCAGACATGAGCCAGAAACAAAATCAGCAA 

CCCTGGAGTGGTCATTTAGGATCAGTTTTGTCTCCTTTTACAAAGCAATGAGTGCTTTAT 

AAAAGCATTTCTCCTCAGGAAACTGGCTCCTCAAAAGTCGTTCTCCAAGCCAGCAGCTCA 

CAGTAGATGGATGAGTTTACCTGTCCTAATAATAC' n i I I ATT AT AATACTTTTATTACT 

GATACTTTATrATAATACTTTTATTACTGATACTGATACCTAGTAATACTTTTATTAGAT 



Page 3 of 635 



WO 01/51628 



Table 1 



PCT/US01/00798 



GAAATGTATCCCCAGGCCTCCACATCAGCCTTCTGGGGCCCTGACAGGTGTTTACCAGTA 
GCTNCACAACCTTCACAGGCTTGTTCTCTCATCCCTGTGAGGGAAGGTTCAAAAGGCAAC 
T 

Sequence 23 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTCGCTCAAATTAAGTTTTTTTAAAAGGTCTGT 

AATTTGAAAGGAAMCAATTTTTCACTAAAAATATCCTTArrACATGAAAGCCATMm 

AAAAGACAGAGAAAAAAGCTTTTTAATTAACAAATAAACAGCATCTCTCAGAGACACTTG 

GGAATGTTTGTTTTTAATCAGTGGGTCCTAAAAATCACTGCTTCCACCTGCAAACGAACA 

CGAATCCACTGTGGATGGCGTCCTCTGACCCACCCTGAAGGGTTCTGTTTCTCTCCACAC 

ACTTCCTTTTCTGCAACTTTCAGCAAAGCAGGTTTGGAAGAAAGAGGATTACAAAGAACT 

ACGACTGGCTCCCTAGTGMTAACTTAACACGGTAAAACCCGG7TTTCCATTTACATT 

Sequence 24 

CCGCGGTGGCGGCCGAGGTACCTACTATGTGTCACGCCATGGGGGATAC. a AAGATCTATA 

AGGCACAAGACCCTCAGTCTTGTAGTCGCCTGACAGCCAGCCAGCTACAACATAATGTGG 

AAAGGACAATGGTGGAAAATGCACTCAGGTCTTCCTAATGCACAGAGTATGCTCAGGCTG 

TGACATCAGGAAGAAAACAGATATTTACCTTAACACGGACTTGGAGGACCTTCAAAAAAC 

AGTGATGGGAGGAAATCCAGTTTTAAAAGTCTTGATTTAAAAAAAAGAAMCACTTTCTG 

TGGATAAAGATAGGCTGCAGGAAATGTAACCTATGAAATTTTCTCAAATTAGCTTTCAAA 

CACACACAAAAAATTGCATTTGGTTGAGGAGCAGAATGTAACTTATATTAAAGAATAAAC 

TACTATTTAGTATCTGAGTGAAGTACCTGCCCG 

Sequence 25 

CGAATTGTTGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAATAAAGTTCACCCACCC 

TGCACTTTNGCCCTTAGATCAATCCTAAGTAGCCATTGCCAGTAGGCCAAGTTTAATCAG 

AGGACAGTGCCTACCAGTAAATACTGAATAGTTACAATAGTTATGTCCATCCAACCAGTA 

GCAGATGAACAGCTAATACATCATGATGCTATGCTCTCCTAACAGGGTCCCCTCAGATCC 

TCAGTGAGCACATAAAGAAAGGGAGGTCATATCCCTTACATCTCTACCAGGTATTAACAC 

CTAACTACTCTCTAGCCAGAGGCAATTCCCTTTATTTCCTTACTCTCGTCGTCTTCTCTT 

TAGCCCAATCTCCTGACAATAGTTAAAACAAAAAGACCCCCAAAATATCTCTTGCTAAAA 

CAG 

Sequence 26 

ACGACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAG 

CTAAGAAATGTAACAGTATCAACCCTCCCAGTTGCTTAATTATACCCATAGGTAATACAA 

AAAGCTCTGAAGACCCAAAGATGACATTACTAATGATGTGATTTCAGGAGCCACAGAAGA 

ACCTTACCAGCTTCCCTCAAATCAGTCCTTATCCTCTTTCTATCTTCACTCCCATCATCA 

TCTATTTTCACACTATCCAGCTAAGCAAAGATTCCTGGAGGCTGACTTGTATCTTCAGAC 

TCACAGAGTGAATTCAGCTNTTCTGAATCAAGACCCACCCAGTN 

Sequence 27 

ACGCTACTATAGGGCGAATTGNNGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTTTTTTT 

rrTTTTTTTTTTTTI IGGAGAAACAGGGTCTCACTATGTTGCCCAGGCTGGTCTCAAACT 

CTTGGCCTCAAACAATCCTNCTGCCTCGGCCTGCCAAAGTGGTAGGATTATAGTCATGAG 

CCACTGCGTCCAGCTAGAAATAGCAATTTTTCTAACACATATATTTCATTGGTTTATGTA 

TGAATTTAGAATATCAAAGTGGCATGAACAGGTTCAAGATCCAACAAAAGGGAATCTTGA 

TTCTGCCTCACCTAATAAAAGCAGAAGTGCGTGGGCNCNCTAAAA 

Sequence 28 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACGAAGAGGTTCGTGGA 

AAAGGAATATGCTTAATAACTTCGAAAAAAAATACCCATAAATGGACAAGGGTGTTGGGC 

CCCCTTTCTGGGTGGATTCAGGGGTAGTCAGATATTCCAGGTTAAACTGAACCTGATGCA 

TCAATCTCCTTCTTGGAGGTGTCTCAGTGTGCCAGACACTGAATCCACCTTAGACATCTA 

TTGCCATTCAGCCAGCAACAAACACACATTCACACTCCTCTGTGCAAACCTTTCTTTGCT 

CATCTTGGTGAGAAAACACATCCAAGAGTACCTGCCCG 

Sequence 29 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGAGAAGGCACTGAAT 

AAATTCACAAAGGCCGATTGGTTCACCCATTCTTTTAGAGACAACAGACACGCAATTCTG 

ACGAGGACTCCTGTTACTAAAAGACACAGCCTCTGATACAAGAGAGATATCCCTTTGACT 

GAAGCATTACCAGGGTCCCCAGGGCCCCCTCCCACTGGGGCGGTAACACTACGGGTCTCC 

CCACCATATATTCCATGTCAAAGTATCTACACAAATACAGAGGAAATTAAGCAAGTAAAT 
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ACGGTATGTAATTGTTATCATTTGTATTTCTTTAAGGCATATTTATAAATATTTTAAA 

AAACAATATGAGGTGAGTGCCTTTCATTAGCTATGATCTTTCATACTGATATATTTTGAC 

TGGATCTGAATAAGGCAGGTTTACTGGNGGGAGCATATTAACATAAAACAGCTTATATGA 

TTTCAGGTGGGTACCTGCC 

Sequence 30 

TACTTAGGG CGAA TTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAACAGGGAC 

ATMmCTCTTTTGGCTGTTATACATTTCTTTTAAAMGCAAGTGArTGGTAGGM 

GGCAAAAGTTGAATAAGTTACMGMCAGGCTTTTATTCATTTGCTATTTGGTTT^ 

TTTCTTGTTTACT TCGTT TGATTTGTTTCTCAAAACTCACTCAGTGGGATTACCTGTCTG 

TCATGTAGCAGCTTTTTAAAAAACTGGACTGTGCAGCGTGGTTCTGAGTGCAGTCTTTAT 

AAAGCCC AGN GCTGN ACCT 

Sequence 31 

ATACGACTATT TAGGG CNAATTCCAGCTCCCCGCGGNGGCGGTNGCCCGGGCAGGTTNCT 
NACGCGC ATANT TTTGCCGCTCCTA I I II 1 1 I 1 1 CTGTTTANGTGTTTTTATGGATCTAA 
GN CAMTCT TTTGGCAATATATAAAMTGTAAATAGTAMCTTTATTTATTAAGAATGTC 
ATCiun I AATTTA TATTT ACACAATTGTTCATCTAATTT 

AAATACTCAGACATATTTTGCTGTTCATGATATTTTTATCCTGTTCTCATGGATTTGTTT 
TCCCATACTGNTTTCTCTGATCTCAATTACAGGGTGGGATCTCACAAATAATAATGGCAG 
AGACAGAAATATT 
Sequence 32 

CTATANGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCCATTGCATC 

ATGAAGTTTGGGCATATCAACATTTTGACTCATTCGGGAAATCATACGCATTAAAAGTTG 

AAGCTTTCTACGATTTGGTGGGGGAAGTAACAAACAACATANCTGTANAGCATCNATGGC 

AACCCTCTCT AAAT GAGGTTGCAGCAAGCTTTGTGTGCCAGTCAACATAGAAGAAGCTGG 

AAGTAAAGAATTTTCTGAAAGTTCTATTGTACCTTGGCCGCCCGGGCAGGTACGGAGGTG 

AACTGAAAGAGACTGCTTCCATTGGCCTAGTCCAGATGAGTGTATATATTCCCATCAGAA 

CCCACCACTGGTCATCTTTMGGCATNCACAAAATGTCCCTCACAAATTCACTTAATCTC 

Sequence 33 

TCGAGGTACCTAAGTCAAMGGCACTGNTTTGGAGATGGCACACTCATTTTCATGCGTGT 

AAAATNTTAAATCATCCACTTTGCAGGCAGTGGCTTTGATAACTCACTGCAGTGTTCAAG 

GGGTTTATAAAACTGGTTATAAGCTTCAAACCCATGTTTAGAAAAATTGACACTCATAGA 

AAAAAATGCTTTCTCTGGGCTAATTAAAATTAAAAAAATTGAAATGTAAACCGAGAAGTA 

TGGCTCACAAAGCTATAGAAAAGATCCTTCATATCATCCCTGGCCCTAGCACCGTGAGTA 

GATGTTCACCCTGATAAGGCCAGGCGGAGGTGGCCCATGTGAMTTCTTTGGCTTTGAGC 

TAATTG 

Sequence 34 

CCGGGCAGGTACTAACATGATGATAGGTTTTCAAAATATCTTTGTAGTGGATGCTGCATA 
ATTACATTCACTTCTCTTAGACTGTAAAAGACTTTCTTGACTTGTTTTAACAGTAGAGAT 
AGCAGTACCT 
Sequence 35 

TNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAGTGCCCCTTTTCA 
G ACAGi \ \ \ [ GATTCGCTCTAGAC I rTTTTTm I AATAGGGAAAAAATTTG ATAATTTT 
Cum IC TACAT GCACTTAAGACTAAAACACAGGTTTGGATTAATTTTATTTGCTTCCT 
TTTTCCGCTTTTCTTCCCGCAGAGCCTGATGGGAGAATGTCCAGGGCAGGGAAACCACAT 
TTTTTGTAGGTGATAACTCMTGA AAATT GGTGCTTATTTTTTACACTTCTCTCTTGTGG 
CTCTCTTGTGGGGCTATCTATCTGTTTTAAGGTCTCCTTGAAGGCGCACTGGGG 
Sequence 36 

CCGGGCAGG I I I I II ! ! I I I I I I I I I I I I NTCAGGGNTTGAACACNTTTTTAATTATTAG 
AMTCC ATACATGCATATAATTAACCTCTAMTTTATTTTTAAAAAATAAATTATGG N C A 
TGAAAGAACAATAGNGCTTAAANAATTTNAAATTTNCATANACATGTAATTTTTATTCCC 
TAAAAATGAAATAAGATGAAACCTTTATGGGTANATTAATGGGATNTTTTA 
Sequence 37 

AAGGGCGAATGGACT CCAC CGCGGTGGCGGCCGAGG l M I Mi i I I H ll H li 1 CTTTT 
TTTGGAGAGAACTCCATTTTATTATGGAAAGNTAAAAAACAAACAAANCAAAACAGGCAA 
TTGATAAAGGCGGCACAATNGNNAAGGAGAGGTTGAGGTGTCTCCTTAGCCACCCGACAC 
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CTGCCCGGGCGGCCGCTCTANAACTAGTGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTC 
AAGCTTANTCCGATACCGTCTGACCTCGAGGGGGGGCCCCGGTACCCAACTTTTTTGNT 
Sequence 38 

AGGTACAGAAACCTTCAGAGAGGATAAATAGCTTGCCCTGTAGAAGCAGGACTGAAACCC 

TTGTCCGCCTGACTCCCCCAGCTACTCTGCCCACTGTAGCCCCCTNNCTTACTGTCCTGG 

CACACCCCTCACCATCCTNTNTACCTTAAATATCAAAGAGGGCAAGAGNGAAAGGGCTTT 

AAAGATAAGTTATTTTTTTAAGGMCCTTAATATTATTTTTAAGNAAG 

GTGACGTGAAATTACAANAAACCTGNCCGGGCCGGCCNGCTCTTAGAACCTAGTTGGAAT 

CCCCCCCGGGCTGCAGGGAAATTTNTGANTTTTCAAAGCTTTATTCNATACCCGTNCGAC 

CCTTNAGGGGGGGGGNGCCCCGGGTACCCAAACNTTTTTGGTT 

Sequence 39 

TAGGGCGAATTGGAGCTTCCCGCGGTGGCGGCCGCCCGGGCATGGTCCGGGCAGGTACAC 

ATGCATGCATACACACCCATACAAACATCTGTGTGAGGGCAGTTCTGGAGATGAGCAGAG 

AGAGACCGGAATAMCTCAATCTTTTCTTTCCCAAGCTCCTAGCCAACACTATCCTTGGG 

AGAAAGAAATTTGCAGAAACTGCTNANACCAAGTGTGGAGATGTCAAGCTAGTTCACACT 

CTGAGGCTCAGAATATGTAGGACATGCACAATTGTGCAGTCCTTTGGGATTGGAAGTGAA 

AC AGTCTG NGATCCCCTACCTTCTAGG G AACTAGGACCTAGGAAG AGGTAAAG ATTN TCA 

GGTATGCAAAGCCGCCCCA 

Sequence 40 

CCGCGGTGGCGGCCGAGGTACGCGGAACACCCATGGCAGTGGGATATATCCCGGAAACCC 

ACAGGATGAGAGAAAAGCTTGGCGCAGATGTGATAGAGGTGGCTTTTGGGCAGATGATGA 

TTATTCTCGCTGTCAGTATGCAMTGATGTCACTAGAGTTCTTTATATGTTTAATCAGAT 

GCCCCTCAATCTTACCAATGCCGTGGCAACAGCTCGACAGTTACTGGCTTACACTGTGGA 

AGCAGCCAACT7TTCTGACAAMTGGATGTTATATTTGTGGCAGAAATGATTGAAAAATT 

TGGMGATTTACCAAGGAGGAAAAATCAAAAGAGCTAGGTGACGTGATGGTTGACATTGC 

AAGTAACATC ATGTTGGC TGATGAACGT 

Sequence 41 

AGGTGGAAAGCTGGTGCTGCTGCTGCCTGATTCCCGCCGACAGACCTTGGGACCGGGGCC 
AACACTGGCAGCTGGAGATGGCGGACACCGAGATCCGTGCACGAGACTAGGTTTGA 
Sequence 42 

CCGCGGTGGCGGCCGAGGTACCCTTTGCCTCCTTTTGTGGGGAGATATCTGGAAAATAGC 

TGAACAAACTCTACATGTGGCAAAACAAAGTGACAGTTGTTATAAAATGTCCACTAAAAT 

TACCTGCTAGCATTACTACGTATAACTGAAGTTTTTGCATATCCGCATAAGAAGAAATCC 

ACATTACAAGGAAAAGTACACCTGCCCG 

Sequence 43 

CAACTTCGGCGCCACCCAATCGCTCACCTGGAACGGCTTGCGGATCAACCGTTGCTGCGA 

TGCCAGATNCACTTTGTCCTGCAACTTGGCGAGGAACACCGGGTCGAGGGTCGACGGGAA 

AATCTCGCTGAGTTTCTGGTCTTTCAAATCATTGGTCAGGATCACCGGTGGATAACTCGC 

CAGCCCCGACACCAACTGGATGTAGGCCTGCTTGTTGTTGTCATCGGTCAGCCATTGCAC 

CGCTTGCTGCTGCGCCTTGAGCAACGTGGCCACGGCTTCAGGATGCTCATCGACAAACTT 

GCCGCTGCCCACCAGCACACTCTGGATACT 

Sequence 44 

CCGCGGTGGCGGCCGAGGTGTCACTTCTACGCATAAACTAGATATAGCTTTTGGTGTTTG 

AGTGTrCATCAGGGTGGGACCCCATTCCAGTCCAATTTTCCTAAGTTTCTTTGAGGGTTC 

CATGGGAGCAAATATCTAAATAATGGCCTGGTAGGTCTGGATTTTCAAAGATTGTTGGCA 

GTTTCCTCCTCCCAACAGTTTTACCTCGGGATGGCTGGTTAGTGCATGTCACATGACATC 

CACATGCACATGTATTCTGTTGGCCAGCACGTTCTCCAGACTCTAGATGTTTAGATGAGG 

TTGAGCTATGATATGTGCTTGTGTGTATGTCTATGTGTATATATTATATATACATTAGAC 

ACACATATACATTATTTCTGTATATAGATGTCTGTGTATACATATGTATGTGTGAGTGTA 

TGTATACACACACACACACACACACNCACTTTTGCAAGAGTGATGG 

Sequence 45 

AGAAGACTAAAGAC AGTTA TCTTATAATAACGAAATATAGTATAAATAGCACCTTATCAA 
GAATTCTGCAG GGGTTTTAAC ACTTAC AATAATAG G AAATAG CC AT TAAAAAG TTGCTCT 
MCTrTAGATTTCTAACTTTAGTGTTCTTTAACAAAGGCCATATmGTGGCCT^ 
CAAAAAATTATATCTGGCTTTATCTATTAGTAAACACAAAGGGTCCATATTTTATTCTGA 
AAAAATATTTATTATATTCATTCATAAATGTTCTAACTAA7TTAACTAAAAAAATCTTCT 
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AGTATTTTCTGATGCCACAAGCTTACTAAGAAAATTACTTCTAAAAATTGGTNATATAAA 

TCATCAATGGATTTACCCTACTTTAAAAAAGAGGGGTATCTGGNTTTCTCTTACATTTAA 

TAACCTGAAAATGGAGGTCTATAAAAATATTTTTTAAAAAATACAGNGACNCCTGNTGGA 

GGTTTTGGTAGGGCCCCTTGGTTTTTTNAAN 

Sequence 46 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAGCACTAGAAT 

AATGCTGTGTCCTCCTGTTACACATACTCATGGTCTCTCCTACTCTCCCTTCATCACACC 

TGTGACATCTTGCAATTTATATTTGATTGTGTCATTATCTAATGTATTCCTCCTGGACTC 

T AAGCTCCAAATGAAC AAATGCTTTGTCTACTTTGCCATAACTGN GTCCCCAGTGCTTG G 

CATGGGACCTGGTGCAGAGTAAATACATCATAAATATTTTGTGAATGAATGAATGAATGA 

ATGACCATGATTAATAAAAGGATATAGCTGCTCAGTCTGGTGCTGATAATGGTGGTAGTG 

ATCAAAAGGGTTNAACCACAGGTCATTAMTGAAAACAGCTAACTTAGTGCTTACAGGAA 

CCATACACTCCCTGTGTATAAACATGCCGGTGTGTGTGTCCTNGGT 

Sequence 47 

GCCCGCGGCCGCCCGGGCAGGTNCAAAAGAACAGCATAAAANCCANAAANTTNGAAGCNA 

ACAAGGANNNGCAANGGGCCGGGAAAAGTAAANAAACAGCGGCCCGCGCACAANCGGNNA 

NACAAGACAAGGAANNNAACAACCCAANCCCCACACGGAGAAGGAGAGCCNAAAAAAGGA 

AAAGNCGCCAGNCAAANGGAAGCCCCGAGGNNAGGAANGAAACCAACANGAGACCACCGC 

AAAAGAGAAAACCNANGNGGGCAANGCAAACCGNGAAAAGANGGGNCCCGNCCAAAAAAA 

ANAANNNNNCCCCTGGCNGNAGGGANGAAAAANATGGGCATATAGCCCAGAGCCACNAAG 

AAGAAAAAACANCCCACCCCCAANAAANCNAAGGGAGGCCCCAACNGAAGGCCAAAGGGG 

GNCCCAAGGGAAAAAAAANGGANGAAAGAGCCCCCCNGGGANCCAGGGAANGGGNAAAAA 

AGGGGGGAGGAGGGNGGAAAAAAACCAGGGANACCCNAAAAACCGGCNAAAGGGCCCGNC 

ACCCCAAGCNNAGGGAACCCNACCGGGAANGGGGGGGNG 

Sequence 48 

CCGCGGTGGCGGCCGAGGTACAAGAGMCAAATTAAAATTGAAAAATTGATTTCACTTAG 

AAAAACTTCTAGGAACAGGGTGAACCACTGATTTTAATTTGCCTAATTATCTTATGACAA 

GTATCAAATTAAGATGACACTTAAAGATCCTTAGCATTAACTTAATGATGGAGAAGAGTG 

C TCAAC AGACAGTTCCCAGTMGGTAATGAGATGCCATTTTCAGAGACATTCTAAGAAGA 

TATTTTGATTCATTAAAACATT AAAT AAAMGCCCTCCTCAGATTGGAACCCCCAAATCG 

ATGG AGCCACATTAATAATACTTTTCATGCCTCACTTTGACATGACAGGCATTTNGATTT 

TTTTAAAGATCTTTAATACTTT 

Sequence 49 

CGCNGGCGGCCGCCCGGGCAGGTNCGAGAAAAGAGCTAGGGTAGGCAACTTAAACTTACA 

CAGTGCCAGTCTCAGGAGGTCAGTAGCTCACAGAACTCAACAGATAAACTGGATTAAAAC 

TTAAAAGTCTTCTTTCTATTTC 

CTGCCTATATATAAATTTTTTATTMTTTTCTTGTATTGGGAAGA 

AGGATGAGAAGAAAAAATACGCTGACACTGCTAAATCGGGTATATGTTTTTGCAATAAAG 

AACACTGGTCAATATACAACTGAGGAAAAACTGAAACAGATGTGAGTCCTANAACCACAA 

GAGTTTGAATTTGCCCAGAMTGCTATTTTAAACACTCTATATGTTGGCTGCTGTTTTT^ 

GGGGAATAATGCATTCTTGGCATCCTTAAAAGGTTTCAATATGTTACAAAGGTTATCCCG 

GAAAGAGAAAAAGC 

Sequence 50 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCTCTGTCTTCTGGACCAG 

TCAGTCTATGAGCTCTTCTTTGAAACAGTCTTAGACCCAGAATGGTCATCTCACCCCCAG 

AAGACATGGTTTCCAAGGGTGGGAGGTGTTAGTTATGGGTGCCGAGCCCCACTTCCTGGG 

CGACAAGGCAAAGGTCAATTTCTCCCTGCTTCAAGAATGAAGAGGGAACTGGTGCTCCTT 

GAACCAAAGATTCTTGGAAGGGTCATTGTCACCAAAGTGGCTGGTGTGGGCAGGAGCACC 

AAGTGCAGACAAACCCACCGGGGCCTGGCGGGGCAGCCCTGTCATGGTTTCTGTCTTTGT 

TCCTGTTAGTGAAACAGAAGACCCCCGACCCCCGCCACCTAGCAGCATGGAACACCTGCT 

GCCCAGATACAGACAAGGCTCTGCTTTGTCTCCTTGTGCTGGTTTTTGGCAGAAACGTTA 

AAGGGGCTGAAGGCCTGTGGGACAGAA 

Sequence 51 

GCTCCCCGCGGTGGCGGCCGAGGTACTCATNATAGTTACATCCCTATATAAAAAGTATGT 
TTACATTTAAAAAATTAGTAGATAACT^ 

TTGAGTCAACTTTTGTTTTGGAACAAATTAAGTAAGGGAGCTGCCCAATCCTGTCTGATA 
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TTTCTTGAGGCTGCCCTCTATCATTTTATCTTTCCCATGGGCAGAGATGTTGTAAGTGGG 

AT TCTT AATATCACCATTCTTGGGACTGGTATACATAAGGCAGCCGTGAAACTGGAAAGT 

CATTTTGATGACTG ATGT GATACATCCAGAGGTAAMTGCATTTAAACATATTAAAGGAT 

TTGCCAAAGATCAATTTTCTTGCTGACATAAAAATCACACAAACCAGTCCCCCCCAAACC 

ACAACTGNCTCTCAMTAGCTTAAAAAAATTGGAAAACATTTTMGGATTTTTCMGGTT 

TCTAGATTTTNAAAAGGATGGTCAGCTTTTAGAGGGNATGGTNAAAAATTTT 

Sequence 52 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAGGAATTTACTTGAC 
CATTCCCCTTATTTTTCATCTAGAGGAATCTCGGATTCAGCCCTTTCATTGCTAAGACAC 
CTTTTCACTGAGGTTCTTACCAGCTCAGCCAAATCTCCACTCTGCTATAGCAGAAGCAAT 
AATG TTTGCTTTAAAAAGATTTCTTGACCNATGCCTTTTNTTANAMGGGGGGTAGATTA 
TTTTGAACTTNCTATC ATCANANC AGNCCCCNAAANGGGGGGGTTGGGGGGGGGATTTTT 
TTTNTGGGGNNGNNN I I I II I I INNCCNCCCCCCCCCCCNN 1 MM IAAGGI 1 1 I IGNGG 
GGGGGGGGNAAAGGGGNA I M 1 M M M M M I NNNCNCCNNGGNAAAAAAGNNNNNNAN 
TTTNI I I! M 1 1 NNTTNGGGGGGGGGGGGGGGGGGGGG 
Sequence 53 

AAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCATCCCATGTGGAATCTGTG 

AGTGTCCTCTTAAGTAGCGTGGGCTAGCCAATCTGCCGTTCATGGTGTATTGTAAACTCC 

GAATTCCATATGTAATAGGATGCAAGTCTAAGCGTTTCATGTGGACATAAATGTATCTAA 

ATAAMCTTTCCCTAGCACTGTGGCTGACCTCACCCTTACTTTTATACTTTAGTATGAAA 

CTG ATGAGAACTTTGGTAGTGAGTA I M I M I I ATATATATACATATATATGTACCTGCC 

CGGGCGGCCGCTCTAGAACTAG 

Sequence 54 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTCACGCGCATAAATTTGC 
TGCTCCTA I WWW I CTGTTTATGTG 1 1 I I I ATGGATCTAAGTTAAATCTTTTGGCAAT 
ATATAAAAATGTAAATAGTAAACTTTATTTATTAAGAATGTCATC I MM 1 AATTTATAT 
TTACACAATTGTTCATCTAATTTA I MM I CTATACAGTTTTAAATACTCAGACATATTT 
TGCTGTTC ATGATA I I I I I ATCCTGTTCTCATGGATTTGTTTTCCCATACTGTTTTCTCT 
GATCTCAATTACAGGTTGGATCTCACAAATAATAATGTCAGAGACAGAMTATTTTGCCA 
CTGTTGATTACTATACTTTAAAGTTCTATATTATGAAAATATATAATAGCTTGTACCTGC 
CCG 

Sequence 55 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAACTATATATACATGTGCATATATATGTAAAT 

TTATATTTATATATAATATCTTTATATAGATAGATATCTGCAGACAGGTTATTGATTATA 

GAGACCCAAGAAAGCAACTCAATAATTGTTCAAAGTTTTCCTCACTGACTGCTGGTGTGT 

AGTTTAAGAAGCCCCGATTTGTTTGCACTCAGAATGCCTTATCTTCTTTTAATC 

TTGATGTGGAATTTTAAGTCTGAGAAGTGAGGTGCCTTCTGAAGAAGAGATTTTAGAGAC 

TCCCTTCCTCTATAAGTTGGAAATGACCAAGMGTCTTTAAGTAGACGACAGTTAGCTGA 

CTTTGACATTGTAGGGACGTAATCAGCTTTTAACCAMTTATAGAACTGGTAAAGGGTAA 

AGATCAAATTTGCCAAAAAAA 

Sequence 56 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGAAGAAAAGAGCTAGGGT 

AGGCAACTAAAACTTACACAGTGCCAGTCTCAGGAAGTCAGTAGCTCACAGAACTCAACA 

GATAAACTGGATTAAAACTTAAAAGTCTTCTTTCTATTTGAGCCCATAATGACTATTT^ 

AACATGGCTCTTTTGCTGCTGCCTATATATAAAI MM I ATT AATTTTCTTGT ATTG GG A 

AGATCTTGAATACNCTCCAGGATGAGAAGAAAAAATACCCTGACACTGCTTAATCGGGTA 

TATGTTTTTGCCATAAAAACCCTGGNNCATATACCACTGGGGGAAAACTNGAACCCATNT 

GGGGTCTTTNAACNCCAAAGGTTTTAAATTTCCCCCCAAANNCTNTTTTTTNAAA^ 

TTTTGTGGGGGCGGCNGG I M M M IGGG 

Sequence 57 

ACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACCCCGTTCTGCCTGAGCA 
\ 'V\ I I ICCTAAAGGGAAGAATCAATAGTTTCTGACTGTTTTAACAGCTGAAAGCTCCAAC 
TGGAGGCAGAAGATGGGATGGCTTTTCACACACGTGCGTGCAAGTTTAGCCACCTCCAAA 
GGCCTTGTTCTTAAAGCA 
Sequence 58 

CCGCGGTGGCGGCCGAGGTACTGGGAAAATTTATAGAAATCATCTAGTCTTACCCTTCAT 
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CTTACATATAAGAAAAATGGTCTTTTCTTCTAATCACATTTACAAAATATGATATAAACC 

TTGACCATGAATGTATGAGCCTAATTAGAGAAACAGAAAATCAGCATGTCAGTTTTCCTT 

CATTCAAAAAAACATAGTCTTTNTAAGCAGACATTNTGGNGGTAANANGCCTAGTTTAAA 

GCCCCCCGTGGNGGCT CNAAAM NTTTTGGAATGGACTTCCCGNCNCNNCCNTTTTAAAG 

GANGNCCCCCCCNCCCATTTT TCCTG NCCCCNCNCNGNGNNTTNNNNGAAAGGGGTTCCA 

CNCCCGG GGGGG GGGGGNGNTTTTNNCCCCCCCCCANGNTNTTTTTCCAAGGGGGNTTNT 

TTTCCCCNTTTTTNTAAATAAAAN AAAAAN NCCCCCCCCC AA 

Sequence 59 

CTATAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCGTAAAACAAGGATCAT 
CGATGTTGTCTACAATGCATCTMTAACGAGCTGGTTCGTACTTTTAGTAACAGTTCAGA 
ATTCATCTTTATCTCCTACCTGCCTCATCGGTGGAAGTTTAAAGTCATGA I Mill 11 AG 
ACATTGATACTTGTGTCTATAGACAAATAAACTCATATTAGATGACAATTGATTTTTTAA 
AAGTCCAGGTAGAGAAAGGAGCMTCATTTTGAACTAAAATCTTTCTATGTTTTTTGATT 
ACTATTCAACTTGCTATTTTTTAGCAAAAAGCCGAAGTTTCAATTAGTGTTCATCTCAAA 
TCTTATTGCTTTCAACCCGGGGTACCTGCCCGGGCGGCCGCTCTAGAACTAGGTG 
Sequence 60 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAGATAGAGA 

T AATAG AAGACAAAG AG ATTAAAAG G AAATAAAAATGC ATG ATTAAAAAC TAAG AAT AAA 

AAACCTATTTTTATGTTTCCTAAAGGAAATTGTTTATTCTACAGACTCAGTAGGTAGACA 

CAAACATAAAGATTTCCCTAGAAGACATAGAGTGGGATTTGATAACACTGTCTGTTATTT 

TCTGTACCT 

Sequence 61 

GCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCATATTATACCTTTTTAT 

TATTGTTATAATTATTATGGGTATTTCTMTTAATATGATGTTGAAACCTGTTTGGCACC 

TTCTGGAAGCTACCAAAAAAATGACACTCCATTGAAGTGCTTAAAAGCTGTTCTCATAAG 

AATTCTACTGGCCTATTGCCNAAAAAAAAAAAAAAAAAAAAAAAAG 

Sequence 62 

TAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATTTTTGAAAAGCACAATAAC 

TTGTATTNTTTGCA CTNAAC ACAATGAACCTTTANTNTCCMCCAGTTTTCATTCTCTGC 

ANACCCGGGCTTTNTTTTTATAAAAACTGCTTTCAAAAGGCATAGAGACACCACACATGG 

TC CACAGTA AATTCAAATAGAGAGGNGCAATAGTTGCAGTGGTAAACACACAAAAAAATA 

CATTTTTTTGGACTAAAAATCTGGTCACGGATAAAAGCATGTGCCTTTTCATTCTTCTCT 

GGGATGTTACAACAGCAACACGCTCTAAAACAATTAAGTTACATGCCTAATGCTAAAAGA 

AATGTGAGCAATCCTATAACCAGCT 

Sequence 63 

ACTATAGGGCGAATTGNTGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCATTTCACA 

AACTCTGAAATTGGGTTCCATATTGGCAAGGCTGCCACAGTTGTTAAGAATAATCCTCTA 

TGTTTCTTCCTCACAAAACCATATCTCATTTATATCCAGACCATTACTTCACTATAATTA 

CAAGGACAAATTATTAGCAAGAAATAAGAATAGTATTAGMGAATTGATCCTATTTTGAA . 

CCCCTCTCCAGTATCTTCACACTCTTGTCAACTCTCCAGGCCTCTCTCTTGCCCTGAGTT 

ATCAGCCTGTGTGGTGTTAACTACCTTAGAAGGTACCTCGGCCGCTCTAGAACTAGnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCGNCGACTCGAGGGGGG 

GGCCCGGTACCCCAGCTTTTTGTTCCCTTTTAGG 

Sequence 64 

CCGCGGTGGCGGCCGCATCCTGAAGACCTCCTTTTTGGGCACCGGACGCCAGGTGTTCTA 

CAAGTATGGGAAACTCTCCAAGTTATCAGAGATTGTCTACGACAGTACCGCCGTCACCTT 

CGGGTATGACGAGACCACTGGTGTCTTGAAGATGGTCAACCTCCAAAGTGGGGGCTTCTC 

CTGCACCATCAGGTACCGGAAGATTGGCCCCCTGGTGGACAAGCAGATCTACAGGTTCTC 

CGAGGAAGGCATGGTCAAT 

Sequence 65 

GCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATGAGGTGTGGTGCCTATAAGCCAGGG 

TCGGAAACTTTCCAGACAAACCTAGATAGCTCTACTAGGAGGGAGTCAACGACCTATTGT 

GTCAATTAGATCCCAGCCTGGACAAGGGCATGACTATTGCTGTTTGGGGACACGTGGCTC 

TGTGTTGAAGGCAATGACTGTGTGGCTGTTNGCCATGTGGCCTGTTCTCCCCTAAGCCCT 

GAACCAAAGGTTNCAGGCCCAGATGAAGAAGGCCCCCTGGGAATTGCATGCCATANGGGG 

TTGATGCTGGACATTTACCCAGCCNTC.CATTAAATGAAAACCAl nTTTTTI 1 AAAACTT 
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ACCTGNT TTTNA AAAAAA I rTTTTl I NACCCANCAAACCCTAAAAAACCTGCAAAANGGG 
GGTTGGGTTTTANNTTTTNAAAAATTNTGGGGAAAA^ 

CCCTTTTNGMCNTNNTNCTNNNNANNCNNNAAATTTTAANANNANTNTTAAAMNTTTT 
T 

Sequence 66 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGTGGAGTCGCAGCTGCAGCGCACAGAGACTGG 

AGCGAOGGAGACCGTCACGCCCTCAGAGGCCCCGGTGCTGGCAGCTGAGCCCGAGGCTGA 

CAAGGGCACTGT GTTG GCACTCACTGAAAATAACTTCGATGACACCATTGCAGAAGGAAT 

AACCTTCATCAAGTTTTATGCTCCATGGTGTGGTCATTGTAAGACTCTGGCTCCTACTTG 

GGAGGAACTCTCTAAAAAGGAATTCCCTGGTCTGGCGGGGGTCAAGATCGCCCGAAGTAG 

ACTGCACTGCTGAACGGAATATCTGCAGCAAGTATTTCGGTACCTCGGCCGCTCTAGAAC 

TAGGTG 

Sequence 67 

CTCCCCGCGGTGGCGGCCGAGGTTTTTTGTTCAATACATTTTAGATTMGGATTGACAAG 

TAAAGATACTGCTATGGAATGATACATTGTATTTTCTGCATTGTGTGAAATAAG1 ITTTA 

TTGAAAGTCMGTGACATTTCAAAAGAAGTTCTATAACAATTATGTTTCATGCTTAAAGT 

AAAAATTCCCAGAGTTTAGTTTAGAAMTGTAATCTTTTAAATTTCAGACTGA 

CCAAGTATTTTCAT 

Sequence 68 

CCGCGGTGGCGGCCGCCCGGGCAGGTACACTGACTCAAATTTTCTCTAATCATTCATACT 

CGCA CATTC ACTCAAGCATACATACTAGACATAAAAATTGTGCAAGATCAAGCATCAGCT 

TTCTATTTTAAAAGCCTTAATTTAAAAAAACTTAGATAGGCTCACTGCAGTTCCTTTAAA 

GGAATATTCAGTACCTCTGAAAGGTGGCATTCTCACAGATGGGTTGAATGAAGCTTTCAT 

CTGGACATTCTACCACCACAAAAGCAGCACCAGGATCTGGAGGGAGTACCTCGGCCGCTC 

TAGAACTAGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCGTC 

GACCTTCGAGGGGGGGGCCCCGGGTACCCAGCTTTTTG 

Sequence 69 

CCGGGCAGGTCCAGGGGTAAAAAAAACCCTTGTGAATCCTGCACACATCCCTTCCTGTGT 
TGGTGTTTCCTTTAGACTGGCCTAGCCTGGGCATCACTGTAAAGTATGTGCAGTTGGTCT 
CTGTCCTGTACCT 
Sequence 70 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAAAAATACCAGTGCTTGTTATACTAGTTACTA 

AAAGAAGAAGAAACTCAAAATTCCTATCTGCGTGCTAATTTGAAAAGAACAACGTAGATA 

GATTTGTTGGCACATATATATGGCATATTCACATATGGCATATATACATATGGGGAGAAA 

ACATGAACCAAAGGCCAATTCAGTTATGGGAGCTCATCTCCTTCCATCTCTCCTAATCAA 

GAGCAAAGGGAACAGCAAGGCCTAACAGCAGGGTTGGGGAAGGCAAAAGGACTGGGCACT 

GAACTAAGTGAAAGGGGCGTCTGGTCTTATTTCAGAGGAAGAGGCTGGAATTGGGCTTT 

Sequence 71 

CTCCCCGCGGTGGCGGCCGAGGTGTGCTATAGACGCACAAACGACCGCGAGCCACAAATC 

AAGCACACATATCAAAAAACAAATGAGCTCTTATrTTGTAAACTCATTTTGCGGTCGCTA 

TCCAAATGGCCCGGACTACCAGTTGCATAATTATGGAGATCATAGTTCCGTGAGCGAGCA 

ATTCAGGGACTCGGCGAGCATGCACTCCGGCAGGTACCTGCCCG 

Sequence 72 

TACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGGTACCCCTGCT 

GCTGGCTGGGTTCTCAGCCCCTGACCATGAAATGGTTCCACATCGATCTACAGAAAAAAC 

AGCCACTTTGGCCCGTGGGCCCTGACGCGTAATATCATTATGGAGTCAGCAGCGCATCCC 

GTCCCATTATCGCCTGGCCAAGCTTTTCCAAGGCCAGCCTGGCTGCAAATGGCTTTCTCC 

TGTCTAGGGGGGCCACAGCCACTGTTTGCCCTCCACTACCGGAGGAGAAATTTTATCTTT 

AATTGACACTGGCTGCCTGCCACCCCCTTGCCTTCGGCTGGCCCAGGCTCCAAGCCCGGA 

TCTTTTCCTTACACTTCAGGAACAGGCTTCCCCTGGCTTCTTCCCTTCTTCTCGGCAATT 

CCCGTCCTCACCTTGGTTGTTCTACACTGGCCAAGACAGTCCNGGGGGGNGGGTCTTTGG 

CCTTT 

Sequence 73 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCTTCTCT 

CCTTCCCTGGCTCTGAACATCATTCAGAACAGGAACATGGGGAAGCCAGCCCAGCCTGCC 

CTGAGCCGGGAGGAGCTGGAAGCACTCTTCCTCCCCTATGACCTGAAGCGGCTGGAGATG 
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TATTCACGGAATATGGTGGACTATCACCTCATCATGGACATGATCCCGGCCATCTCTCGC 

ATCTATTTCCTGAACCAGCTGGGGGACCTGGCCCTGTCTGCGGCTCAGTCGGCTCTTCTC 

TTGGGGATTGGCCTGCAGCATAAGTCTGTGGACCAGCTGGAAAAGGAGATTGAGCTGCCC 

TCGGGCCAGTTGATGGGGACTTTTCAACCCGGATCATCCGCAAAGTTGTGAAGCTATTTA 

ATGAAGTTCAGGAAAAGGCCATTGAGGAGCAGAATGGTGGCAGCGAAGGATTGTTGGTCA 

TGGGAGCCCACGATGAAAGACCCTCAGTGACGACCTAANATGAAGCAGCAAAG 

Sequence 74 

GGGCNCTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAT 

AGATCCTGATCTACTGGCAGAGCTCAGCGAAGAACAGAAACAGATCCTGTTCTTCAAGAT 

GAGAGAGGAACAGATCCGACGATGGAAAGAAAGAGAAGCAGCTATGGAAAGAAAGGAGTC 

CCTGCCAGTGAAACCCAGACCAAAGAAAGAGAATGGGCAAATCGGTTCATTGGAAACTTG 

GAGCTGATAAG 

Sequence 75 

TTAGGGCGAATTGGAGCTCCCCGCGGNGGCGGCCGCCCGGCCCGGTACTTTGNTTCTNTT 

GGGNAGCNTAANACAGNAACCCTCAAGGAGCTAGAGAACCGGATGGGAGACATGAGCAGT 

AATTAACTCACTTGTTCCCCAGAGTNTCTATTTGGANNTGATTTTCTTTTTCTGNGACTN 

ATTTTCCTATTTTCTTTCCTCCATGTAATTTTCACTATGGCCCAACTAATATAAACACCT 

GGAAATTACAAGGAAAAAAAATTCTTCCTCTAATAACTNTCCAMTTTGNGGMTATTTA 

TTTGTAANANCAGTTATCAAGGTATGCTTATATAGCAT 

Sequence 76 

GNGGGCGGCCGCCCGGCCAGGTNCAGCGTGGTCAAGGTAACAAGAAGAAAAAAATGTGAG 

TGGCATCCTGGGATGAGCAGGGGGACAGACCTGGACAGACACGTTGTCATTTGCTGCTGT 

GGGTAGGAAAATGGGCGTAAAGGAGGAGAAACAGATACAAAATCTCCAACTCAGTATTAA 

GGTATTCTCATGCCTAGAATTTTTGGTANAAACMGAATACCTTTCTNTGGCAAATAACT 

AACCCNTGGNGGGACCAAAATTNTGGGATTTTAAAGTTG 

TAAAAAGGCCTGTTTTTTGGGANAAANAMNAAMCCTTTTTTNNTT^ 

AAAAAGNNGATTTTTTGGGGGGGGTTTNCCNTTTTrNACCCNTTNTTNAA 

GGGGGGTTTTTAAAAAANTTGGGGGGGCCCCCCCCCCGGGNGGGGGGGGGAANTTNNAAA 

NCNAANTTTTTTNCCCCCCCCC 

Sequence 77 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTAC I Mill I CTCTTTGA 

AATCTCATAGGATAGTCACACTTATAAACATTCCCAATATTCGGATTCTAGAAGAAATGC 

AATTCATTAAAATTTTCCTGGCACTGAGAGTTAATCTTTAGCAGATTGCATGAAMTACT 

GAATTCCTGGTAAGGAGATATTTTGTTTTAAAAATAATGTGTTTTGATACGAATCAGTGT 

ATTAACTGATAACTAAAAAGTACCT 

Sequence 78 

CCGGGCAGGTACGTTGAACGTTTATTACAACTAATTGGCGATGTGATAAGACAGTGCTCA 
CGTGGCCTGAATGTTGGTCACAATCACAACAAAGCTTAATCCAGCCCAGCATATATAAGT 
GAAAATATAAACCATGAAGACATGTTTAGATATGTATAAGTACCT 
Sequence 79 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCGGTTCTCTTGG 

TCTTATTCCCATCCTGGCCAATGCTTAAATACTATTTGTTGAAAATAATTCTTTGAGACA 

GAT7TCAGCTACCTCCCTTCCAGGTTCGATTTAACTTC 

TATAGGTCTTACCTGTGTGAAAGAAAGAAAAAGAAAGAAAGAAAGAAAGAGAAAGGAAAT 
TATAAGGTCAAGTTAACAGTTTTG AGGTTTTGTG I mm I CTGGAACTACTTCAAGTGA 
GAAAATAAAAAAAAATGGTGACAAAGCTGTACCT 
Sequence 80 

NCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGI I 1 I II I I M 1 1 M IGAAA 

AGGAAGGAAAATATTTTATTAAAAGTTTTTTAAAG7TTATGAATNGAAGTC 

AAACTACTTGATACAAAAACATTCAGATATTAGATGTCAAAATAAATCAGCACATTTGAA 

AATACTATAAATTATATTTCAAACAAATATATACACATTATGTTAACCTTCAACAGGATC 

CATTATCACTACTTAGAACACTGATATGTTTATCTTTTAAGTATGTAAAAATTACATAGC 

TGTTAACTTTGTATGGCMTTCACCTATAACACATTTAAGAAAGCATTACAAMTTCATT 

ATATAATAATTCTCAAAAGTTTTCTCACATT 

Sequence 81 

GGTACTTTAACTTTTGAAGGTGGTTTCTGGTTCCCCCAATCGGTGAGCCCAAAGACCTCA 
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AATAACATTTTATTCACACAGACTTCTTAGAGATGAAAAGTTTCTAGAGAGCAGAGCCTT 

TAGGGAACTAGGGGATGATGCATTCACTGAATAAATCACAGTATTATACTCAAGAGCAAA 

TACGTTTTCTTTCTTATTGGTGTCATCTTTCCTTGTGAAATACGGCAACTGATGAAGAAG 

TCTCCTATTGAGAATMCCAGACGAAATCACAAGGCTAGACAAGCAGCTACTAACACCAT 

CCCCTTCCCCCTGCCCATGGTAAACCCTGCTGGCAAATGTATTTCTCCCTCCTAAACCTG 

GGAGGACTCACGTTTCTGCACCCAGAGCCCCAGGGGGAAGTAAACAATGGGGAGTGAGCC 

GATGAGGTGGGAT 

Sequence 82 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTGTTCTTATAAAAGATTCTTTCTCCCAGAATT 

ATATCTCCTCAGAGCAACAGCAAGGTTCTCAGGATCGAAGCCTACTCTAGCCTGAAGGGC 

TAGGAAGATTAGGATAAGGATAAGGATAATAATCCAAAAGTCTCGACAATTCCAGTAGTC 

TCTGGATGGCTCCMCATCATAGAAATTTAACACTGTTCCACTTGTTTACAAAATCTAAC 

ACTGGCTTAGACATTCTGGACTTTCAGTGAGGGTTCCAGCATCTGATGTCCCTCAACTCC 

TTTCCAGGGTGAGAGGCCCACTTACAGGAAACTTAACTTCTCACCATGTGGACCCTGAGG 

GGTTTTTCTCTTGCAGAAGGGCTAAAAAAGTGAGGAAGCTATGGAAAAATGGTGGGTCAC 

TTTTMGGCAGACTGTTTGGGTTGTGTGGGAATTTTTCCTAACCTAAGTTCTACCATCTT 

GGGATTATTC 

Sequence 83 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCCTGTGCGTA 

GGTGGAGCGGTTGAAGCAAGGTGGGTGTGAAGCCCCGAGAGTGGTGCTTTGGTCGGACTG 

GCACACTCCGGGCAGCTGGGAGAGTGACAGGGAGGCTGAGAAGGGGCCCGCTGGATCCCG 

CAGCTGGCGCGCACATAGTGTCTGCTGCCCTGCTTAGGACAGTTCTCTTTGCATCCGTGA 

GGAGCCAAGACAAGTCACAACTGCAAGTGAGGGGGTTGCCAAAGAGGTTGATCGATAGGA 

CCTGGGAGGAATCCAGGGTCCAAGGAGGGAAGGAACTCAAGTTGCAGTTCTGGAACAGAA 

GGACCTATAGATGTGGAGTATCTTGAAAGATGTGTGTGGCGACAGAAGCAAGTGCTGGGG 

GAGTCTTGACAGTCCAGCTGTTGCAGGTGGGGGGTCATCTTGAAAAATGTCCCCCTCAGG 

GGTCGTCAAGCCGAGGCATNTTCACCTNGGNCCGCTTTAAAACTAAGTNGGATCCCCCCG 

Sequence 84 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCTAACCTACCTTTAAGACTGGGATAACTATTG 

GAAACAATAGCTAATACCGGATATAGTTATTTATCGCATGATGAGTAATAGAAAGGAGCT 

TCACAGCTTCACTTAAAAATGGGGGTGCGGAACATTAGTTAGTTGGTAGGGTAATGGCCT 

ACCAAGACGATGATGTTTAGCCGGGCCGAGAGGCTGTACCT 

Sequence 85 

CTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGG CCGC CCGGGCAGGTACTCTTTGA 

GGACATTTTTGTCAGATTMCTATAACAGTGTAGTGTAGTTTTTAAAATTGCAGTTGAAA 

AGTTTAGCTGTCTTGGAAGTCAAATTTATCCAATTGTTCAGACTTCTGTTACTACTTAAT 

ATGAAGCCACCATGCTGGCTTGGACAGAATTMTTTCATTCATGTTATGGAGAATTCTAT 

ATTACAAATCTGGTCCCCTATAATATGAACAGTGAGCAGTCAGAAATATACAAAGGGTTA 

AATAGGGTAAAGAC7TTGGCCAAGAAAGGAAAGGCCTTAGTTCTACCATAGAGTATCTTC 

TCTAATTAAAATGACTGGGAAATATATGGAAGCAGAAACCAGCACAAAGCACTACCCATC 

TAGAAATAATTTTTCAGTTAAAAAACAACTCTCAAMCCAGCACTCATTTCTCTAAGATA 

G GGTTATAAGTATTTTACG ATTTCTTGGTATATTAAAAT 

Sequence 86 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACCAGCCAGCTCCCACC 

ACTCACGGCCTTCATCCTGCCTTCGGGAGGCAAGATCAGCTCGGCGCTGCATTTCTGCCG 

GGCCGTGTGCCGCCGGGCCGAGAGACGTGTGGTGCCTCTTGTCCAGATGGGAGAGACCGA 

TGCGAACGTGGCCAAGTTCTTAAACAGACTCAGTGACTATCTCTTCACGCTAGCCAGATA 

TGCAGCCATGAAGGAGGGGAATCAAGAGAAAATATACAAGAAAAATGACCCAT 

Sequence 87 

CCGCGGTGGCGGCCGAGGTACAGAAGAGGTTCGTGGAAAAGGAATATGCTTAATAACTTC 

GAAAAAAATACCCATAAATGGACAAGGGTGTTGGGCCCCCTTTCTGGGTGGATTCAGGGG 

TAGTCAGATATTCCAGGTTAAACTGAACCTGATGCATCAATCTCCTTCTTGGAGGTGTCT 

CAGTGTGCCAGACACTGAATCCACCTTAGACATCTATTGCCATTCAGCCAGCAACAAACA 

CACATTCACACTCCTCTGTGCAAACCTTTCTTTGCTCATCTTGGTGAGAAAACACATCCA 

AGAGTACCTGCCCGGGCGGCCGCTCTAGAACTAG 
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Sequence 88 

CTTAGGGCGAATTG GAGCT CCCCGCGGTGGCGGCCGCCCGGGCAGGTACAGAAAAAGGAT 
TGGAATTGT G TTTAGT TTTTGGTGTTTAGATTATTTTCTT^ 

nTTTTTTI ICTTTTTTA CAATAGG CCAGTAGAATTCTTATGAGAAACAAGCTTTTAAGC 
ACTTCMTGGAGTGTCATTTTTTTGGTAGCTTCCAGAAGGTGCCAMCAGGT^ 
CATATTAATTAGAAATACCCATAATAATTATAACAATAATAAAAAGGTATAATATGGTAC 
CT 

Sequence 89 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGTTGCACGTCGAATTA 

AGGATCTAGCTGCTGACATTGMGAAGAGCTTGTTTGTAGACTGAAAATTTGCGATGGGT 

TTrCACTGCAACTAGATGAATCAGCTGATGTTTCAGGACTTGCTGNGCTGCTTGTGTTTG 

TTCGTTATAGGTTTMTAAGTCTATTGAGGAAGACCTACTCCTGTGTGAATCTTTGCAAA 

GTAATGCTACCGGTGAAGAAATATTCAACTGTATCAACAGTTTTATGCAGAAACATGAAA 

TTGAATGGGAAAAATGTGTTTGATGTTTGTAGTGATGCTTCTAGGGCAGTGGATGGGAAA 

ATTGCCGAAAGCTGTCACCTTAATAAAATATGTGGCTCCCGAAAGCACCAGTAGTCACTG 

CCTATTATACAGACATGCCTGGGCAGTTAAAATAATGCCTACATCTCTNAAAAATGTGCT 

AGACCAGGCTTTCCTGTACCTGCCCGGGCGGGCGCTTAAACNAGGGGGGATCCCCCNGGC 

Sequence 90 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCGAACAATG 

ACAGG GGAA GGGTATTGGACACGGCAGCGTCCTCCTTATTGAAAACACATTATGTCAGTT 

GGGMTTTTAAATMGCTTTTAGC AAAC CTAAC ACTAAAAG CAAAATANAAG AAAGCTAT 

ACCATTACCATAATACATTTTTCATCTCATGGCTACMTGGMTTNTTGAAAAGGAAAAA 

AAAATCCTATCTACATATAAAAACCTGCATGAATGAATCACTACATATGCTTATAATGAG 

GA AGAGT TATGGGTCCTGAGTGTAATTTTTTATCCTTTCTTAAAAAGTTTCTGTATTA 

CATTTTTGATAACACTCTGATGATCCTTCCCTTACATTTGAAATGTTATGTACCCTNGGC 

CGCTCTAGAACTAG 

Sequence 91 

CCGCGGTGGCGGCCGAGGTGGCTATTGAACTTCCTTTTCTTGTTTGAAGTTAGCTTCAAA 

TTTGTTCCTATCCAGAATATTTACAGGTAATTTAGCATAGGAGCAAATTACCTGTAAATA 

TTCTGGATAGGMCTACTTGAAATAGTAATTTGTTAAAAGATATGACAAAATGAAAATGC 

TTAAACTACAGAAATTTAAAAATGCCATAACAATCTTGCGAGACTAACTTTAAAATATAC 

TTTAAATGATTATTATGATTTTGGTGGTAACGATCCCCCACACACAACCACTATGAAGAA 

ATAATGCCGCATTTTTCCCCCATTG 

Sequence 92 

ACACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTACA 

TGGGTGTTTTGATCTCTGTTCTTTCATACTACATTTGAACAGGGCAAAATGAACTAACTG 

CCATGTAGGCTAAGAAAGAAATGCTAACCTGTGGAAAGTTGGTTTTGTAAAATTCCATGG 

ATCTTGCTGGAGAAGCATCCAAGGAACTTCATGCTTGATTTGACCACTGACAGCCTCCAC 

CTT GAGC ACTATTCTMGGAGCAAATACCTTAGCTCCCTTGAGCTGGTTTTCTCTGATGG 

CACTTTTGAGCTCCTAAGCTGCCAGCCTTCCCTTCTTTTCCTGGGTGCTCAGGGCATGCT 

TATTAGCAGCTGGGTTGGTATTGGGAGTTGGCAGACAGGGGATGTTCAACTTAATGAAGA 

AATACAGCTAAGGCCTTGCCAGCAACACTTGCGTAAGTTACTGGCTTGAGTGAGGGCTTA 

AAAAATTNAAAGGGTACTGGGTTTTATCCTCTATCCCTTTTTCCCT 

Sequence 93 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGGATGGACGATACTGGCT 

ACAGCGAGAAAGCAACCTTTGCCTCAGTGACATATCCTCGGGGCTATCCCTACAACGGGA 

CAAACTATGTGAATGTCACCATGCACCTCCGAAGTCCCATCACCAGGGCAGCTTACCTCT 

TCATAGGGCCATCTATAGATGTTCAGAGCTTCACTGTCCACGGAGACTCTCAGCAACTGG 

ATGTGTTCATAGCCACCAGCAAACATGCCTACGCCACATACCTGTGGACAGGTGAGGCCA 

CAGGACAGTCTGCCTTTGCACAGGTCATTGCTGGATCGTCACAAAATTCTGTTTGACCGG 

AAT 

Sequence 94 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTGCTGAAATTGAGA 

CTCTTCTCCCTTTTGAGCAGAGGTCATTTTCTCTTGACATTGAATTCCAGTATTTGTGCA 

GAAAAGAGGTGTCAATGAGGACTCCATGGATACTTTCAGTTGATCAGTATTGGCTCTGCT 
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GCAGGGTGATTTTGGCTTCACACATAAATGTTGCAGTGGCATTTGTCCAGGGGAAAGCCC 
GCTTGGAGGCTGTAGGATGACCCTGATGGAGGCTGTCACTCAAACAGCTCCAGGTAGTCA 
GGCAGCTGGTCCTGCTGACCACAGGGATAGAGAAGGAGCTCCTTCCCCGGGGAAGTGATG 
GGTT 

Sequence 95 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAGAAACCCACCTCACC 

CCGGCTCACATCTAAAGGGGCGGGGCCGTGGTCTGGTTCTGACTTTGTGI 1 1 I IGTGCCC 

TCCTGGGGA CCAGA ATCTCCTTTCGGAATGAATGTTCATGGAAGAGGCTCCTCTGAGGGC 

AAGAGACCTGTTTTAGTGCTGCATTCGACATGGAAAAGTCCTTTTAACCTGTGCTTGCAT 

CCTCCTTTCCTCCTCCTCCTCACAATCCATCTCTTCTTAAGTTGACAGTGACTATGTCAG 

TCTA ATCT CTTGTTTG C C AGGGTTCCTAAATTAATTCACTTAACC ATG ATGCAAATGTTT 

TTCATTTTGTGAAGACCCTCCAGACTCTGGGAGAGGCTTGGTGTGGGCAAGGACAAGCAG 

GATAGTNGAGTGAGAAGGAGTTGGAAGGGNTTTA 

Sequence 96 

CCGGCCAGGTACCTAACCTACCTTTAAGACTGGGATAACTATTGGAAACAATAGCTAATA 
CC G G NTATAGTNATTTATCGCATGATGAGTAATAGAAAGG AGCTTC ACAGCTTN ACTTAA 
AAATGGGGGTGCGGAACATTANTTAGTTGGTAGGGTAATGGCCTACCAAGACGATGATGG 
TTAGCCGGGCCGAG 
Sequence 97 

AGGTACAGCCTCTCGGCCCGGCTAAACATCATCGTCTTGGTAGGCCATTACCCTACCAAC 

TAACTMTGTTCCGCACCCCCATTTTTAAGTGMGCTGTGAAGCTCCTTTCTATTACTCA 

TCATGCGATAAATAACTATATCCGGTATTAGCTATTGTTTCCAATAGTTATCCCAGTCTT 

AAAGGTAGGTTAGGTACCTGCCCG 

Sequence 98 

TATNCTGGAACTGAGCATAGACCTNTTNCCAGGCAGAGCTGACAGCAAGTNAAGGAGATC 

ATAATCATGGGGACCAAACAACTTTGNCTAAAGTGTGAATGTNACCTAAGGAGAAGCTGT 

GAGATCAGAAGGGNGGGGCAGAGGAGCAGACACCATGAGGGAGAGTCCTTGGGGGTACCT 

GCCCGGN 

Sequence 99 

GGGGAACACCCCCCNGNGGCGGCCGANGCCMNANCTTTTTGNGGGGGAGGNAAACCCCA 

CCCCCCCAGACAAAACNGNCACGNACCNCGGNNTTTNTAGAACNAGGGGNGCCCCCCCCC 

ANAAANAANNnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnTTANAAAATTTTTTTGAGGGGGGGGGGCNCCCCCGNAAAAAACANGGNCANAGAAG 

NNCCCCCGNGNGAAAGGGGGANCCGNNCACAANNNCACACAACACACGAGCCGGGAGCAN 

AAA 

Sequence 100 

GGNAACCCCCCCCNGNGGCGGCCGCCCGGGCAGGNTCNCTTAGGGGGGGGGNAAACCACC 

CCCCCCCCNGNGCGAAAGNCNACCNNNGATGTTTTCAGNTNCANGGGGGGAANCNCCCNG 

NAAAACGGGNGNGCAGGCNGNAANNNNCNNNCNNNANAAANNCNGCAAANCNAAAGAACG 

ATTTTTCC 

Sequence 101 

ACCGCGGNGGCGGCCGCCCGGGCAGGTACTTTTGTNGGNTGGGGAAAAAACNCTTTAACA 

AGCTTAGATTTATTATNGCGCCTTTTATGGAAGATGAGGAAGAAATAGGAAAGCAGTAAA 

ATGAACAATGAGAATCTGTTACCCAAACTGATATAAACTCCCTCAGAAAGAAAGATACTA 

CTCCCTAATCTNTNGAAGAGGGNAGNGGCAATAATGATCAAATGCCGNAAAGGGGANTGN 

CACTACAGCTAATAC 

Sequence 102 

AGGTACAAGGGATTTTAACCNTTNNNCANAAANCCCANNNGTGAATGCTTNGCCATACTT 

GCTTTAAGGTGTTATGGTNGGCACAGTTTACTGGCTTCGCCTGTTAMTTTACAAATGTC 

CTGTTTGATACTACTTGTTAGMCACTATTTTTTTAMTACAGAAAAAGCTNCCTA^ 

GGCAACTTTCAGANGAAATTTAAAAAATNCACAGGAGGTATTTTATTTACCCAATGGCTT 

GCCAGGGTTCCCTGGC 

Sequence 103 

CCGGGCAGGTACAGTTGGATTGACTACAAAAAAAAATTAGGCCTTGTCCATTTATCCAGA 

GGTTTCCTTCAAAACTTTrAAAAAATTrTATCACCTAAAATGGATTTT 

TTTAGATGTAGCAATTAAGTAAATCTTAAAGGAGGTGTAGAATTCCTTTAAGAAGTTATT 
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GTCTGCTTCCCATTGCAGCAATGTCTGCMTACTACTTTTTTTAATGAGCCCAGAAACAA 
TTCT CTGCAACTGTAAAAATTATGCCAGAATGCAAGTTTTTGGTTAGTGCCATAGTTTCG 
TTTrGTTATTGAAAATGGGATMGTCATGATTTATTCCCTCCCTTCTATGGAA 
Sequence 104 

AGGTACTTTTACCTTGAATCTACTAGAACTCTCTGCTATTCAAAACAAAGAGCTCATACT 

TGTTGGAGTAGGGAAAAMTTAGAAATTTGACCAAAAGATAGATTCAATCAACTACAAAG 

TCAATTCCAAGATGCTGATAACATCGAAACTCTTGAAAAAGAACTATTAGAATCTGGAGT 

TGAACTTTAATGGATCATATACCTGTTTTGCTTGATCAAGTGATTGATCAGCTTAATATT 

AAAGMGATGGTATCTATTTAGATCTTACTTTAGGACGTGGTGGTCATTCGAGTCAAATT 

TTAAAAAAACTTACTAGTGGCAAGCTTATAGTCTTTGATAAAGACAAA 

Sequence 105 

MCCCCACCGCGGTGGCGGCCGCCCGGGCAGGTACTTTTTGCCATGTCAGTNATGACTCC 

AATT TTCTGTGTGCAAGAGCAATCACACGGAAGCCCTGTTTAGTGAAGTCTTCCAAAACG 

TTTTGAAAATCGACAGGAACTGTTTCAGGTTTACAGAGACCGGCAATGGCCTCGGGCGCT 

CCTTTNATGTAGGCNTCCATTTTCCTATCCCCCAGCACCCTGGCAACCACACTCATACGT 

TGCAAAGCAGAAGAAAATGGGAACTGGCGAACAATTCCTATCTCATAAGTAGCTGGAAGT 

TCAAACAGCTCCATTTCTTGGTTTCCTGCAGGGGTAGATTCAGGAAGCAGTTGTTTGGGA 

GGACGAACCACTGTGGGCA 

Sequence 106 

GCGGGCAGGTAAAAAAGAATTAGTAGATACTAAACTACTATTTATTCTTGCCGGTATGGG 

CGGAGCAACAGGGACTGGAGCTTCACATATTT7TGCAAAAGTAGCCAAAACTCTGAAATC 

TTTAACTATAGCTATTGCTATCCAACCTTTTGATTTTGAAGATAGTAAAAGGCTTTCAAG 

AGCTTCTGAAGGAATTAAAAMCTACAAGAAAATTCAGACGCGCTAATTGTAGTTTCCAA 

TTCTAAAATCGCGGAGCTATACMCGGCATAAGTATTTCAGATTCTTTTACAAAAGCTAA 

TCAMTTATTTTTGATATTATTTAAACTATTATTGATTTAATTAG 

Sequence 107 

NNNAANCTCNCCCNCCAAACCNNNCCCGGGNNNNNNNNGGGGGGGNTTTTNTTGGGGGGA 

NGGGGNNNNNNCCCNGNNNNNNNNGGCCGCCCGGGCAGGTACATTCNCAAGGGTTGGAAC 

CCAAGCCCCACCCTGGGTTTTCTTAAGTTCATTATTCCCCCACCCCAGNAATTCCCTTGG 

GAGTTCCTTGGCCAGAAGCCATCAGAGACAGCAGGCGAAAAGCAGGGCTTAACTGAATNC 

CATATTGGGG 

Sequence 108 

AGGTACATCCCGAAAGACAGCAAAAAGAAGAAGCACCGAGCTGAAGATTACTCAGCAGGG 

CACGGACCCGCTTGTTCTCGCCGTCCAGAGCAAGGAACAGGCCGAGCAGTGGCTGAAGGT 

GATCAAAGAAGCCTACAGTGGTTGTAGTGGCCCCGTGGATTCAGAGTGTCCTCCTCCACC 

AAGCTCCCCGGTGCACAAGGCAGAACTGGAGAAGAAACTGTCTTCAGAGAGACCCAGCTC 

AGATGGGGAGGGTGTTGTGGAAAATGGAATTACCACATGTAATGGAAAGGAGCAAGTGAA 

GAGGAAGAAAAGTTCCAAATCAGAGGCCAAGGG 

Sequence 109 

AGGTACAGCCTCTCGGCCCGGCTAAACATCATCGTCTTGGTAGGCCATTACCCTAGCAAC 

TAACTAATGTTCCGCACCCCCATTTTTAAGTGAAGCTGTGAAGCTCCTTTCTATTACTCA 

TCATGCGATAAATAACTATATCCGGTATTAGCTATTGTTTCCAATAGTTATCCCAGTCTT 

AAAGGTAGGTTAGGTACCTGCCCG 

Sequence 110 

AGGTACCGACCGCTCCGAGATCTGTATGAGTTGGAGGCAGGCCAGTGTGAAGGTGTGGGA 

GGAATCCGCCCACACCCAGCTTCATACAGCACCCTGAGGACAATGTGGCTTCCCTGATTC 

ACACCTACTGAGCCAAGGCCCCCTCTGAAGTTAGGTCAAGAGGGCCCACCCTAGCCGGGC 

CAACTCATTCCTTTGAGGACCTGATACTCAATGAACTGCTTACTACCAAAAAAATGCAAA 

GGATTCAGCAAAAAGTGCAATTTCCATTTGAGAAGTAAGATTCTCCCCAAAACATGACAC 

CACCTTTTCATCCTTTCAGTACCTGCCCGGGCGGCCGCTCTAGAA 

Sequence 111 

CCGGGGAGGTACTACCATTTAGGAAACTGCTATAACACATAATTTCATGAAGTAACACCT 

AATACGGTGTAGTTCCCTGGTCATATTTTATACAATTCAACCATATAAAAGGGTGTCACT 

GTAATTTCAGTAGTGTGGGTTTACAAATAATCTGCTGGTTAGCTTATTACCTTGAGGTTT 

TGAAAAACTAGAATTATATTGAGGCATTTCATAAACATATCTCTTGCACCCTCTTCATGG 

TGGAGTTAAGGATAACTTGCAGGTGGTTGGCCAAGGCCCAATATAGATGATTATAACATT 
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TAGAATTGGCAATTAGAAGTTGATAATCCATATAGGACCATAGG 
Sequence 112 

AGGTACCCACCTCACATTCTTTMCACTTAAGGTTTTTCTGGGTAGTAAGTGCAATACAT 

TCTTATTATAAAACAATATGGACAGTTCAGTATGTATAAATGAGGATAAAATCAAAATCA 

CCC ACAAT CTCACCACTTTGTGATMTMCCATTMCTTCAATATTGATGAATTTCCTTG 

MTGT TTTAT CTACAATAT7TCTTTCATATAGTTGATATCACTCTGTATGCACAAACTTG 

TATCCTTTTTCTAAATCTTMTATCATAATATTAGCATTTTCTAA'I GTTAGTAGATGTAC 

CTGCCCG 

Sequence 113 

AGGTACTATGAGTTGCCACCATGTCATGCAACATAACATATTGCTTTCTAATGGAATATA 

GTGAGCTTACGATTGATACTATGGACATAACCACAATAGCTAGAGCATAGTAATAGTATT 

CATCAGTGCTCCACAGTATAACACTGAACAGCTGGAAAATGTAAAATGGGTTGAGAACCT 

CTTTAATTAGAAGCTTAAAAACAGAAGGCACTTTTACAGCAATTTCATTTACTCCAT^ 

GCAGTTTTCTGTAGGCATGCATCCCCTTTGTCAGTCCTGCACTATGCTTTTCATAAATTG 

ACGTACCTGCCCG 

Sequence 114 

AGGTACTGCACTGGTCATTGATAAAGATGTATTCAAAGGCTCACTTAAAGACTTCGAATA 

CAAGCTGATGGGAATCTGATTACTATTTTCACTGCTTGGAGCTGATCCAAATTCAGAGAG 

AGAAGGTTCTGACCCAAATTCCAGAGCCCCAAACTGCACATTTAATCCTGTGACATCTGC 

TGAACCAGGCATTTCCACTGCAGAAGCTGGGATCTTAGAAGCTGGGGGTATCCGCCGCTT 

AGCAAGTTTGATGTGTTTGGGCTGTGGCTGGTGGACAGACACAGAGATATTTTCAATGGT 

CGTGCTGGGAAGCTGCAAAAGCTTGTTCACAGTGGAGGGACTGTC 

Sequence 115 

CCGGGCAGGTACCTGGTTCTACAGATGGTTCTCATGCACAAAATTTCAGAACCACATTGT 
AGAAAAGTAAAGCAGTATGACATGCTTTGGAAACTGCAGATAATTTAGTGCAACTGTATT 
ACAGGTTACAGATAATAAGAGATGAATCTGGAAAAGAAAAGAATGTTATGGTACCT 
Sequence 116 

AGG I I I I I I I I CACTNNCGTATTGTTTATGGAAGAAGAGATTAGAGGACAATACAAGTAG 

CCACAGCTATGATGCATGGAATACTACAGAATATGGTGAAATGCTATGTAAGGGCTAGAA 

ACAATTCATTAGGTGTACCTGCCCGGGCGGCCGCCCGGNCAGGTACATGCAAATCGCCTG 

TGGTAGCCATAGTCACAGGATGTGTCTTCAAGACAGAAACTTGCTTTGTGGCCCTCAGCC 

ACTCTCCTNTGGGTGTTGNCATCAAGCANGTCATAGAGNCTAAACTCATCCATACTGTGG 

NAATG 

Sequence 117 

CNGGGCAGGTACCTAACCTACCTTTAAGACTGGGATAACTATTGGAAACAATAGCTAATA 
CCGGATATAGTTATTTATCGCATGATGAGTAATAGAAAGGAGCTTCACAGCTTCACTTAA 
AAATGGGGGTGCGGAACATTAGTTAGTTGGTNGGGTAANGGCCTACCAAGACNATGATGT 
TTAGCCGGGCCGAGAGGCTGTACCT 

Sequence 118 ' 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAACTTCCTTATTTCACAA 

ATTTAAGTTTGGTTTATATATTTTATTGACATGGTTACTCAATGTCCACATCATTCCATC 

TGCATCGTCTTCCTACAAACAGTTTTTCTTCTACTATTCGGTTATTTCTCC I I I II I I GT 

TTCCTATTTCAGAATCAAATTTATTTTACTTGCAMGTCAGTGGAATATGGTTTGGAACC 

AGTAGGGCCTCTAACTTAAGCCCAGAACCTGTCAAAGAGAAGTGCAGTATCATTGCTAAG 

ACTTGAACAGTTTATCTCTCAGAATCTTCAGTTCCTTTGAATTTCTCAGCTCTTAGTGTA 

ATCTGTTTTATGTGTTTGTTGNAGACTTCCATTATGGATAGATTNCCAAAATAANTTGGG 

TAATCAACTGGTATTTTAGCATTCTGGGAACTAACATGTTTAAAAATCTAAGGCTCTTTT 

TCATGCCAAGGAG ACTG A I I I I I GGCTT 

Sequence 119 

TTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACAGTAATATCATGAAGCTGC 
TGTCTGCACTTGTCAATATCCTGTMGCCAGTMCAATAAAATTCAGCTTTTGGTTGAGC 
CCGGCCTGGCTGCTGGGCTGGAAGTCACTGACGATGATGCCGAGCTGCCGAATGTTCTCC 
ACGAACTTCTCCAGGTGCTCCTCTAGGTGGTCAAACTTCTCCGCCACCTGCCCG 
Sequence 120 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGGAAGAAATTGCCAAGGA 
CAAAGTTTTAAAAGACTTTTATGTTCATACAGTMTGACTTGTTAT^ 
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AATAGACAATATGGCTCCTAGTCCTGGTCATATATTGAGAGTTTACGGTGGTGTTTTGCC 

TTGGTCTGTTGCTTTGGACTGGCTCACAGAAAAGCCAGAACTGTTTCAACTAGCACTGAA 

AGCATTCAGGTATACTCTGAAACTAATGATTGATAAAGCAAGTTTAGGTCCAATAGAAGA 

CTTTAGAGAACTGATTAAGTACCTGCCCG 

Sequence 121 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGATGGGTCATTTTTCTTGTATATTTTCTCT 

TGATTCCCCTC CTTCATGGC TGC ATATCTGGCTAGCGTG AAGAGATAGTCACTG AGTCTG 

TTTAAGAACTTGGCCACGTTCGCATCGGTCTCTCCCATCTGGACAAGAGGCACCACACGT 

CTCTCGGCCCGGCGGCACACGGCCCGGCAGAAATGCAGCGCCGAGCTGATCTTGCCTCCC 

GAAGGCAGGATGAAGGCCGTGAGTGGTGGGAGCTGGCTGGTGTACCT 

Sequence 122 

GAAATAAAAATTAGCAAACAATTArrCTAGGGATATTTTCAGATTTTA 
AAATGCG NGTGCCA TATGCAATTGCATTTCTTGTGCCAAGAAACTAATAGAACTTATTTC 
ACTTTA CCTl nil 1 AAAATGTGAATTTAGTTATTATAGTTTCAATTTTATGGCCTTACA 
GATG GCTTT TATTTTGTTTGCAGCNTGACACTGCAGTTCCTTTCATGCAAMTACCCATA 
AACTGTTTTGATGGAAAAATTCATTGCCCCTTAATGGGAAM 

ATTATAAACTANTTCCCTACTGNT ACCC TGCCCCCGGGGCCGGGCNCGCTTTTCGACCCA 

AACCATTNGTNGGGTNGMGCNATTTTCCCAACGGGNGCCGCCAATTGGAAANGTCCTNG 

GGGTNGCCTTGGTGCCTTNCCGMGTCNTNCTTGGNAAATAATTTTTTTGGATANGGGAA 

AGCCGGACCANAGGGAAAAAAATTT 

Sequence 123 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAATAAAGTT 

CACCCACCCTGCACTTTGGCCCTTAGATCAATCCTAAGTAGCCATTGCCAGTAGGCCAAG 

TTTAATCAGAGGACAGTGCCTACCAGTAAATACTGAATAGTTACAATAGTTATGTCCATC 

CAACCAGTAGCAGATGAACAGCTAATACATCATGATGCTATGCTCTCCTAACAGGGTCCC 

CTCAGATCCTCAGTGAGCACATAAAGAAAGGGAGGTCATATCCCTTACATCTCTACCAGG 

TATTAACACCTAACTACTCTCTAGCCAGAGGCAATTCCCTTTATTTCCTTACTCTCGTCG 

TCTTCTCTTTAGCCCAATCTCCTGACAATAGTTAAAACAAAAAGACCCCCAAAATATCTC 

TTGCTAAAACAGAAGTAGTCCCTAAACTCTCTCATCTTAGACTACTGTCAGGTACCTCGG 

CCGCTCTAGAACTAGGTGGATCCCCCGG 

Sequence 124 

CCGGGCAGGT ACAAA G CACT G GAATT GGGGAAATAGCAGGGTGTTTCCCCCACAATTAGA 
AGCAGTGTTGCTTTTCATTTTCCTmACTGATTAG 

GAATTTTTCAAAAACAGCATTTAGGGTCCACATTTATTTTAATTCTGATCTTC 

TAATTGGGTGAAACTTATGGTGAAAAAATATGCATAGTTACTTTTGACATAGATTTGTTT 

AAGCATGAAAGCTAGGAATTGATTAAAACCAATACATAATATTTAGTTTTGTGTTACTTA 

GTTTCTTTGTAATAGTGTGTAGAATCATGTGAATTACTG 

Sequence 125 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTAGTTTAGTTTATTTCTAG 

TTCAAAAATAATCTGTAATTGCTGTAAGAAATGTCAACCACTTACCTAGGATGTTTGACA 

ATTGGGATGAAGTCTACATATACTAAGNAATGGCAAGACAATTATTTTATTGCTCAAAAG 

AAAGTCAAAAAAATTCCATATTCCCTTTGGGGAAAATTGGCAGGATTTCAAGTATGACCT 

TTAAGAA TCAG GAAAAGACTAACTTATGCTTTAGGATTAAMCAATCAAATAATTAAATT 

AGTTCAATTTTCTMCATAGTCTCTATCTTCAGTTAAAGTGCATCATTGCATGTTATACA 

TTACTAAAATTACACAGTGCATAATTGTTACCATGTGACTATTTAATTCAGGGTCAACTG 

TCTAAAGGTCTCAGGTGTCACATTAAGGGTTATAATCTAGATGAATTGGAACAAGAAAGA 

AAAGATTTCT 

Sequence 126 

AGGGCGAATTGGAGCTCACCGCGGTGGCGGCCGAGGTCGNCCGCCTGACCTGGGGCAAGT 
GCTTTCAACTCTCTGAACTTCGGTTCCTCATCTGCCAGTTGGCAGACGCTCAGCAAATCT 
TCCTAGACTCACGGGGCGAATTGTACCTGCCCGGGCGGCGTTAGAACTAGTGGATCCCCG 
N 

Sequence 127 

CTTAGGGCAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATTCCCAACAGCATGTCCT 

TCCTGGTTCTCTACCCCCACAGCACTTCTTAGAGCAGAGGCAGAGCCCAGAAGCTGTGTG 

GGTCACAGGCAAGAGCTGAAGTAAGACCTGCAAGAGGCGGCAGGGAGCTAACTGTAGCAC 
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GAGGACAAAAATGAACACGGTAATACTGAGGTAAATGAACACTCAATTCATGTGGAGGCT 

GTAAACGTCCTGATGTCACCTCTGCCTCAAGAGCAGAAAATGTGACTGGAGTGGTTACAG 

GAGGGGCCTGCCAGACCCCTGTGGGAATACTACATCTGGGACACCTCAATCAAGGAGGCA 

AGAGAGAATTTCTGGCCACTGGCAAATGAAGCATACTGGCTTGCAGGGACCTTCTGATTT 

CAAGTACCTGCCCGGGCGGCCGCTCTAAAACTAGGT 

Sequence 128 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCCTTGCCGGAC 

TTTCCCTTAAAGAAGGAGAGGATCAGAAAGAGATAAAGATTGAGCCAGCTCAGGCTGTGG 

ATGAAGTGGAACCTCTACCTGAAGACTATTATACAAGACCAGTAAATTTAACAGAGGTAA 

CAACCCTTCAGCAGCGTCTGTTACAGCCTGACTTCCAGCCAGTCTGTGCTTCACAGCTCT 

ATCCTCGCCACAAACATCTTCTGATCAAACGGTCCCTGCGCTGCCGTAAATGTGAACATA 

ATTTGAGCAAGCCAGAATTTAACCCAACGTCAATCAAATTCAAAATCCAGCTGGTCGCTG 

TCAATTATATTCCAGAAGTGAGAATCATGTCAATTCCC 

Sequence 129 

CCGCGGTGGCGGCCGCCCGGGCAGGTACATAGATCCTGATCTACTGGCAGAGCTCAGCGA 

AGAACAGAAACAGATCCTGTTCTTCAAGATGAGAGAGGAACAGATCCGACGATGGAAAGA 

AAGAGAAGCAGCTATGGAAAGAAAGGAGTCCCTGCCAGTGAAACCCAGACCAAAGAAAGA 

GAATGGCAAATCGGTTCATTGGAAACTTGGAGCTGATAAGGAAGTCTGGGTATGGGTGAT 

GGGCGAACACCATCTAGATAAACCCTATGATGTGCTCTGTAATGAAATTATTGCTGAGAG 

GGCTCGGCTGAAAGCAGAACAGGAGGCAGAAGAGCCCAGAAAAACTCACTCTGAAGAATT 

CACCAATAGCTTGAAAACAAAATCACAGTACCT 

Sequence 1 30 

CCGCGGTGGCGGCCGAGGTACATCATATGCCTGCTGAAGTGCTCTGACTTTAGGATGAGA 

AACTCTAACATAGGCCGGAAGACAAATAAACCATAAACTGTAACAATGACTAAACAGACA 

CTTGGCCCACTGTGGTGGATTTGTATAACATCTCTTCGCCAATTTATGAGCTGTTTTTAT 

TTCCTGTTTAGTTCTCTTAGCCATGAGAGGTGGACTCTTTGACCTGCCCG 

Sequence 131 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAGTGTTTGTT 

tcctaatatttattaaccaccttataccaaatgtcttgcaaagaaatgttattaaaacct 

tgaattrrtacaaatgtaaaaaacaaaaagtgtattaatgtatttgttcaggaaaagcta 

cataccgaagggcttttgtatatgaattctgtggtggggagacccatttgtaatctatat 

ggcagttccatctgggttttaagtttagatttcaccgtgtcttanctgcttcattctatt 

ggtttattggaacatgtaataaataggagtagtgatgtattaaaacacangnnttcatta 

atgntttatatcttcactaamttcw 

atttcagtcagaagtgaaaa 

Sequence 132 

CTTAGGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTACCAAAATA 

GTTCAAAAGAGTTCTGTATTATCCTTAACATAGTGC I I I 1 1 II ICCTTCCCCTTCTCTTG 

GGGAAGCTACAAGACATAATCCTTCAGTGCTTAACTGGAAAGAACAGTCAATCCTAAATA 

TTAMGAATCTTATGCTGTTGAMGTAAATGAAAAATTTCGAGTTATTTGTGGCATA^ 

GAGCAACCAATATTTATTGCATTTAATCAACTGACTTTAAAAAATTACAATCT^ 

TTCATTAATTTAGCATGTGAAATTTAATTAGAAATACTGGGTACCTCGGCCGCTCTAGAA 

CTAG 

Sequence 133 

ACTATAGGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACANGAAATATTTTGTA 

ATGTGTTGATTTTAAAAAAAATCTGTAAATAAAGTTTTAAAAAAAGAA 

ACGCTGAAATATGTAGATATTTTGCTATTTATTTAAAGGAGTATT TTAAG AGATATTGAA 

CTATCTGAAATTGACCAGTAATCAAAGTTCCAATCATCTGAATGCTTTTCCTTGAGGTAG 

AATGTGAGTCTCAGAAATGACTGCATTACCTGCCCTTT^ 

ATTTTGCAGAACAACAACAACAACAAAATTGTGCCTT^ 

GAGTTTGTTTCTGTCTGACCAAAAGCAACATTTTTTGCAGAAAACAGGNGGATGGTATTA 
AATACTGGTATCATACCAAAAACCTGCCANGGTGGTATATAGGATGCTTTCTGTCATACT 
GGTGGTTTTTCAGATGCAGAATTTT 

Sequence 134 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAAAGCTTATTCACATTTTTAC TAAATC CAACA 
CAACTTTCACAAATGGCAAAATGATTGCCTCTTCAAAGCAATGCAGCCTAG I I I I IGGTG 



Page 1 8 of 635 



WO 01/51628 



KTT/US01/00798 



Table 1 



GGTTCTGGTCACTGCTTAGCTAAGTCTTTGTTGGGCAGAGTCCTGGCTCCACAGTCTCCT 

TCGATGGGCTCCTCGATACACGAGGCTTCATAACATGCGCTCTTTGAAGAACCATTTCTG 

ATGATCCGAGTTGGTGCAGTCTCGTAAGAGTGGAACGAAACTGTCACTCGACTCCTTCCT 

CGCAGCCTGGACACATTTCTTGGACTGTTCGTGAAATAAAAGATCCATCCTCCTGCAAGA 

TGAACTTCTGATTCTCTGGGGGCA 

Sequence 135 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGGCCTCTTTGAGTATGA 

ATGAACTTGGAAGAATAAAGCCTTCCAGTTGGATGAAAGCACATCCAGCCTTTTGTGAGT 

GACAGCTTTGTATATAATCTTATGCAGAGCTGATATGGATTCCCTCCAGGCAGCTCAAAT 

CTGGAAGTTGTAAATGAATGGCTATGCCACCTTGGAGTATCACCATAATACATCTGTGCT 

TTAGAGCTGATATACAGATGTGAAACGATCGAACAACATGATTTCTCATTCTAGTGCTCC 

TTAGAAAGGAGTTCTGATAAGCCCCAAAGCAGACCTGGGTTGGAATCGTGGTTATTATTC 

AAGTACCTGCCCGGGCGGCCGCTCTAGAAACTGGTG 

Sequence 136 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCGCTATATCCG 
CCAGTTGATGACTGTCCATCTCTAGCCCCTCTAACCATGCTATGCACCCAGTGAGTGCAG 
AATGAT TACTA GTAATTGATGTTGACTTTACAATTCTATGACATGTAATACAGGATGCCG 
AGGCACTTTTAAAATTCCACCTMCATGTTCTGG 

A ATAC TGGTTAATATAGGAAAAATTATGCTTMGCACCACTAACTTAAAAGTGTAATTTC 
ATTTTCTTGAATTTAGGTAAAAGCAAAATTCTCAAAATTCTATTATGGCACATGGTTTCA 
CAGTTTCAACCT 
Sequence 137 

CCGCGGTGGCGGCCGAGGTACATACACATACTCTAGTTATGAGCACCTAGGGTCTTTCCC 

ATCACTCTTGCAAAAGTGTGTGTGNGTGTGTGTGTGTATACATACACTCACACATACATA 

TGTATACACAGACATCTATATNCANANATAATGTATATGTGNGTCTAATGTATATATAAT 

ATATACACATAGACATACACACAATGCACATATCATAGCTCAACCTNATCTAAACATCTA 

GAGTNTGGAGAACGTGCTGGCCAACAGAATACATGTGCATGTGGATGTCATGTGACATGC 

ACTAACCAACCATCCNGAGGNNAAACTGTTGGGAGGAGGAAACTGCCCACAATNTTTGAA 

AATCCAGGCCTACCAGGCCATTATTT 

Sequence 138 

CTATAGGGCGAATTGGAGCTC CCCGC GGTGGCGGCCGAGGTACCAGNGCCCNTTTTCAGA 
C AGTTNTNGATT CGCTCTAGACi IlilUHlllI I AATAGGGAGGG AAAAAATTTGATA 
ATTTTCTTTTTTCTACAT 

TTCC TTTTTC CGCTTTTCTTCCCGCAGAGCCTGATGGGAGAATGTCCAGGGCAGGGAAAC 
CACATTTTTTGTAGGTGATAACTCAATGAAAATTGGTGCTTA I II I I I ACACTTCTCTCT 
TGTGGCTCTCTTGNGGTGCTATCTGTTTTAAGGTCTCCTTGAAGGCGCACTGGGGTCCCT 
GGCCATGCCTCGTTCTCCCTGCTTTCTTTA 
Sequence 139 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGTGCTATAGACGCACAAAC 

GAC CGCG AGCCACAAATCMGCACACATATCAAAAAACAAATGAGCTCTTATTTTGTAAA 

CTCATTTTGCGGTCGCTATCCAAATGGCCCGGACTACCAGTTGCATAATTATGGAGATCA 

TAGTTCCGTGAGCGAGCAATTCAGGGACTCGGCGAGCATGCACTCCGGCAGGTACCTGCC 

CG 

Sequence 140 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTTTGATAGAACTAAGGAAATAGTGGTTTTGAG 

TGAAGGGAAAGGAAACCCAGAAACATTTTACGTTGCTTTTACTTCTGTAGTGTAGATTGC 

CCCGGCCCCTCTCTGAGCCCTGTAGCATCTGTGATAGCTTNTGTCCCTTCATCGGTTCAT 

GTCACAGGGATTTTCTTTCCCAGGAAGCGGACACGGAGAGTCAGCCCTAATAAATGAGCA 

CATGCCCTGGC 

Sequence 141 

GACTACTATAGGGCGNATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGATTGTTCTCA 
AGAGGGCCATCAGAAGGAAGCCAAAGAGTTCACAGCCTCAGCACCAACAACTCAACATGG 
TCATCATGTTTTCTATATGGTTTTTCCAGCTAGCAGTACCTGCCCG 
Sequence 142 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCCTGAGAAGCATGGGGCAGTAGAAAGAGCATG 
TGGGCTTTAGAGTTCAAACCAAGTCAGGCTCAAACATAGTTCTGTGATAAGCCCTGAGCA 
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AGTTACCCGGGTCTTCCATTTCCCCCTTCTGGAGAAGTCCTTTGGAGGATGAGTCCTTCT 

GGAGGATGAGTCCTTCTGGAGGATGAAGTCCTTCTGGAGGATGAGTTCGTTGTAAGAATA 

AAATGAGAATGTAAGACACCTAGAGGATGCCCGAGTAAAAAATGACAGTTGCTAGTAGTA 

GTAATTTGTAGGGCTCATTATCTAGAATAATTTTGTTTGACGTTACTAATTAAAATGAAC 

TCTTAMGAAAAGCAGTGTATTTAGACTCTTGTAGTTAAGAAAAATTACACCACAGAGCC 

CTTTTTACTTTTTAAATTCATTTTT^ 

Sequence 143 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTATATTAAGTGGT 

TCTATAAAAGCTATTCACAAGTTCTACTGTGATGGACATCCTCATCATAGACAAACCTCC 

CCTGTTTTATCCTCAATTTCTAGTTACAGAAATTTGGTGATGCTTA7TTTTGCCM 

ATGTCAAAATAAGNTAAAACTTCCCTCCTGTTCACCTCTTGGGTCTCTATCCTGTGTAAC 

CTNTGGTGTAGTATTTGCCCATAGGCAACCAGAGCCACTTCCTCTGAACCCAACATCTNC 

TGGGGACCTTCGCAGCANGAGGAAAGCACTGAGACAATAGCTTGCTAAGCAGGGGCCCAG 

NGGTGTCTCAAAGAAACCATGGNTGTNCTCGCCACTTCCCAGGGNGGGTGANGNGAGCTC 

GGGAACATAACGATGGTTTTG 

Sequence 144 

TNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAGATTATG 

CTTACCTCAGGTTTCTTTAGTGTGCTTGAATGCCCTTCTTTCCATATMCACTTATCTTC 

TTCTTMTTCGGCAATGGMTATCTTTTAAGTTTTAAAAAAACTGGAATAATTAT 

TCI III 1 1 GCCGTTTATATTTAGGGG I I I I I GTTG AT AAAATC AAGTCTTGGTTGTGGCT 

TGCTGAATTAAATATTTATGAGTGGTGCATTTTTAAGTATAGTGAACAAGACACCATATT 

AAGTACCTCGGCCGCTCTAG AACT AG N 

Sequence 145 

CCGCGGTGGCGGCCGAGGTACCTGCCGGAGTGCATGCTCGCCGAGTCCCTGAATTGCTCG 
CTCACGGAACTATGATCTCCATAATTATGCAACTGGTAGTCCGGGCCATTTGGATAGCGA 
CCGCAAAATGAGTTTACAAAATAAGAGCTCATTTG I llll I GATATGTGTGCTTGATTTG 
TGGCTCGCGGTCGTTTGTGCGTCTATAGCACCCTTGCACAATTTATGATGAATTATGGAA 
ATGACTG GGACATGTACCTGCCCG 
Sequence 146 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAACCTCATCAATGT 

CTCTAAAACTAGGTCTTGAATCTAAAATGGAATTATACATAGAACTATTACATGAAGCAG , 

TTCTTTCACTACAATCATTTTTAGCCCCCCAGTGACCAATCTCCTCAGGTAAACAATGTC 

TAATGACTGATGATTGTTTACTGTATGCCAATTACTGCTAAATCCCTCATATAGATTATC 

TCATCTATGAAGACACAAGGATTATTAATAACCCCACTTCAGAGATAAACAGACACAGGT 

TAACACATCTCGCTAAGACCTNTTGTAAGTGAAAAACCAAGTTCAGAGCCCAAAAAGTT 

Sequence 147 

CCGCGGTGGCGGCCGAGGTCTGTATCTCATGTTTGCTGATGGCAGTTTGTGGGAGAGGGA 

ACCAT7TCAGCAATGATTTCAGTTAGCAAATAGCTTTGCAGCTTTAACAGAACTCTCTCT 

GCTTGCI im IATTTCCTGGGTTTTCTGGCCAGGCGCGGTGGCTCACGCCTGTAATCCC 

AGCACTTTGGGAGTCTGAGGTGGACAGATCACTTGAAACCAGGAAGCCAGGAGTTCGAGA 

CCAGCCTGGCCAACATGGTGAAACCCTGTCTCCACTTAAAAAAAAAAAAAAAAAATTAGC 

CTGTAATTCCAACTACTCGAGAGGCTGAGGCACAGGAATCGCTTGAACTTGAGAAGCAGA 

GGTTGCAGTGAGCCAAGATCGCACCACTGTGCTCCAGCCTGGGCAACAGAGCAAGACTCC 

ATCTC AAAAAATAAAAATAAATAAAAAATAAAAAAATA * 

Sequence 148 

CCGCGGTGGCGGCCGAGGTACACTAAGTTTGCAACATTTATTGAGATCTAAGTCTGTCTT 

GCCTTCATTTCTCTTTTTATCTCCCCCTTGCCCTCATTCTTGAACAGCTGGAGGAATACA 

TTTTATTCTGTCCATGAAGCATACACTATGAAATTCAAGTGCTTAAAAATACTTCTATGA 

CTCTCTGCTATCCCACTGTATAGATCCACAGGGAGCAAACACTTAGAAATGATAGAGAAC 

TGMGGAGATCAATGGTTTAACAGTTATCCATGCCMGTCCCATTGTCAGAAATATTCTT 

ATTACTCAGTCAAACACTCTTTGAGCTTCCCTTCCTAAAGGTAACCATTCCAGTGAATAG 

ATGTGCCCTTTTATAAGGAAACTTCTGATGTTTATTAAAAAAAACTGGCCTTrTGATAAG 

AGGG AACTTAATTTGGGAATTTG GT G N GTTG N AATGGC ATTTAATT 

Sequence 149 

CCGCGGTGGCGGCCGAGGTACAGAAGTTAGAATTTTTGACTCCAGGCAGCAGTTTGCTCA 
GTGATCTTGAACAAGTTATCCAATTGCCTCTACATTTGCATCAGTTTCTCTAGCTGCAAA 
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AT GGGG ATAATACTATATACCTACCTCACAGTGGGAGGGCAGGAGATTTTGAGGCCCTGA 

GGTTTTAGGTGGGCTGTGAGGGCCAACGCTTGACACAAAGTCCATGGGTTATTATTCAAG 

AATGCACAGGCCCATCGGCCTTTTAGAAAGACAAGACAGGGAGTGCTTGTTTGATATTTC 

AAGG AATAAAGCCGGA GCTCCTGAATTGTAGTCCACCTTAAAAGAGAGACCTGTATTGGA 

GAATATTTTATTTTTTTGGCAAATTTGATCTTAC^ 

AAAGC 

Sequence 150 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGGGGGTTTGAACGTGAA 

CGTATCTTTCTTGGGAGATGCGATTCMGCCACAAGAGCATTTACTTCTACTGTAAGAAG 

AAAACAMTCGAGTTTAAATCCTATGTATCT7TTGTTTCCAAACCATGCTGCAGAAACTA 

AAACTGAACTGCACTATATATATTTCTAGGTGTCTGAAGTTTTGAAAGAGCTTCCCAAAA 

CTTTMGGTAGATCCCTAGTTTCATCATAMTGTAACTAAAAGAATTGGCCTAAATGATT 

TCAAAGATCTCCACCAGCTCCTGGTTAAAAACTGCAAAGCTATGATGTTGTGGGCTTGTA 

AAAAAAACAAAATACACACACACACACACACACACACACACAAACAAAACTGCAAAGCAC 

AGTTGGTGTGCACCTGTAATCCCAGCTACTTGGGAAGCTGAGGGGAAAGGGTCACTTGG 

Sequence i51 

CCGCGGTGGCGGCCGAGGTACTCAGGTGACTTTCTGGTTAAAAATATTGAAGACGGATGA 
CAACTGGGCTTTTTTTACTTTGACAACTGAGACAAMTGACAMTTGTCAGTGTTCAGAG 
ATCCAGACCAACTTCTCAAAAAAATATGTTTACCCCTGATATCATCATTATTTTAGCCCA 
ACTGTGCCTTrTTGGGGGGATCACMCT 

Mill (GGGCCCTTAAAGCNAGGGGGTTTAGGGNANNATAACNCCCCCTTNTNNNNGANN 

GGGGTTNGAAAAACCCAACCCCCCTTTTTTNGGTCCNCGGGGNGGCCCMNNCCCTTTTT 

TNTTTCNTTAANGGGGGTTTCCCCCGGGGGGGGGGGGGCCTNTTTTNTNGGNNGNNGNTA 

AAAAAAAAAAA 

Sequence 152 

CCGCGGTGGCGGCCGCCCGGGCAGGTACATTATTTTGATGGACGAAAACGGATGATCTTG 
AGCACTATTTCATGGATGGGAGGAAAAAATCCATTTTTGGGGATTGCTTACATCGCTGTT 
GGATCCATCTCCTTCCTTCTGGGAGTTGTACCT 
Sequence 153 

CCGGGCAGGTACCMTTAGAATGTCTTCAGTTATTAGTAATAGAGCATCCTAAATCAACT 

GGCTTAAAAATAAGTTTAGTCTCTCACAAAAGAAACAGTCCAAAGGAGGAGTGGCTACAA 

GGCTGCTTGGTTTGGTGGCTCAAAGACATCATCCAGGTCTCAGGGTCTTTCAGTATTTCT 

GCTCAGGCCATGATTAAACTATAATCCATTGCTGAAAGATGGTTACCAGGCACCACATCC 

AGACAAGGTACCT 

Sequence 154 

CCGCGGTGGCGGCCG CCCGG GCAGGTACCACGGTTGTAAAGCAATAAGATTTGAGATGAA 

CACTATT GAAA CTTCGCTTTTTGCTAAAAMTAGCAAGTTGAATAGTAATCAAAAAACAT 

AGAAAGATTTTAGTTCAAAATGATTGCTCCTTTCTCTACCTGGACTTrTAAAAAATCAAT 

TGTCATCTAATATGAGTTTATTTGTCTATAGACACAAGTATCAATGTCTAAAAAAAATCA 

TGACTTTAAACTTCCACCGATGAGGCAGGTAGGAGATAAAGATGAATTCTGAACTGTTAC 

TAAAAGTACGMCCAGCTCGTTATTAGATGCATTGTAGACAACATCGATGATCCTTGTTT 

TAACGAGTACCT 

Sequence 155 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAAGCTTA 

TTCACATTTrTACTAAATCCAACACAACNTTCACANATGGNAAAATGATTGCCTCTTCAA 

AGCAATGCAGCCTAGTTTTTGGTGGGTTCTGGTCACTGCTTAGCTAAGTCTTTGTTGGGC 

AGAGTCCTGGCTCCACAGTCTCCTTCGATGGGCTCCTTTGATACACGANGCTTTATAACA 

TGCCCTTTTTTGAAAAAACCATTTNTGATGATCCCNGTTNGGGCCCCNCTNTTAAAAAAN 

GGGAAAAAAAANTTGNGGNTTNNATTNNTTTTTTTGGGNGGCNGGGAAAAATTT^ 

GNNGGGNNGG GAAAAAAA AAAAACCCCCCNCNCCCNGNANAAAAAA A I I M H 1 1 I M I I 

TTNGGGGGGGGTTTTTTTNAAAAAAAAAAAAAAAAAAAAGNNNANNTNC'I rTTTTTTTTC 

CNNNNAAAAAAAAAANNNNNNNNNNNNNGGGGGGGG 

Sequence 156 

CACTACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACTCGCGATGCCCCC 
GAGTGGCCTCTGCCAGGCTGTCCCGGGCCTCTCGGCTTCCCGGGGACCCAGTGGTGTAGG 
CACGGACCATGTTGTAGGCACCATCCCGGAGACGCCGCTGGACACAGTATGCCTCATACA 
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GCTCATCGATCTTGCTGGCATGGAACTCCAGTCGTCGCAGGAAGCGTTCAATGGACTTGA 

CTTGCTTGTCCAGATCATACAGGAAGCCCAAGCGGGAATTCCTCTTGGACTCCCTTATCT 

GCCCCTGGAGTTTCTCTTGCTCCTGCTGGTGCACTTCCAAGTAGGCCGTCAGGCCCCGCT 

TCAGCGCCCGTGTACCTGCCCG 

Sequence 157 

CCGCGGTGGCGGCCGCCCGGGCAGGTGGCGGAGAAGTTTGACCACCTAGAGGAGCACCTG 

GAGAAGTTCGTGGAGAACATTCGGCAGCTCGGCATCATCGTCAGTGACTTCCAGCCCAGC 

AGCCAGGCCGGGCTCAACCAAAAGCTGAATTTTATTGTTACTGGCTTACAGGATATTGAC 

AAGTGCAGACAGCAGCTTCATGATATTACTGTACCT 

Sequence 158 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATNNAAATNATACGT 
TTTAATAAAATACATCTTTAGATCAAAGCTGAAAGAAGACATCAGTAGTAGATCAGAGNA 
TTCCATTT 
Sequence 159 

ACACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTGGGTCTGAAAG 

TCGATGAAGGACGCGATTACCTGCGATAAGCTTCGTGGAGTTGGAAATAAACTATGATAC 

GGAGATTTCCGAATGGGGTAACCTAACTGAGCAAACCTCAGTTGCATTTTGATGAATCCA 

TAGTCAAATTAGCGAGACACNGTTGCGAATTGAAACATNTTAGTAGCACCGGGAAAAAAA 

AAAAA 

Sequence 160 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTATCATTTGCACACAGCAGATCAATAGGTGTC 

AGTCACCAGCTTAAGTTACACTTGTCAATATTCAAACTTGAATAAAATAAACACACATCA 

CAACAGCGACACTTTGCACTATCAACAATGAAGCTTGCCCTCAACAATTATGACGTTACT 

GGTTTTAGTAACATAAAAATACATTGCTGGTTGACAGGAAGGATAAAAATGACATCAAGA 

ATCTCAAAAAGTTATGTTGAGGTGCCGGTCACCACATTTAGGCACTCAGGAATTAAAAAT 

CA 

Sequence 161 

CACTACTTAGGGCGAATTGGAGCTCACCGCGGTGGCGGCCGAGGTACATACATTTCTTAG 

GGACACAGNATACTGACCACATNACCACCCTNTTCTTCCAGTGCTGNGNGGACCATCTGG 

CTGCCTTTTTTTCTCCAAAAGATGCAATATTCAGACTGACTGACCCCCTGCCTTATTTCA 

CCAAAGACACGATGCATAGTCACCCCGGCCTTGTTTCTCCAATGGCCGCGATACACTAGT 

GATCATGTTCAGCCCTGCTTNCACCTGCATAGAATCTTTTCTTCTCAGACAGGGACAGTG 

CGGCCTCAACATCTCCTGGAGTCTAGAAGCTGTTTCCTTTCCCCTCCTTCCTCCTCTTGC 

TCTAGCCTTAATACTGGCCTTTTCCCTCCCTGCCCCAAGTG 

Sequence 162 

CTTAGGGCAATTGGAGCTCACCGCGGTGGCGGCCGAGGTACATCGTCTTCGTCTTCCTCC 

TGGGCGCCATCTGCACTCGAGAGGCCCTTCGCTATGACTTCCTGATAGCGGAAAAGGTAT 

CCCTCATGGGCTTCCTGTTCCTTTTGGGCTTATACATCTCGTCCCTGGCTTCCTGCATGG 

GAGGACTTTATGGAGCTCCCCGCATCCTGCAGTGCATTGCCCAGGAGAAAGTGATCCCTG 

CACTTGCCTGTCTGGGACAAGGGAAGGGGCCAAACAAAACACCCGTGGCTGCCATCTGCC 

TGACCAGCTTGGTGACCATGGCCTTTGI \ 1 1 1 GTGGGTCAAGTGAAACGTTCTGGCCCCC 

ATCGTCACCATCAACTTCATGCTGACAATACGTTGCAGTGGACTACTCTTACTTCTCCCT 

GTCCATGTGTTCCTGCAGCCTGACCCCGGTGCCTGAGCCGGTGCTCAGGGAGGGCG 

Sequence 163 

CCGCGGTGGCGGCCGAGGTACCACAGTCTTGCACATAAGTGCAGATTTGGCTCAAGTAAA 

GAGAATTTCCTCAACACTAACTTCACTGGGATAATCAGCAGCGTAACTACCCTAAAAGCA 

TATCACTAGCCAAAGAGGGAAATATCTGTTCTTCTTACTGTGCCTATATTAAGACTAGTA 

CCTGCCCG 

Sequence 164 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCACCTACAAGCTAGAGTGG 

TAAATTTCAGGAAAGCAGAAGTTAAAGGCAAAATTGTAAATCAGTCGAGATCGGGTGCCT 

TCAGGGTGGTATGGCTGTATACCAAAATTGTAAATCACTACATGAAGCTTATATATTGGT 

TTGGCCTGAAAGGTGAAGTGGGGTAGGCAGGGGGCGGGCTTACAGGTTATGGNGGATTCA 

AAGACTCCCTGATTTGTGATTGGNTAAGGAAGCAAAGCTTTGTCTAAAAACTTGGGGTCC 

GCANAAAGGAACATTAAGGTCTGGCCA 

Sequence 165 
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TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATAAATATGGA 
AGAGCAGTTTGTAATATGMTACATT1TCTCTAGACGAGATCACAGTTTTATTTTGTAAA 
TATTACATTTAAGTATATATATACACACATATATGTACCT 
Sequence 166 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAAACCTCAGGG 

TTATTTACGAAGCCAMGGACTTTGCTATATCAAGTAGTTCATTTCTTATCTAAGACCAA 

CTATAGGTATGATGCTACTGTATTCAGGCAATGCCGACTGGATTGGAACATGCTAATTTA 

AGGTGAGTTGGTACCT 

Sequence 167 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAGTTTTTGCTCA 

AAATATATGAAAGGAAAGATAATCAGTTCTTAAAGTTTGGTATGCTAAATAAGCACCCAG 

GGAACTTACC TGCTC CACCTCACCCCAGAAATTGATTATTCAGGTCTGGAATATGGCTCA 

AGGATTC7TAATTTTCAACCAAC ACAATATAG AATCCTG TCAGTTATCC TTGG ATCA7TT 

GCTAAGTGATCTTGCAAAATCAAAACTCCTTTGAGACTATTTCATTTTCAGAMTAT^ 

AAAACTTGTTCAGAAACATCTTAAAACTTGTTAGCTTATAGGAAAAAGTNCNTNGGCCGT 

TTNANACNNAGNNGGANTCCCCCGGCCTGNAGGATTTCANTTNAGNTTNATGANTNCGGC 

C ACCTNG NGGGGGGGGCC 

Sequence 168 

TACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACACCCATCAA 

GCCACCAGCCTGGCCAAACGGGCATCTTTCTGTCCAGAGCTCATCAATAGGAACTGGGCA 

ACCAGGTGGGGAGCC GCCAGGGGAGCTTGGAGAG GGCAGAGTAGG AGCCTCCATGAACC A 

AGTCAGGCTGGAAGGGGTCAGCCCTCGTGGTCCGTCAGCGTGCGGCGGAGTCCCTGTGCC 

TAAGTAGCAGAGCGGTAGTCATTGAAACAGGCCGGGATGCCAGCAAGAGGGAGGCGAGCA 

TGCAGTGAGCGATAGGTACCT 

Sequence 169 

CCGCGGTGGCGGCCGAGGTACCTTGAAGCAATATACTCAATGAGCTCTAAATCTCACATT 

CACTAGTGATCTGCAAGTGAAGCTGAATAAATATATTCTCTTTGTTCAGTCATGGAAACT 

CAACATTAGAAATGACATGTGGGAATAGATCACCTTCTCTCTAGACTAATATCCCATTTC 

TGACAGTGGTCTCCATGGCCCATTGGGGCTTATAAAGTAGCTGCCTTCTGTGACATTCTG 

CTTGAAGACAAAGGCTTTGCCCAAACATAACTCCTTTTAAAAAAGCACAACTTGTTTAGG 

CACACAGAGGTTAAGTGACTTACACAAATCCACCCAGGATTGGTGATTGGCACCGGTCTA 

ATCTACTGAA AAATA TAAGGCCAGGCMTTATATAGATATCTTTTAACAAGTTAGTAAAA 

TAGTGGGGACTTTTCTGGCTAAAAATTGGTAATTAAATGGGNATGGTTACCTCTGGTTAT 

AACTAAACAAACAAA 

Sequence 170 

TATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACGAACCATGCTCGTTATTA 

GATGCATTGTAGACAACATCGATGATCCTTGTTTTACGAGTACCTGCCCG 

Sequence 171 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATTAAATAATAACAGAAAC 

CAAGAGTTTGTTTCTAGTAGTAGAAGTATAAGTGGAGAGAGTGCTACATCAGAGAGTGAA 

TTTACCTTAGGGGGTGACGACAGTGGTGTGTCAATGAACCCAGCTAGGAGTGCACTTGCA 

CTGTTGGCCA TGGC CCAATCTGGGGATGCAGTCAGTGTCAAGATTGAAGAAGAAAACCAA 

GATTTAATGCATTTTAACCTTCAAAAGAAAAGAGCTAAAGGAAAAGGGCAAGTTAAAGAG 

GAAGACAACAGTAATCAGAAACAGCTGAAAAGACCTGCCCAAGGCAAACGCCAGAATCCA 

AGGGGAACAGATATTTACTTACCGTATACTCCTCCTTCCTCAGAAAGCTGCCATGATGGT 

TATCAGCATCAAGAAAAAATGAGACA 

Sequence 172 

CCGCGGTGGCGGCCGAGGTACCAACTCACCTTAAATTAGCATGTTCCAATCCAGTCGGCA 
TTGCCTGAATACAGTAGCATCATACCTATAGTTGGTCTTAGATAAGAAATGAACTACTTG 
ATATAGCAAAGTCCTTTGGCTTCGTAAATAACCCTGAGGTTTTGTACCTGCCCG 
Sequence 173 

CCGCGGTGGCGGCCGAGGTACCAACTCACCTTAAATTAGCATGTTCCAATCCAGTCGGCA 
TTGCCTGAATACAGTAGCATCATACCTATAGTTGGTCTTAGATAAGAAATGAACTACTTG 
ATATAGCAAAGTCCTTTGGCTTCGTAAATAACCCTGAGGTTTTGTACCTGCCCG 
Sequence 174 

CCGGGCAGGGTACCAGTGCCCCTTTTCAGACAGTTTTTGATTCGCTCTAGAC [TTTTTTT 
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illl I AATAGGGAGGGAAAAAATTTGATAATTTTC I I I J 1 1 CTACATGCACTTAAGACTA 
AAACACAGGTTTGGATTAATTTTATTTGCTTCC I I I I I CCGCTTTTCTTCCCGCAGAGCC 
TGATGGGAGAATGTCCAGGGCAGGGAAACCACA I Illl I GTAGGTGATAACTCAATGAAA 
ATTGGTGCTTA I I I I I IACACTTCTCTCTTGNGGCTCTCTTNGTGGTGCTATCTGTTTTA 
AAGGGCCTCTTGAAGGCGCACTTGGGGTCCCTGGCCATGCCTNGTTCTCCCTGCTTTCTT 
TAATCCTGGTATTGCCTCCACAAGGTCTGTTGCCAAGGACTCTTAAAGATCAATGGCACG 
TCACTTTTCCTTTCC 
Sequence 175 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTCTTGG 

ATGTGTTTTCTCACCAAGATGAGCAAAGAAAGGTTTGCACAGAGGAGTGTGAATGTGTGT 

TTGTTGCTGGCTGAATGGCAATAGATGTCTAAGGTGGATTCAGTGTCTGGCACACTGAGA 

CACCTCCAAGAAGGAGATTGATGCATCAGGTTCAGTTTAACCTGGAATATCTGACTACCC 

CTGAATCCACCCAGAAAGGGGGCCCAACACCCTTGTCCATTTATGGGGTAI II I M I iCG 

AAGTTATTAAGCATATTCCTTTTCCACGAACCTCTTCTGTACCTCGGCCGCTCTAGAACT 

AGTGGA 

Sequence 176 

CTATAGGGCGAATTTAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACACGCGTCGCAA 

TTCAGGATGGCTCTCTTGGATCTTAGCTTGCAACTCGGCCTCCAAGCTTGTCACAGACAT 

GGAACTTTTCTGAGGAAACAGGGCACAACACAGGGGTGTAGCAAACACCAAACAGAAGCC 

AACTAACCCAACTTGAATGGGTGCACTCATCCATGGGAACCTCTTCAAAAAGGCTTTCTT 

TTCCAAAGTGTTCATAATGAATGGAGGGATGGCCATGCCAGGGGCTGCCATGAGAATCCT 

GGACACGACAACTTGCGTGATGGCTTGTTTCGCAAGCGTTCGCCGACTCCCCCAAGCGGT 

TCCCATTCTCATTCCCGTGACGGGGAAT 

Sequence 177 

ACTTAGGGCGAATTGTAGCTCCCCGCGGTGGCGGCCGNGGTTACAAGCTATTATATATTT 

TCATAATATAGAACTTTAAAGTATAGTAATCAACAGTGGCAAAATATTTCTGTCTCTGAC 

ATTATTATTTGTGAGATCCAACCTGTAATTGAGATCAGAGAAAACAGTATGGGAAAACAA 

ATCCATGAGAACAGGATAAAAATATCATGAACAGCAAAATATGTCTGAGTATTTAAAACT 

GTATAGAAAAAATAAATTAGATGMCMTTGTGTAAATATAAATTAAAAAAGATGACATT 

CTTAATAAATAAAGTTTACTATTTAC A I I I I I ATATATTGCCAAAAGATTT 

Sequence 178 

CCGCGGTGGCGGCCGNGGTACACGAAGGTTAGAATTTTTGACTCCAGGCAGCAGTTTGCT 

CAGTGATCTTGAACAAGTrATCCAATTGCCTCTACATTTGCATCAGTTTCTCTAGCTGCA 

AAATGGGGATAATACTATATACCTACCTCACAGNGGGAGGGCAGGAGATTTTGAGGCCCT 

GAGGTTTTAGGTGGGCTGTGAGGGCCAACGCTTGACACAAAGTCCAT 

Sequence 179 

TGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGGTACTTCCTGAGGAGTGAAGCTGTTT 

TCTGCTACCCTTTCCTGTCTGGTGGCTCANGAAGTTCAGTGTCTCGGTGTCTGATGCAGT 

GAATACCTGGGATTAGAGTCTGGCATGCTCATTCCTTAGAACACAGGTTTATTCTCAGGA 

TGATrTATAGCAAGTGATTCTGGTTTCCAGTCTGGTTGAAAAAAGTGAATATTGAATGTC 

CGGGCCCAGTTTTTAAANNACTCTTTTTTTNTTGGGAAAAAAGCGGGGNAAAAACCGGCC 

CTTGGGGCCCCCTTNNNNTANGGGGNNGGGGGNCCCCCCCCC CCTnTN AAAAAANNANG 

GGNCCCCCCCCCGGGNNNGGGNGGANAANTTNAAMNAAAAGNTTTTTTCCCCCCCCCCC 

CCGGGGGGGGGGGGGNCCCCCCCCCCCCNCNI lllllllll.il I AAAANANNNGGGGNGC 

CCCCCCCCCCCCCCCANA 

Sequence 180 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTGTTTATAGACT 

CAAAACATGATGAAAATTCATTTCATTGTCTTCACTGAATAGACTTAACCTTGACAGTAA 

GCTTAAAGCATTAGTGTCATTTAATTACATGTCTGAGGAATTTATGCTMCTAGATTTTG 

GGCTTTrGCAGCGAATTTTAGTCGGGGCCTCATTCCTGAGCCCAAGTGACC CTTT CACCT 

CAGCTTCCCAAGTAGCTGGGATTACAGGTGCACACCAACTGTGCTTTGCAGTTTTGTTTG 

TGTGTGTGTGTGTGTGTGTGTGTGTGTA I rTTTTGGTTTTTTTI I AC AAGCCCAC AACAT 

CATAGCTTTTGGCAGTTTTTAACCAGGAGCTGGTGGGAGGATCTTTGAA ATCA TTTAGGC 

CAATTCTTTTAGNTACATTTATGATGAAACTANGGGATCTACCTTNAAGGTTTTGGGG 

Sequence 181 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTCATGG 
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AAAGTATTAAAGATCTTTAAAAAAATCGAATGCTGTCATGTCAAAGTGAGGCATGAAAAG 
TA TTATT AATGTGGCTCCATCGATrrGGGGGTTCCAAfCTGAGGAGGGCTTTTTATTTAA 
TGTTTTAATGAATCAAAATATCTTCTTAGAATGTCTCTGAAAATGGCATCTCATTACCTT 
A 

Sequence 182 

CACTNCTATAGGGCNAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAATAA 

AGTTCACCCACCCTGCACTTTGGCCCTTAGATCAATCCTAAGTAGCCATTGCCAGTAGGC 

CAAGTTTAATCAGAGGACAGTGCCTACCAGTAAATACTGAATAGTTACAATAGTTATGTC 

CATCCAACCAGTAGCAGATGAACAGCTAATACATCATGATGCTATGCTCTCCTAACAGGG 

TCCCCTCAGATCCTCAGTGAGCACATAAAGAAAGGGAGGTCATATCCCTTACATTCTCTA 

CCAGGGATTMCAC CTAAC TACTTCTCTAAGCCAGAGGCAATTCCCTTTATTTCCTTACT 

CTCGGTCGNNCTTNTTTTTAACCCAAATCTTCTGACCAATAGGGTAAAAACAAAA 

Sequence 183 

TCACTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACACACACCGGCA 

TGTTTATACACAGGGAG TGTA TGGTTCCTGTAAGCACTMGTTAGCTGTTTTCATTTAAT 

GACCTGTGGTTTAACCCTTTTGATCACTACCACCATTATCAGCACCAGACTGAGCAGCTA 

TATCCTTTTATTAATCATGGTCATTCATTCATTCATTCACAAMTATTTATGATGTATTT 

ACTCTGCGCCAGGTCCCATGCCAAGCACTGGGGACACAGTTATGGCAAAGTAGACAAAGC 

ATTTGTTCATTTGGAGCTTAGAGTCCAGGAGGAATACATTAGATAATGACACAATCAAAT 

ATTAAATTGCAAGGATGTCACACGTGNTGATGAAGGGTAGAGTAGGAGAGACCATGANTA 

TTGTGGTAACAGGAGGACCCAGCATTATTCTAGTGCTGTACCTGGCCCGGGCGGCCCGCT 

TAAAACTAGGNNGGGATCCCCNG 

Sequence 184 

CACTACTTAGGGCAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCTAACCT 

ACCTTTAAGACTGGGATAACTATTGGAAACAATAGCTAATACCGGATATAGTTATTTATC 

GCATGATGAGTAATAGAAAGGAGCTTCACAGCTTCACTTAAAAATGGGGGTGCGGAACAT 

TAGTTAGTTGGTAGGGTAATGGCCTACCAAGACGATGATGTTTAGCCGGGCCGAGAGGCT 

GTACCT 

Sequence 185 

CCGCGGTGGCGGCCGANGGNTCNNNTNCNTNNTTNNTTATTTNTGGGANGNTGNNTNTTT 

TTATTCCTAGATNTGGAGGMNCATATTTNTNTNTTNTTATAAMCTGATTGTAATAATT 

CTTTCCTATCAAAATCTCCATAGGTCAAAATATTGNNGGMTACTAAAATTTGCAACCTT 

GTTTACTTTAAAAGGTTGCCACTTTCAGTGCAGAATACTACCGGCATCTTGTTACTGCAA 

TAGTTGGAAATAAMTGTGAAAATTAGCAGAAAAAAAAAAAAAAAAAAAAAAAAAACCCT 

Sequence 186 

TCACTACTTAGGGCAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTCTGATAAAGTTCT 

T GAGA TCCTGTATGTTCTTCAAGACCTTTATCAGAGTTTATGTGAATGCGGCTATTTTCT 

GTTTTAGGACAAACAGGACTAAAMGTTCAAGGGAATATGCTTGGTTTTGTCCACTTTCA 

TATGCTTCTGTAAAACCATCAAGCTGAATCGAATTCTCTTCAGGTATTCTTATCTCCCCG 

TAACTTGCCTTTGGTTCAGTCTCCATGTTTACATTCCCTTTATTTACAGGACTCCGCCTT 

TTATCTTTCTTTTGAGCTAAAAAAGCAACCTGGCTGGGAGGTAATTATACTGAGAAATTC 

TGTATCAGTTGATATTTTAAGGTCTGAGACCTTCCTAACTGCTGCTTCAGACCTTGACTT 

ATCTACCTGCGT 

Sequence 187 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTACATGGGTGTTTTGATCTCT 

GTTCTTTCATACTACATTTGAACAGGGCAAAATGAACTAACTGCCATGTAGGCTAAGAAA 

GAAATGCTAACCTGTGGAMGTTGGTTTTGTAAAATTCCATGGATCTTGCTGGAGAAGCA 

TCCAAGGAACTTCATGCTTGATTTGACCACGGACAGCCTCCACCTTGAGCACTATTCTAA 

GGAGCAAATACCTTAGCTCCCTTGAGCTGGTTTTCTCTGATGGCACTTTTGAGCTNCTAA 

GCTGCCAGCCTTCCCTTCTTTTCTGGGNGCTCAGGGCATGCTTATTANCAGCTGGGTTGG 

TATTGGAGTTGGCAGACAGGGATGTTCAACTTAATGAAAAAATACAGNTTAAGGGNCNTT 

GCCACAACACCCTGCCGTAAGTTACTTGGCTTGNGGGGNGGGGCATNTTAAAGTTAAAGG 

GNACNGGGNTTTAATNCCCTATCCCTTTTTTC 

Sequence 188 

CCGCGGTGGCGGCCGAGGTACATTTCTAAATAAATATCTTTGATCTATAACCCTTGAAAA 
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TATTACAGTTTATGGTGACATGATAGAGACAAGTCAAACAGTATTATAAGTGAAAATGAA 

ACATCAATTTAAAATGAACAAGTGTAAAAACACGCGGAACTGAAATTATAGAAAATCAAA 

ACACTAAATACACAGTAATTAAATATCTAGCACTGGAATCACCAAATGCATGCCGCAGTA 

ACACACTTTCMCTGATATAAAAAGCCCTTTAAGATTCACATAATCTAAATAAATAAGCA 

GAAACACAAAGGGGAGGTACCTGCCCG 

Sequence 189 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGGAGGGAACTGTTGCA 

AAGCCATTTCATCGAGAAGGGGACAGAAGGAGAAATACACACATGTATACACAAACAGAA 

TGGTTGAGAAAACGTTTTMTAAAATGTGAGGGTTGTATGTGTGCGTGTATATATTTACA 

CTTAACCTCTAAAATTCTCTTCTACAGTATCTCTGTTATGAATATGATGGAAAAGCAACA 

TTTTGGTGGTGAGACTATTGTTAAAATAAATTTGAGAMGACGAAAATTTTGTGAGTCT^ 

GATAATT 

Sequence 190 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTTTTTTGTTCMTACATTTTA 

GATTACGGATTGACAAAGTAAAGATACTGCTATGGMTGATACATTGTATTTTCTGCATT 

GTGTG AAATAGTTTTTATTG AAAG N C AAGTGAC ATTTC AAAAG AAG NTCTATAAC AATT A 

TGTTTCATGCTTAMGTAAAAATTCCCAGAGTTTAGTTTAGAAAATGTAATCTTTTAM 

TTCAGACTGATATATTCCCAGTATTTTCATAATGCCATGTTTTGATAAAGTACCTGCCCG 

GGCGGCCGCTCTAGAACTAG 

Sequence 191 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGTTCTTATAAAAGATTCT 

TTCTCCCAGAATTATATCTCCTCAGAGCAACAGCAAGGTTCTCAGGATCGAAGCCTACTC 

TAGCCTGAAGGGCTAGGAAGATTAGGATAAGGATAAGGATAATAATCCAAAAGTCTCGAC 

AATTCCAGTAGTCTCTGGATGGCTCCAACATCATAGAAATTTAACACTGTTCCACTTGTT 

TACAAAATCTAACACTGGCTTAGACATTCTGGACTTTCAGTGAGGGTTCCAGCATCTGAT 

GTCCCTCAACTCCTTTCCAGGGTGAGAGGCCCACTTACAGGAAACTTAACTTCTCACCAT 

GTGGACCCATGGAGGGGTTTTTCTCTTGCAGAAGGGCTAAAAAAGTGAGGAGGCTATGGA 

AAAAATGGTGGGTCACTTTTAAGGCAGACTGTTNGGGTTGGGTGGGAATTTTT 

Sequence 192 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGATTCACTAGGGCATCCTGCGAGCCTCACTAG 

CCTTCTGGTTCATGCCTTTGACAAGCA I I I I I GTGCCCCCTCTGCTTACTGTGACAGTCG 

ATGATGAATCTTGCGTTGCCATTTTCTGCTGTGGGTAACTGCGTGCAGNGTCTTGCCTTG 

CTTTCTCTTNTTACTGTCCCACAGCTTGGTTTCATGTTACAAACAGAAAAGCTCGAGGCT 

CCCACCCCGCCACA TCCCA ACTTCATTTCCCCCTCACTGTAGCCCATTTCCACCCCACCA 

CAAAGTTGCCACAGGTTTTCTTTGTATAGAATATGTTA^ 

TATTTATTTTAGAMGTCTACTA7TGNAAGAGTTCTTC 

AGGTAAAAACTGGAA 

Sequence 193 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTCCATAAGCTTGGATTTTAA 

ACACTGATAGTATCTCATGAGTAATGTGTGTTTTGGGAGAGGGAGGGATGCTGATTGATA 

TTTCACATTGTATGAAATACCATGTTTGAAACTCATAGCAATAATGCTATGCTGTTGTGA 

TCCCTCTCAAGTTCTGC ATTTAAAATATA \ WW I CTTTATAGGAATTGATGTATACCAT 

GAAAGTCATTGTCAGTTGTAGTAGGCTCTGATGTTGAATGAGATATCATGGTTTAAGCAT 

TNCCATTTTACTGGCCTAGGGGAANAANACCACTTTTTCTTGGCTTNCATTTGGGANGGA 

TCCCCCAGGGGGNGTNTTTGGGGTGGTCCCTTTTTNTNGGGGGAAGCCAACCATTTCCAC 

TAGGCCTC I rTTTTTl I NAANCCTTTNAAAAATGGGGAA 

Sequence 194 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGGTGGACCGTGAAANGCTGTAG 

TTTGCAACCCTGCGAGTATGTCTGGATCACAGGAGAATGGTCAGAGTGCTCAGTGACCTG 

TGGAAAAGGCTACAAACAAAGGCTTGTCTCGTGCAGCGAGATTTACACCGGGAAGGAGAA 

TTATGAATACAGCTACCAAACCACCATCAACTGCCCAGGCACGCAGCCCCCCAGTGTTCA 

CCCCTGTTACCTGAGGGACTGCCCTGTCTCGGCCACCTGGAGAGTTGGCAACTGGGGGAG 

CTGCTCAGTGTCTTGTGGTGTTGGAGTGATGCAAGAGATCTGTGCAATGTTTAACCAATG 

AGGACCAACCCAGCCACTTATGCCACACTGATCTGAAGCCAGAAG 

Sequence 195 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAGTTGGGGTCATAATT 
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ATCGAGTCTCTTGATATTATCACAATTACTGTGCCCCCTGCACTTCCTGCTGCAATGACT 

GCTGGTATTGTGTATGCTCAGAGAAGACTGAAAAAAATCGGTATTTTCTGTATCAGTCCT 

CAAAGAATAAATATTTGTGGACAGCTCAATCTTGTTTGCTTTGACAAGACTGGAACTCTA 

ACTGMGATGGTTTAGATCTTTGGGGGATTCAACGAGTGGAAMTGCACGATTTCTTTCA 

CCAGAAGAAAATGTGTGCAATGAAGATGTTGGTAAAATC 

Sequence 196 

TACTTAGGGCGA ATTGG AGCTCCCCGCGGTGGCGGCCGAGGTGTGCTGCACAGATTGATA 

CATTAG C CTTTGCTTTTTCTCTTTCCGG ATAACCTTGTAAC ATATTG AAAC CTTTTAAGG 

ATGCCAAGAATGCATTATTCCACAAAAAAACAGCAGACCAACATATAGAGTGTTTAAAAT 

AGCATTTCTGGGCAAATTCAMCTCTTGTGGTTCTAGGACTCACATCTGTTTCAGTTTTT 

CCTCAGTTGTATATTGACCAGTGTTCTTTATTGCAAAAACATATACCCGATTTAAGCAGT 

GTCAGCGTA l III! ICTTCTCATCCTGGGAGCGTATTCAAGATCTTCCCAATACAAGGAA 

AATTAATAAAAAATTTATATATAGGCAGCAGCAAAAGAGCCATGTTCAAAATAGTCATTA 

TGGGCTCAAATAGAMGAAGCTTTTTAAG7TTTAAATCCAGTTAATCTGGT 

Sequence 197 

TCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAATTGTTCTACTGT 

TTTAAAAAGTTTTCCGCAGAACAGTGCATTTATGGCAATGCTATGTTTAATGAGTTAGGG 

ACATCAAATATATAGTAGTTCCTTATTTTCAGTTGTGAAAATGAAATGGCTAAAGCAGAA 

GAGACGTCTATTTTAGTCTTTTAAAAATGTGTGTGGGTGGTCTTTTTTCCTCAGAAGCCC 

AAAGCACATGTATATTTTGTTATTTCTCCTTGCTATATTCCTGAGACTATACTAAAAACT 

TTAAGAAAAGGAACAAGAAAAAGGTAAATTCATGTGTTCCCCACTGCTGTGTCTAGAACC 

MGATCACATTATATCATTGTTAAAATTGTGTTATCTAGGAAAGGTGCAATATAGGGAAA 

ACACTCTAAGAATCTTTTAAAMCCTAAGGGTTTCCCTTATTTTGTCAGAATATGTNGGG 

AAGTGGCCATCCCATAAGGAATCTTTTTAACC 

Sequence 198 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCTAACCTACCTTTAAGACTGGGATAACTATTG 

GAAACAATAGCTAATACCGGATATAGTTATTTATCGCATGATGAGTAATAGAAAGGAGCT 

TCACAGCTTCACTTAAAAATGGGGGTGCGGAACATTAGTTAGTTGGTAGGGTAATGGCCT 

ACCAAGACGATGATGTTTAGCCGGGCCGAGAGGCTGTACCT 

Sequence 199 

CTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGATGAACA 

CCAACATGTTCCAGAAGACCCAAGAGAAAAATCACAAGATGAAGTCTTGAGAGATGACCC 

TCCAAAAAAAGAACATCTACGGGATACAAAGTCTACATTTGCTGGCAGTCCAGAGCGTGA 

GTCCATTCACATCCTGAGTGTTGATGAGAAGAACAAGTTGGGAGCCAAGATTATCAAAGC 

AGAGATGATGGGGAATATGGAATTAGCTGAACAACTTAAAGTTCAACTTGAAAAGGCAAA 

TAAATTCAAAGAAACTATAACACAGATACCAAAAAAATCTGGGGTAGAGAATGAAGACCA 

GC 

Sequence 200 

CCGCGGTGGCGGCCGAGGACCTGTGTTNAAAAGATGAAAGAAATGGNTGCTGAACCCAAA 

ACATGTTTAGGAGGAAAACCAGAAGAACAGAGCAGTGGCACTGGACCTCCTGTCTGTCCG 

TGAGGCTGCAGAACACAAGGCAGCTCCCAATGCCGCTGGACTCTCTGACCTTGGAGAGNG 

TAAACATCATATATGATTCTGTGTTCCTGAGATCTGGGTTTAACGATCTGGGCTTTGTCA 

TGGTGTT 

Sequence 201 

CACTACTATAGGGCGNATTGGAGCTCCCCGCGGTfeGGCGGCCGAGGTACAACCTCTCGGC 

CCGGCTAAACATCATCGTCTTGGTAGGCCATTACCCTACCAACTAACTAATGTTCCGCAC 

CCCCATTTTTAAGTGAAGCTGTGAAGCTCCTTTCTATTACTCATCATGCGATAAATAACT 

ATATCCGGTATTAGCTATTGTTTCCAATAGTTATCCCAGTCTTAAAGGTAGGTTAGGTAC 

CTGCCCG 

Sequence 202 

CTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAGAATCCAAAAG 

TCATAAAAAGCAAAAGCTATCTTTTTTTCACTCTGGCACCCATCTGTTCTTCCCTGGAGT 

CAAACACTATTACCAATTTTTAGGTATACTTCCAAAGATACTTACTG CATTT ACAAGCAC 

AGACTTATATTGATTCTAAAAGAATAAGAGACAT7TTCAGCATGTTGTTT TGTTCAA CAC 

CACAGTATATCTTAAAGATGGTCCCCCATCAATACATATAGAGATCTCTCTTTTTTATGG 

CTTCATAGTATTATCATGTGCTGATATGCCAAAAACTAATCCATTAGACCCTTTTGGGTA 
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TATATTTACTATTTTGCAGTCTTCTGCTCTTAGGAATAAAGTGAAATTCAATCACCTGTA 

TAGAAAGTATTTCACACATGTTTGAATATATATGTAGAATAAATTCCCAA 

Sequence 203 

CCTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAAGCTATAM 

GGAACGTTrTTAGAGAAAGCACTGAAGACACACATTTTGCTGACCTAAAAGATTTTAAAA 

TGAATTAGAATAATTTACATCATATAAAGAGGTATTTAGTCTTTAAGTGGAGAAAGTTGC 

TAGTCACATGTAAGAAAAACAAGTATTATGGGCCTTCCTAAGACAAATGGAATAAATTCC 

ATCACTTTTGGCTTTTTTATAAACCAGACTTCTAACAAATAACCATATTAATTT^ 

AATCAMCATATCCTAAMGAGGAAAAAGAGAATGTCTACTTTTAAATCACATCCAATGT 

GAATTTGCGTGAATGCAAGGCATAAAATACCT 

Sequence 204 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAGGGCAG 

GATGGCAGGCACAGAGCCGCCAGCTCTTTCCCCGCCCACTCTCGGTGAAGCAGTGCAGCT 

TGCCCGCCAAGTAGAACGTCGTGAAGCCAAGGCCCGAAAAGGCAAAGGAGGAATGGATGC 

TGGGGMGCTTTTGCGGCCCTCGGACACCAGATCGGGGTCACCTGTGCAATGCATTTCCG 

AGTTCATCACTCCATCTTCCCACTATTAATTTAATAGTGTTTGTGCAGACTCCATTCAAA 

GCAAGAGCCAAGGACACCGCTMGAAGGCTTCCTTAATTTCAGTCTTGTCTGTTCGCCGG 

ATAATTTTCACCACACAAATAACAGCCAGGGGTGTGAGGAAAGAAATTGCAAACATGAGG 

CNGGGTAGGGTATG 

Sequence 205 

CTACTTAGGGCGAATTGGAGCTCCCCGGGGTGGCGGCCGCCCGGGCAGGTACCATATCTC 

CTGGCTTCCTTCTACATGGGTCACTTAGTTAAGAGGGAGGCCAAGGGAGTTCCGATTTCA 

GGCAGTGTGTGGCAGGGTTACTGTCCTAGCAACCTGGCTACTCCTCACTGTGAACGTTTC 

TCATAGGTGTCATATGGCAGGATGAAAAACATATTTGCCTCCCAGTGAAAGATGGCACAG 

GCTTTTGCCCAGCCAGGTTGGCAAGAGAACAGAACTCTTAACCCCTTGCTCGACAGGTTT 

GAGTTCAAGGGGTTGGATGCTCCAAGCAGAGGGCCAAACCCTGATTTATGAAGCATGCTA 

GGTCAACAGCCAGTCAGACCT 

Sequence 206 

ATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGGTACAGTGCCCCCTG 
CTTGTGGTGCAGGGGGGTGCCATTGCCGTGGTGCTTGGCATTGAGGGGCTGCAGGTGATG 
CATTGCACGGGACACCAGGTCACTCAATTCGGCGATGCTGCCTACGCGCGTGGCCAGCAA 
AGACTGCTCGATGGTCCTCAGCTCCGACTGGCTGAACATGGGCAGTTCCCCTGGACCT 
Sequence 207 

AGGGCTAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAGCTAAAGAAGG 

CAGGAGAGAAAGTTTTTGAGGATGGTGTGTGTAATCGGTCACTATTTACTGTG TGAGT TC 

CACTGGAGACAAGCACACTTTCTCCAATCCCAGAGTCTACACCTGACTGGGAATTTTTCA 

TTCTGGATTCTGATTGAGGGAAAATACTGATATCCAATGGGCTATAAGGAGGACCTTTCA 

GTTTCTGTAACAGCATCTGAGTTCTATGCATGTCCTGGAAAGGCACCTGCCCACTGGCTA 

ATTCACATGCTGTAATCCCAACACTGTAAATATCTGACTTCACATTATACCCACGTAAAT 

CCTGTCTCAGTAGTTCTGGACTCAGCCACGGCTGCACTGATGTGCTGAACTGTGG 

Sequence 208 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCAAGCTAAAGAAGGCAGGAGAGAAAGTTTTTG 

AGGATGGTGTGTGTAATCGGTCACTATTTACTGTGTGAGTTCCACTGGAGACAAGCACAC 

TTTCTCCAATCCCAGAGTCTACACCTGACTGGGAAl I I I I CATTCTGGATTCTG ATTG AG 

GGAAAATACTGATATCCAATGGGCTATAAGGAGGACCTTTCAGTTTCTGTAACAGCATCT 

GAGTTCTATGCATGTCCTGGAAAGGCACCTGCCCACTGGCTAATTCACATGCTGTAATCC 

CAACACTGTAAATATCTGACTTCACATTATACCC 

Sequence 209 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTTTTTACTAAATTATCT 

ATGTCAAAAAAATTCTGTGCCTGGCGTGGAATTTCACTCCATCMGTGTTACAATGATTT 

TTTCATTTTCATTACAAGCAGGAGAATGAATGTAGGACAAGTGTTAGGAAACATGGCAAT 

AAATTAGAATATAATTTACAAAAGCAAAAAAATTAACAGTGTACCTGCCCG 

Sequence 210 

CTATAGGGCNATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATAGTCTCCCAGTCTACA 

TGTCCTGGCTGTTAATCATCTTGGCCCCTTGAGGCACATCACAGTTTGAAGGACCTGTTT 

AAGTTGAAATAGACTTTGCTTATTTATTGGGATTCTAAAAAATTCTGAGTGAGTTTGCAG 
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TATGAGAGGAAATAAGATTTCCTCCTCCTTCCTCTCATTTTATATTGACTGTTTGCCAGA 

MCTGTTTTCTTCTGTTTTCTTATATTTTGTTTTTGAGATGGAGTCTCACT^ 

AGGCTTGGAGTGCAAGTGGTGCAATCTCAAGCTCACTGCAACCTCTGCCTCCTGGGTTCA 

AGTGATTCTCCTGCCTNGGCCTCCTGAGTAGCTGGGAATTACAGGCCCGGGCCACTACCN 

CCCGGCTACT7TTTGGATTTTGGTTTTTAAGAAAAACCG 

Sequence 21 1 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTNTTTNNCTTAAATTGCT 
TTNTAGNTCTAAGATTGTANAATGANCCTNNCAAATTGGACTCTTTTCTAACAGAGNTAT 
TTTAATATACTTGNTTTNTTAAAAAACAAAAAAACTACTGTCAGTATTAATACTGAGCCA 
GACTG N C NTCTAC AGATTTN AG ATCTATN ATTTTATTGATTCTTAAGCTTGTATTAAAAA 
CTAGGCAATATCATNATGGATACATAGGAGAAGACNCATTTACAATCATTCATTGGG 
Sequence 212 

AGGTACTGGATGTTAGAACGGGCTCCAGGAGCCAGGAATCCTGGCTCTGCACTGTTGGAA 
ACATCTAAGCTAGTCTTGCGCCCATTTACTGGCTTGACAGGGATGCCTGAGGACTTCTGG 
ATCTTGCTGAGAGTGGCTGAACCACCAGTTTGCATGACAGTGGCTGTGCCTGTGGCAGGA 
GGAGGCTTCTTGTAGCCAAAGGATCCCGAAGTGGAGGGGCGAGCAATGCCCGAGGGGGGC 

Sequence 213 

CCGCGGTGGCGGCCGAGGTACTTGAGACCTCATGCATATAAAACCAGCTTACAAACTACA 

TCGCACTATATGAAGAAATTATCACTGGGGGCAAATCACCAAGTAGGGAGCACAATACAC 

AGGGTGTGGATGTTAATGTCATTCCCTAGCCTTCTCATTCCCTTCTCTTGGTCCTTTATG 

CATATGGAACAGTTCCATTATTAAATTTTGTAATAATAACTGAGAACCTGACTCCCAGCA 

AGGGAGTAGTTCAA GAAG TTGAGGGGAGTTTAAATCTGAATGAGTAAATAAAGCAATTAT 

ATCATTAGCTTAAAATTTTATCATCAATTAAAAATAMTTTAAAAACAAAT^ 

AAAAAAAAACC 

Sequence 214 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAAGTCTCATAAACTATCTCAGCACTGCATCCG 

TATTTGTGGGCACCTGTGCCAAAAGCACCTTGATATGTTTCTTAGTGTAATTGGCCAGAG 

ATGGCCAATAGATGTCTTCTCAAGTGTTTTTGATCACCAGGTCAGTGCTATTGGGAGTGA 

TATAATATGGGTGGTTCCTAAAGTTATTTTTGGTCTC rTTTTTTTTTTTTn I AAGTGCT 

ATGA TAGA TTAACAACACCTGCCTTGGATGCAGGAATTAGGTGGGGCTGCACACGTGCAT 

GGTATTTTCTATCCCTATGCTAGGTTTTGNTTCTGTTGGGCTGTCTTCACTAGACTTGAG 

ATGACTTGATTTAC 

Sequence 215 

TTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGTAAAGTGAAAAGGGATGT 

GCANAAAAATAAAAA AAAA CAACAAANAAAGCTAACCTTCTATTAGAAAAGGGGACAGGG 

GAATGAGTAAACTTCTTTTATTGCGGACAAATGTGCACATAGCCGCTAGTAAAACTAGCC 

TCAAACAGGATGCTCATAGCTTAATAATAAAAGCTGTGCAAAG 

Sequence 216 

ACGACT CCTA TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATA 

CAGATCTmACTGCCT TGGTG TTAAATTTATTCCTAAGTAAGTA I I 1 1 I H I ATGCTAT 

CATAAATG GCATC TTAATTITTCAGTTAGGTCATTATTTTCATTAT^ 

GCTAATGCl I I l IGTATGT TGATTTTGTATTCTGCAACTTTACTGAATTCACTTGAGTTC 

T AACA AACAGA I I \ I III IGTGTGTGTATCTTTGGGGTTTTTACATAGAGCCCAGTTTTA 

GTTTTAATAGAAATGTTTATAATAGTGGATGGAATAAAGACCTTTGTATGATTACTGTGA 

ACCACTATCTCTCTCTATATATATAGATAGATATGAATGAGTATGTGTATAAATCATAAA 

GCAAAGGCTTTGAGAAACCAMCTCATCTTTGCTTTCTGTACATACTGTGGCCTTTCTGG 
TTTATT 

Sequence 217 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAAATAACATC 

ATAAATAAC ATAAATG AG AAG TT7TC AAG TATTTC AATG TGC CTTG AAAAG TTTC AAAC A 

GTAATTCTCACAATTGGTTAGACAATCA ! I ll M I I I CTTTTCTTTTCCTCTCATTTAGT 

CTCAGAGCCTAAAAAGAAATGCCTCCAGTCTCTGTTAGCCCCGATGTTTGGAATGAAATT 

AACAAGATTCTACCTTAAAAGAGAAAACTTATGTGGGCTTTTCAAATTGTGAAAATTTGT 

TCCCTCTTATAAAATATAATCTTTCCCCTGCTGCTGGTTTATAAATATGGGCATATAGCC 

CAGAGCCATTATAAAGAAAAAACAACCAACCACAATAAACTAAGGAGCCTCATCTGATGG 
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CTMGGGTTCTCAGGAAAAAATGAGAAGAGCCCTTTGATCAGGAAAGGAAAAAAGGATAG 

AGGATAAAACAGTAACCTTTTA 

Sequence 218 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATCTGAGACTC 
AAGACTCTCACTGATTG GAGAGC TTGTGGAAAACAAAACACACCATGCCAATAAATGAGA 
TGAAAACTTGAGTTTGCCTTTTTAACTATTTATGTTCTAAGTTAAGCTTTGA 
AAATGTCAAATTCTCTCATTCTTATAAAAAGTTGAATTAATTGCCTGTAmATTTTAGC 

MTTATTCAATGTATTTCCAGTATAGGATGTATAGTATAATTAATTTTTTGTAMTAAM 

TATTTTACCT 

Sequence 219 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTANTCTTGTTTCTATGAA 

GGAAAAGTTTGGCTACTAACAGTAGCATTGTGATGGCCAGTATATCCAGTCCATGGATAA 

AGAAAATGCATCTGCATCTCCTGCCCCTCTTCCTTCTAAGCAAAAGGAAATAAACATCCT 

GTGCCAAAGGTATTGGTCATTTAGAATGTCGGNAGCCATCCATCAGTGCTTTTAGCTATT 

ATGAGTGTAGGAAACTGAGCCATCCGTGGGTCAGGATGCAATTATTTA 

Sequence 220 

CTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGCCAGATCTGT 

ATCACTCCGGAGCAACGACCTGTCTCTCTGAATGATCTCTGGCATGGTAAAAGAGTTAAC 

TGTATTTCAAAAGGTGAAACAAAACTTTTTTTGCTTTGT^ 

AATGGTTCAGGCCTGGAGTATGCATCAGAATTATCTGTGAAGCTTAGTGAAAGTGCTGAT 

GCCAGAGCCCACCCAAAGCTTACTGCTTCANAACCACTGGGGACCTGGGCAGTTCTGATG 

TGCACGACTAGTCAAGAATCCTTTAAAGGCTGATAAAGCAATCTTGAAATCACTGTCAAT 

GCAAAAGTGGGATGTTCCTTATTAATAAGAACTCAAAAAATAAAGTACCTCGNCCCGCTC 

TAGAAACTAGGTGGATCCCCCCGGGCTGCANGAATTCCGATATCAAGCTTATCGATACCC 

GGCCGACCTTTGAGGGGGGGCCCGG 

Sequence 221 

CCGCGGTGGCGGCCGAGGTACGCGTTTCTTGTAAACACGAGGCACCCCAAGATAAGAAGA 

CAGATAGAGCAAGGGATGGACATGGTCATCTCCTCAGTGATTGGAGAAAGTTACCGGCTT 

CAGTTTGATTTTCMGAGGCAGTGAAGAATTTCTTCCCCCCAGGAAATGAAGTGGTTAAT 

GGAGAAAATTTAAGCTTrGCATATGMTTCAAAGCTGATGCATTATTTGAmCTTCTAT 

TGGTTTGGGCTCAGTAATTCCGTTGTAAAAGTAAATGGAAAAGTTCTGAATTTGTCAAGT 

ACCTGCCCG 

Sequence 222 

AGGTACTTACATGGGTGTTTTGATCTCTGTTCTTTCATACTACATTTGAACAGGGCAAAA 

TGAACTAACTGCCATGTAGGCTAAGAAAGAAATGCTAACCTGTGGAAAGTTGGTTTTGTA 

AAATTCCATGGATCTTGCTGGAGAAGCATCCAAGGAACTTCATGCTTGATTTGACCACTG 

ACAGCCTCCACCTTGAGCACTATTCTAAGGAGCAAATACCTTAGCTCCCTTGAGCTGGTT 

TTCTCTGATGGCACTTTTGAGCTCCTAAGCTGCCAGCCTTCCCTTCTTTTCCTGGGTGCT 

CAGGGCATGCTTATTAGCAGCTGGGTTGGTATGGAGTTTGGCAGACAGGATGTTCAACTT 

AATGAAGAAATACAAGCTAAGGCCTTGCCAGCAACACCTGCCGTAAAGTTAC 

Sequence 223 

CCGCGGTGGCGGCCGAGGTACAGTCACCATCATACTTGTCTCCATTTGGAAAGATAAAGC 
TGATCTTATACACTTCTGAAGTTGGTCGAGAGGTGTTAGAGAGATCTTCCAAACTCTGCG 
ACTCCCCCTGGGCGGGCCAGCTCTGCGGCCACCTTGCCCG 
Sequence 224 

CACTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATTACTC 

CATCTACCGAGGCAGCGCATGGCATGACTGAACGGCTTGGAACAAACACAGAAATTAGCA 

CCACAAACATTCAGGAACCAAATATAATCTGCTATGGTCACACCACAGACAATGCAGGAA 

GAGGCI I ! I I ATTGCTTGTGTGTGTTTTCAAATCAAGTAGGCACAACCCAGTCCTCATAA 

CAGCCCTCGGGAGTGGAGGCACCGGGGCGGGAGAGCGCAGGAGGGAGTGTGCTTGGCTAT 

CCTGCCCAGTGGAAAGGAAAGTGACGTGCATTGATCTTAGAGTCCTTGGCAACAGACTGT 

GGAGGCAATAACAGGATAAAGAAAGCAGGGAGAA 

Sequence 225 

CCCGCGGTGGCGGCCGCCCGGGCAGGTAAAAAAAAAAAAGGAGACACAAGACTTACTGCA 

AAAATATTTTTCCAAGGATTTAGGAAAGAAAAATTGCCTTGTATTCTCMGTCAGGTAAC 

TCAAAGCAAAAAAGTGATCCAAATGTAGAGTATGAGTTTGCACTCCAAAAATTTGACATT 
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ACTGTAAATTATCTCATGGAATTTTTGCTAAAATTCAGAGATACGGGAAGTTCACAATCT 

ACCTTATTG 

Sequence 226 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTCCATT 

CATTCCAGTGAGAAGCACTAGMCTTTAAAATTTCAGAATTGGATGAAAATAACAAAGAT 

CACCCAGTTGAATCATATTTTTTTGTTCAGAAGCAAAAACTGAGGCTCAGAGACAAACTG 

TCCTGTCCAAGATTATACAGAAAATGGTGGGAAGAGTTGAGAAGTTTCTCAATAATGCCA 

TGACTGTTGAATCATAGGAACAAAAGCCAAGAGATGGATCCTTACAATAGTTTAACAATA 

TAAACCACATGGAACATTTCAAGAAAGAGTTAMCAATGATGTATGTTTAAAAGCTATCT 

CTACC TG GCATAGTATCTC AC ATTCTATAATCCTAGC ACTTTTGG 

Sequence 227 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTGGCCCGCAGA 

GCTGGCCCGCCCAGGGGGAGTCGCAGAGTTTGGAAGATCTCTCTAACA CCTCTC GGCCAA 

CTTCAGAAGTGTATAAGATCAGCTTTATCTTTCCAAATGGAGACMGTATGTTTTTCTCC 

TCTGTTTAGATGGTGACTGTACCT 

Sequence 228 

AGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAAATAACATCATAAATAACATAAAT 
GAGAAGTTTTCAAGTATTTCMTGTGCCTTGAAAAGTTTCAAACAGTAATTCTCACMTT 
GGTTAGACAATCA I MINI I I CTTTTCTTTTCCTCTCATTTAGTCTCAGAGCCTAAAAA 
GAAATGCCTCCAGTCTNTGTTAGCCCCGATNTTNGGATGGAATTTACCAGGTTTNTCCTT 
XAAMNAAAAAANTTTTGGGGGGTTTTNAAAAANGGGAAAAA^ 

AAAAMNAATTCCCCCCCCNGGGGGNTGTTTTTAAANAAGGGGGGGNNNNCCCCCCCCCC 
1 11 111 11 1 I ANAAAAAAAANCCCCCCCCCCCCCNAANNNNNNNGNNNNCCCCCCNTTGG 
GGGGGGGGGGGGGGNNAANAAAAAAAAAAAAAAAAAN CCCCCCNTTT CTCGNGGGGGGGG 
GNGGGNGGGGGGGGGGGGNNNAAANAAAANN I rTTTTTTTTTTTTTTTTTl HNCNCCCC 
CCNCNANNNNNNNGGGGGGGGGGGGGGGGGGG 
Sequence 229 

CTNCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAAATAAC 
ATCATAAATAACATAAATGAGAAGTTTTCAAGW 

ACAGTAATtCTCACAATTGGTTAG ACAATC A I NIMH I CTTTTCTTTTCCTCTCATTT 

AGTCTCAGAGCCTAAAAAGAAATGCCTCCAGTCTCTGTTA GCCC CGATGTTTGGAATGAA 

ATTAACAAGATTCTACCTTAAAAGAGAAAACTTATGTGGGCTTTTCAAATTGTGAAAATT 

TGTTCCCTCTTATAAAATATAATCTTTCCCCTGCTGCTGGTTTATAAATATGGGCATATA 

GCCCANAGCCATTATAAAGAAAAAACAACCCACCACAATAAACTAAGGAGCCTCATCTGA 

TGGCTAAGGGTTCTCAGGAAAAAATGAGAAGAGCCCCTTGATCAGGAAANGGAAAAAANG 

GTTGGA 

Sequence 230 

TGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTACATGGGTG7TTTGATCTCTG 

TTCTTTCATACTACATTTGAACAGGGCAAAATGAACTAACTGCCATGTAGGCTAAGAAAG 

AAATGCTAACCTGTGGAAAGTTGGTTTTGTAAAATTCCATGGATCTTGCTGGAGAAGCAT 

CCAAGGAACTTCATGCTTGATTTGACCACTGACAGCCTCCACCTTGAGCACTATTCTAAG 

GAGCAAATACCTTAGCTCCCTTTGAGCTGGTTTTCTCTGATGGCACTTTTTGAGCTCCTA 

AGCTGNCAGCCTTCCCTTT 

Sequence 231 

CCGCGGTGGCGGCCGCCCGGGCATGGTACTAGAAATTGCACTGTCCTAGAAATTCCTCCA 

TCCCATGAAGCCA7TTCCATCATGAAACCTTTGACACACTTGAACACAAGTTCCGCTCTC 

CTTAAGCACCAGGGTATAGTCATTTNCTCTGACATATAATGTACTTAAGCNCTGCAGAAT 

NGCTCNAGTAATATNCCNATANGTNCNTTTCCAATTTCAATTA 

Sequence 232 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTGGAGCT 

GACCAAACGGGAGGTCGAATACATTGCCCTGTCAAGGGACACCACTGAAACTGATCTCAA 

ACAGCGACGAGAGATCCGTGCAGGCACAGCCTTTTACATTGATCAGTGTGCAGTTCGTGC 

AGCCACAGAAGGCAGAACTCTCATTTTGGAAGGTTTGGAAAAGGCAGAGAGGAATGTTTT 

GCCTGTTTTGAACAACTTGCTGGAAAACAGAGAGATGCAGCTTGAAGATGGACGCTTCCT 

GATGTCTGCTGAGCGTTACGACAAACTTCTCCCGAGATCATACCAAAAAAAGAGTTGGAT 

TCTTGGAAAATTGTCCCGAGGTTAGTGAAAATTTCCGAGTGATTGCCTTGGGGCTTGCCA 
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GTGCCCAAGGGT 
Sequence 233 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACGATAGAGATAATA 

G AAGACA AAGAGATTAAMGGAMTAAAMTGCATGATTAAAAACTAAGAATAAAAAACC 

TATTTTTATGTTTCCTAMGGAMTTGTTTATTCTACAGACTCAGTAGGTAGACACAAAC 

ATAMGATTTCCCTAGMGACATAGAGTGGGATTTGATAACACTGTCTGTTATTTTNTGT 

ACCTGCCCGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnNGTTCCCTT 

TANGGNGGGGTTA ATTTG CCCCCNTGGGGGGAAACATTGGGANTAAGCTTTTTTCCCCCG 

GGGGGAAAAATNG \ I I I I CCCCGTTCAAAAATTTTCCCCANNAAAATTTNCNGGGCCGGG 

GGGNAANAAAAAAAGGNAAAANANCCCCNGGGGGGCCCCCAAAANGGGGGGGGG 

Sequence 234 

CTACTATAGGGCNAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTrTGTTTTGTTTT 

AMTTAAGAGATTCCCAMTGCCTTTTTCCCCCTCATCTTGAAATGAGATGAGTTTTTAT 

GTGTAAGCAATATTTATTTAACTATTCTATAAAATTATTGAGTGCCTACTGAGGCCTTTA 

AGCACCGCTAACATTCCTTTCCATCATTCTTTTAAATGACATAAAATAATTGTGCAATGT 

TCCTGATGATGTACCTGCCCG 

Sequence 235 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACTAATGTTTGCAAC 

ATTTATTGAGATCTAAGTCTGTCTTGCCTTCATTTCTCTTTTTATCTCCCCCTTGCCCTC 

ATTCTTGAACAGCTGGAGGMTACATTTTATTCTGTCCATGAAGCATACACTATGAAATT 

CAAGTGCTTAAAAATACTTCTATGACTCTNTGCTATCCCACTGTATAGATCCACAGGGAG 

CAAACACTTAGAAATGATAGAGAACTGAAGGAGATCAATGGTTTAACAGTTATCCATGCC 

MGTCCCATTGNCAGAAATATTCTTA7TACTCAGGCAAACACTCTTTGAGCTTCCCTTNC 

TAAAGGTAACCATTCCAGTGAATAGATGTGCCCT 

Sequence 236 

TNCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTTTTTAATCATTCATATTG 

ACTTAATCA GGTT GACAAAATCTCAGGGTTGGTCTGGGATCATTCACAAATCAGAATTAC 

TCAGAGAAATTTTCACAGCAGTTGCAGCAGCACAAACTCTAACGCTGATTTAAATTAATA 

ACTGTTATTAATGAACAAACTTGAAAAAGGMTTTCATATAGAACATTACATTTCTCCCT 

GACACTACCGTATCATAAAAATTAAAMTGATATAAACATGTTTTAAAGTCATAGTGGTA 

GAACCAGCCACACTACAGTGCGGCATTGTTAGGAAGTTCGCTCTCAAGTTGACTCAGCTT 

AGTAAAGCAAATTGTTCTCTGAGA7TTCAGCTTTGGTTAGTCCTCTTTGGCTGA 

Sequence 237 

CCGCCCGGGCAGGTACNTTGTTGTCACAATTTAAAACTTCCTCTTCTACATACATGTTCC 
AGAGAGCATCTTCCAAACCGGATACATTTTTGACTGCTACTGTTAGATCTAAGAAGTCTT 
CCTGCCTCTCGCTAACGTTCTTACATTCTTTACAAACAATCTGGTTAACAATGGTTCCAT 
GG TACCTT GGGCCGCTTCTAGMCTAGTGGNATCCCCCGGCCNGNNGNATTTTNTTTTCA 
AN NM I 1 1N GNNTCCCCGGCCCCCTTTGGGGGGGGGCCCCCCCCNCCCCCN H H \ 1 II C 
CCNI I I 1 I GGGGGGGGNTTANNNNNNNCGCCNGGNGGGNTNNTNTNGGGGNTCANNNTAT 
TTTTTCTN N N GGGGGG AAAAAAAATATTTT 
Sequence 238 

TNCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCTTTGAGGACATTTT 

TGTCAGATTAACTATAACAGTGTAGTGTAGT1TTTAAAATTGCAGTTGAAAAGTTTAGCT 

GTCTTGGAAGTCAAATTTATCCAATTGTTCAGACTTCTGTTACTACTTAATATGAAGCCA 

CCATGCTGGCTTGGACAGAATTAATTTCATTCATGTTATGGAGMTTCTATATTACAAAT 

CTGGTCCCCTATAATATGAACAGTGAGCAGTCAGAAATATACAAAGGGTTAAATAGGGTA 

AAGACTTrGGCCAAGAAAGGAAAGGCCTTAGTTCTACCATAGAGTATCTTCTCTAATTAA 

AAT GACT GGGGAAATATATGGAAGCAGAAACCAGCACAAAGCACTACCCATCTAGAAATA 

ATCTTTTCAGGTTAAAAAACAAC 

Sequence 239 

CTACTTAGGGCAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGCTAGCTG 
GAAAAACCATATAGAAAACATGATGACCATGTTGAGTTGTTGGTGCTGAGGCTGTGAACT 
CTTTGGCTTCCTTCTGATGGCCCTCTTGAGAACAATCTGTACCT 
Sequence 240 

NGC N AATTGG AGCTCCCCGCGG NGGCGGCCG AGG NGTTGGGTTTGATTTGCTTTTGTTTT 
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TAATGCAGCTCTTTTAATATAAAGATTCTTGATACAGTGAAATCTCTTATTTCAAGTGTA 
AGTTATTCTTCACCCACCCCTTCCCCTGCCATTGTATTTCCCATCTGTTNNANGGNGTTN 
NANCANTTNNCATTGNNTCGNATNCAGNAGGTNCCTGCCCG 
Sequence 241 

AGCTCCCCGCGGTGGCGGCCGAGGTACTTATTGAGTGCTTGTATGCACCAGGCTCTGAAC 

TAAACGCGCTCCATATTACCCCCATTTAAATTCTTACCATTATTCCTGTCTTACAGAAAG 

AAAAAAGGCATGGTGAGTCACACAGTTCACAAATGAGGGCTAGAACCAAACCCAGGCACT 

GTGACCTGCTACACAGGAGAGGCAGATATGAAGATTTCTTCAAGGGTGCTTATTACAGCA 

TCATTTATAATAGCAAATAAAGTACCTGCCCGGGCGGCCGCTCTAGAACTA 

Sequence 242 

CCGCGGTGGCGGCCGAGGTACTCCTCCAGATCCTGGTGCTGCTTTTGTGGTGGTAGAATG 

TCCAGATGAAAGCTTCATTCAACCCATCTGTGAGAATGCCACCTTTCAGAGGTACCTGCC 

CG 

Sequence 243 

CCGCGGTGGCGGCCGAGGTACTTTTTAAAAAATCTATAAATTTAATGCACTGTCCAAGTG 

AAATGTCCTAGTTGTCATTGTGATTAAGGGGCCAACTTTCCAGGCAGCTAGCAGAGATAC 

TATTCTCTTCCTCTCCCAGCAAATTTGTATTCCTTCGCCCACGCATTCCTGCTATACTAG 

ATGGCAGCCAGTGATGGAACTATAAAGATGTCTGTGGTCATATGTTGAATGTGGCAGCTT 

GAAGATGTACCTGCCCG 

Sequence 244 

CCGCG GTGG CGGCCGAGGTACACGAAAAAGGATTGGAATTGTGTTTAGTTTTTGGTGTTT 
AGATTATTTTCTTTGTGTCTTTCTTCTTTTTC ITTTTTTTTTTI I CTTTTTTACAATAGG 
CCAGTAGAATTCTTATGAGMCAGGCTTTTT^ 

GTAGCTTCCAGAAGGTGCCAAACAGGTTTCAACATCATATTAATTAGAAATACCCATAAT 

AATTATAACAACAATAAAAAGGTATAATATGGTACCTGCCCG 

Sequence 245 

GCTNCTATAGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTTTGCTTTTCT 

TT TGAAA GTATG AAMTG ATTTTTACAATGTAAAGAGAGTCTGAGAAATGCAATTGCTAT 

TCTTTTTATATACTTTTT^ 

CAATTTTATATACTGATTTTTTCAATCAATGTATATCATGAGTTTTTAAAA^ 

AATCTTCATAACTAATATTTTAATGGGGAGTGACTGCTTAATTGGCATGAGGTTTCCTTT 

TGAGATGAGGAAAATGTTCTGAAAATATGGTGATGGTTGCACACCATTGTGGAATGGTAC 

CTGCCCGGGCGGCCCGAGGTACTTGCCTGGGAGAAG 

Sequence 246 

GCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACAGGGTCCCCAGCTTACTCCAGGGATGA 

TGTCAGAAAATGAGGTCCTAAACATGCAGCTTTCGGATGGAGGACAAGGAGATGTCCCTG 

TTGATGAAAACAAACTCCATGGTAAACCTGATAAAACCTTGCGCTTTTCCCTCTGCAGTG 

ATAATCTGGAAGGAATATCTGAAGGTCCTTCAAATCGCTCCAATTCAGTGTCCTCCCTAG 

ACCTAGAAGGAGAGTCTGTGTCAGAACTTGGAGCAGGACCTTCTGGCAGTAATGGAGTTG 

AAGCTCTACAGCTGTTAGAACATGAGCAAGCTACAACACAGGATAACCTTGATGATAAGC 

TAAGGGAAGTTTGAAATTCGTGACATGATGGGGATTAACAGATGATAGGGGACATATCAA 

AAACAGTGAGTGAGACTTGGAGTCCTTCGGCCCCTTTAGAACTAGGTGGATCCCCCCGGC 

TGCAAGGAATTCCATTCAAGCTTATCGATCCGTCCACCTCGAGGGGGGGGCCC 

Sequence 247 

ATTG GAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTNC I (TTTTTTTTTTTTTTTTTTl 

TTTTCTTATCAAAAATATTTTATTTNCAAAAAATTAATTATNCTA 

AAATNCATTGAATAATTGCTAAAATAAATNCAGGCAATTAATTCAACTTTTTATAAGAA 

GAGAGAATTNGACATTTGAATGTTATCAAAGCTTAACTTANANCATAAATAGTTAAAANG 

GCAAACTC AAGTTTTCATCTCATTTATTGGC ATGG N G NGTTTTGTTTTCCAC AAGCTNTC 

CAATCAGNGAGAGTCTTGAGTCTCANATGTACCTCGGCCGNTTTAGAACTAGGGGATCCC 

CCGG 

Sequence 248 

TTAGGGCNATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCCAGCACATCTCA 
GAGAGATGGAGGCAAGCTGGGTTCCTGATGATTTCCAGCTAATACAGGTCCTTCCTCCTG 
CATGCAGGCGCCCCGTAACGCTTTATAACAGACGCCTCTAGACTTCTGTGGGGGTAAAGT 
GAAGGACCCAAAGCGACACAAGTAGTGTCTGTTCACACTTCCACTTTCAAAGCTAACTAC 
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TAGCTGTTCAAATATACTCCATACAGCTTTCAGCAAATCAAAGTGTTTACCTCTCCCACA 
CCAAGGGAAGAAAAGATGCAGACTGCCTTTAAAGCACCTGTCAGCAAGGCGAGGGGTTTT 

AAAAGGATCAAGCCTTGAGAATCAAAGCAGCAGCAGAAGTGTCATTCTTNCAGTGCTCTC 
CTTTCCGGTTC 

Sequence 249 

TACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACCCATTGGTCTTACAGA 

CAAGCATCAGGAACCAGAGGGCCTGGTGGGGCTGGGAGGAAGCTCGGCAGTGACAGCTGA 

GGTGC TC ATGTCCTTCC ATCC C ACTGCCC AGTGGATAATG AGCTCATTAGTCAG ACGAGG 

ACCAGCCCAGAATAGCCAGGAGTAAGCATGTCACATTACAGAGCTGTAGCCAGCTTCTGG 

GTGGAAATAGCACTATCTGGTACCTGCCCG 

Sequence 250 

CTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCCTTTTTGCACACTGTCA 

GCAAAACACGTCCCTTTGAGTATCTCCGGCTCACCAGCCTTGGAGTTATTGGGGCCCTGG 

TGAAAACAGATGAACAAGAAGTAATCAACTTTTTATTAACAACAGAAATTATCCCTTTAT 

GTTTGCGAATTATGGAATCTGGAAGTGAACTTTCTAAAACAGTTGCCACATTCATCCTCC 

AGAAGATCTTGTTAGATGACACTGGTTTGGCTTATATATGTCAAGACGTATGGAGCGTTT 

CTCCCATGTTGCCATGATCTTGGGTAAGATGGTCCTGCAGCTATCCAAAGAGCCT 

Sequence 251 

TACCCGGGC GAAG TNANTNNGGGGAAAAh4TNCCCCCCCGGGCCGNCCGGGCANTTTTACN 

GGCACA NGGT TTTAAGCCNCNGAGGGGTTTTTGAAANCCCCCCANGANACCNNANGGNGC 

AGGGGGTTTTNNTTTAAAANAAGNNGGAACCGCCGGANGANGAGGGANGGNAAAAACACC 

CNGGGGAAACCNAANAANCCCNAAAAGGGGNCAAGGNGGNAAAACCCCCAAGCNAACCCN 

GGGGGGNCCANGGAAANNCNGGGACNAAGACCCAAACCAAAANGGGNNGGANGGCCGCCN 

GGGGGACAANNNCGGGCCNACNACCAAGGGGAACAANCAANNGGCCGGNAANNGGCCCNC 

AAGGGCCNGGNAACAAAAANNNNNCNAACCCCGGGGAANNAAAGNAAAGGGGGCAAAAAA 

GGCAAANAAAACCNCGCCCAAAANGGGGAAAGAAAAAANNCAAAGGGNGGCCCCGGGGGG 

GGGGNNNCCNNGGAAAAAGNAGGGGGNAAAAAAAAAAAAAAANAAGGANCNNNAAACCCN 

NNCNAAGGNAAGGGGGAANNNNNAAAAGGGGNANNNNNNNGGGAAAAAAAAAAGGGGGGA 

NAACCNGGGAAAAAAGC 

Sequence 252 

CCGCGGTGGCGGCC GC C CGGGC AGGTACAAAATAACATCATAAATAACATAAATGAG AAG 

TTTTCAAGTATTTCAATGTGCCTTGAAAAGTTTCAAACAGTMTTCTCACMTTG 

acaatca i nrrm I CTTTTCTTTTCCTCTCATTTAGTCTCAGAGCCTAAAAAGAAATG 

CCTCCAGTCTCTGTTAGCCCCGATGTTTGGAATGAAATTAACAAGATTCTACCTTAAAAG 

AGAAAACTTATGTGGGCTTTTCAAATTGTGAAAATTTGTTCCCTCTTATAAAATATAATC 

TTTCCCCTGCTGCTGGTTTATAAATATGGGCATATAGCCCAGAGCCATTATAAAGAAAAA 

ACAACCAACCACAATAAACTAAGGGAGCCTCATCTGATGGCTTMGGTTCTCAGGAAAAA 

ATGAGAAGAGCCCCTTGATCAGGGAAAGGG 

Sequence 253 

CCGCGGTGGCGGCCGAGGTTGGAAGAGAAGTTACCCCGATGACTTGGTTTGGAAGGGGTT 

AAGGCACCAGTCATCCTCTTCTAAAGTGATTTATGATGATGTGTGGAGTTrAAAAACTTT 

ACCCCACCCCAAAGAACAGCCCTCTCACTCCTCACTGAGTCCACTCTGAACGTGCTAAAA 

TGGGAAGGAGGCGGTGTTTTGCTGATCTGTTAAATTCTTAGTGMGTTTCCTTGATTTCC 

AGTGGCTGCTGTTGTTTGAGTTTGGTTTGGAGCAAAACTGAGGTMGTCCTAACATTTCT 

GGGACTGAATCCAGGCAAGAGAAAGAAGAAAAAGANGAAGAAAAAGAGGAGGAAAAAGGT 

AGGGAGAAATAAAGGGAGGAGAGAAGCACAGTGAAAGAAAAAAAAAGTCC 

Sequence 254 

GGCCGGTACCTTTAACTCAATTTAATATAACAAGAAATCGTAAAATACTTATAACCTATC 

TTAGAGAAATGAGTGCTGGTTTTGAGAGTTGTTTTTTAACTGAAAGATTATTTCTAGA^ 

GGTAGGGGGCTTTGTGCTGGTTTCTGCTTCCATATATTTCCCAGTCATTTTMTTAGAGA 

AGATACTCTATGGTAGAACTAAGGCCTTTCCTTTCTTGGCCAMGTCTTTACCCTATTTA 

ACCCTTTGTATATTTCTGACTGCTCACTGTTCATATTATAGGGGACCAGATTTGTAATAT 

AAGAATTCTCCATAACATGAATGAAATTAATTCTGTCCAAGCCAGCATGGTGGCTTCTAT 

TAAGGTAGTAACAGGAAGGTCTGMCMTTGGATAAATTTGGACTTTCAAGACAGCTNAA 

CTTTTTCACTGGCAATTTT 

Sequence 255 
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ACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGGGGCCGAGGTACATTGTGGTAGGTC 

CAGGAAATATGACATTTTCCCCCTTGATGTGTTATTGTTGTTGTTGGGTGGGGTGGGCAT 

TTTGTTTATTTGTTTGGTGGCAATCAGTGGTAGTAGGGAGTGGGAGGGCTTATATTGGTT 

TTrCCAGCTATTMGGGGACATATTGTGTCGTTGTGCTTTTCACGTTATAAMTGTTTAT 

ATTTACC AGTACCTGC CCG 

Sequence 256 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTCCATAAGCTTGGAT 

TTTAAACACTGATAGTATCTCATGAGTAATGTGTGTTTTGGGAGAGGGAGGGATGCTGAT 

TGATATTTCACATTGTATGAAATACCATGTTTGAAACTCATAGCAATAATGCTATGCTGT 

TGTGATCCCTCTCAAGTTCTGCATTTAAAATATA I I I I I I CTTTATAGGAATTGATGTAT 

ACCAT GMGT CATTGTCAGTTGTAGTAGCTCTGATGTTGAATGAGATATCATGTTTTAGC 

ATTCCATTTTACTGACTAGGGTAGAAGAACACTTTTCTTGGCTACATTTGGAGGATACCC 

AGGGAGTCTTGGGGTGTTCCTTATCTGGGGAAGCAAAACATTTCACTAGTCTCTTTTTTT 

TCATC 

Sequence 257 

CCGCGGTGGCGGCCGCCCGGGCAGGTACATTTTTTAAAGTTCCTCAAAAAATTCTGCTTC 

TAGGCAAATGTMTAGATATTGTGCGTTGCTCATGTTTTTCCTATCAGAACCATGATTTT 

AAAAAAACTATCCTTTCTGTCCCCATATATCCCTCATGTGGCCTCCACTTCTACTCTCAG 

CTCCAGTGGACCTGACTGCCCTAAAGGTTATTCTAGACCCTTTGCCAGTGACTGGCTCAG 

GGATGGACAGGTCTAAGTTACTTTAGAGCAATGAGACCTAAGGATATGGAAAGTGAATTC 

TGGGAAAGCTTTCTCATACTAGGACAGGGCCCCTAGAATCCAGCCTGACTCTTCCGCTGC 

ATGTAAATGAAGATGCACAAGGCTNCAACTGCTACTGGCAATCATCCCATGACCACGAGG 

GTAGCCAGCCTTAGGATGATGCCCCATGTTTGTGGGTTGAAAANGGGGGGGG 

Sequence 258 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTGCTAGCTGGAAAAACCATATAGAAAACATGA 
TGACCATGTTGAGTTGTTGGTGCTGAGGCTGTGAACTCTTTGGCTTCCTTCTGATGGCCC 
TCTTGAGAACAATCTGTACCT 
Sequence 259 

NNGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAACAAAAGCCTGTGAACTGGCTTA 
AACCAGTTTAC ATG CTGGATTC CGACCCAGATAATAATGGATTCATAAATGAGGATTTTA 
TTGTTTGGATGCGTACC 
Sequence 260 

TAGGGCGAATTGGNAGCTCCCCGCGGTGGCGGCCGAGGTGNCTGAAAGTCGATGAAGGAC 

GCGATTACCTGCGATAAGCTTCGTGGAGTTGGAAATAAACTATGATACGGAGATTTCCGA 

ATGGGGTAACCTAACTGAGCAAACCTNAGNTGCATTTTGATGAATCCATAGTCAAATTAN 

CGAGACACCGTTGCGAATTGAAACATCTTAGTAGCATCAGGAAAAAAAAAAAAACCNNAA 

AANNANAAAGGCTTGACCT 

Sequence 261 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGTATAGGCCATATTCCTT 
TACGTGGGACATTAATGAAAAGCTGTCTTTTCCAAACTGAATGCGCCTATTAATAAAAAA 
ATACACCTGACCATCTAATGCTGAAGTCATCCGCCAAGGTGGTACCTGCCCG 
Sequence 262 

CCGCGGTGGCGGCCGAGGTACAGATTATGGAAAATATGAAGGACTAACAAAGAATTACAT 

GGATTATTTATCCCGACTATATGAAAGAGAAATCAAAGATTTCTTTGAAGTTGCAAAGAT 

CAAGATGACTGGCACAACTAAAGAAAGCAAGAAGTTTGGTGTTCATGGAAGTTCGGGGAA 

ATTAACTGGATCTACTTCTAGTCTAAATAAGCTCAGTGTTCAGAGTTCAGGGAATCGCAG 

ATCTCAGTCATCTTCCCTGTTGGATATGGGAAACATGTCTGCCTCTGATCTCGATGTTGC 

TGACAGGACCAAATTTGATAAGATCTTTGAACAGGTACCTGCCCGG 

Sequence 263 

TACTATAGGGCNATTGGAACCTCCCCGCGGGGGCGGCCGCCCGGNCAGGGI rTTTTTTTl 

MINIMI I ATACAAAAAGCAACAAGAGTTTAATTCTCTTTTTACATGGCCACAGGCTC 

TCTTCAGTCAGGGGAACTTCAGCTGGTGCCTCTCTCTTGCAGCTATGAGGCGACAGTGTG 

GTGACATGCCTCATACAGACTGTCCCAGTAAGCCAGGACAAGTCACCATTAAAATCTTGC 

ATGAACAGCCCTGGGCACGTGGGAATGTTAAGAAAGAGCCACCGCCTCCTTAGTCAGCTT 

AACCACAGCTCCAAACGCAGTTTGTCCCAGCTGGCAAACGCCTCAAACACCAATCATGCC 

GTCGTGCTCCTATTCTGGGGTTTTATAAA 
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Sequence 264 

TAGGGCGATTGGAGCNTCCCCCGCGGGGGCGGCCGCCCGGNCNGGTNCAGATTTNTATTT 

ATCCATAGATAGGGTAT CTATAC ATACACATCTCAAGTGCATCTATTCCCACTCTCATTA 

ATCCATCATGTTCCTAAATTrTTGTMTCTTACTGTAAAAAAAAGTGCACTGAACTTCAA 

AACAAAACAAAAAACAACAACAACAAAAAACAAGTCCAAACTGATATATCCTATATTCTG 

TTAAMTTCAAAAGTGAACGAAAGCATTTAACTGGCCAGTTTTGATTGCAAATGCTGTAA 

AGATATAGAATGAAGTCCTGTGAGGCCTTCCTATCTCCAAGTCTATGTATTTTCTGGAGA 

CCAAACCAGATACCAGATMTCACAAAGAAAGCTTTTTTAATAAGGCTTAA 

Sequence 265 

CTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAAGGTACACGGTGGACC 
TGGAGTCAGGGCTACACTACCTCCTGCGGGTGGAGCTGGCAGCCCACAAGTCCCTGGCCG 
GAGCAGAGCTGAAGACGCTCAAGGACTTTGTGACTGTCTTGGCCAAGCTGTTCCCTGGA 
Sequence 266 

GCGGTGGCGGCCGAGGTACTTTGGGGGAGAAAAACATGATTCCATTTACGGGGAAAAAAG 
CCATTGACACTCAGTAAGCAACACTGCCATCTAGTGGAATGGTGACACACCACCAAGAAT 
TTCA AGACC CG AT AGGA AATGTGAGTGGATTTG GTTTCMTTTTCACCACAAAACAGCAC 
TTTTAATAAGCTGGTTTTCAGAGMCTTCAGATTTTTTTGAG 
A AMTG CATAAATGTATGTGTTTTCTCTG 

ATTTTTCTTATTAGAGCATAAAGTTCTCTACTTGAAAGGTAGAAAACACTTTGGGGCAAC 
CTGCCTCCTTGAACATCCATCACTTCTGCTTGACAGTCTAACAGCACCCCTGATTTTAAC 
AGACAGTTTATAAAGGAAGCAAAACTC 
Sequence 267 

CTNC TTAGG GCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTCTTTGAG 

GACAl I I I IGTCAGATTAACTATAACAGTGTAGTGTAGTTTTTAAAATTGCAGTTGAAAA 

GTTTAGCTGTCTTGGAAGTCAAATTTATCCAATTGTTCAGACTTCTGTTACTACTTAATA 

TGAAGCCACCATGCTGGCTTGGACAGAATTAATTTCATTCATGTTATGGAGAATTCTATA 

TTACAAATCTGGTCCCCTATAATATGAACAGTGAGCAGTCAGAAATATACAAAGGGTTAA 

ATAGGGTAAAGACTTTGGCCAAGAAAGGAAAGGCCTTAGTTCTACCATAGAGTATCTTCT 

CTAATTAAAATGACTGGGAAATATATGGAAGCAGAAACCAGCACAAAGCACTACCCATCT 

AGAAATAATCTTTCAGTTAAAAAACAAC 

Sequence 268 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTTTCATGGTCAAGCTCTAAGA 

CACAACGAGCTCTGTTATTCAAGAAATCAATTCCAGTGGATTTCCAGTTCCAATTCCTGA 

GAACTAGGGTAAGGGGGAGAGCTAATGGTTGCTTCCTAAGGCCTTCTGGGTTTATTAGTT 

CCATTTCAGGACATGACAAGAAAATGTACCTGCCCG 

Sequence 269 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCTTCTA 

AGGTAGTTAACACCACACAGGCTGATAACTCAGGGCAAGAGAGAGGCCTGGAGAGTTGAC 

AAGAGTGTGAAGATACTGGAGAGGGGTTCAAAATAGGATCMTTCTTCTAATACTATTCT 

TATTTCTT GCTAA TAATTTGTCCTTGTAATTATAGTGAAGTAATGGTCTGGATATAAATG 

AGATATGGTTTTGTGAGGGAAGAAACATAGAGGATTATTCTTAACAACTGTGGCAGCCTT 

GCCAATATGGAACC 

Sequence 270 

CCGGGGTGGCGGCCGCCCGGGCAGGTGCGNTCTCAGCTCATTGCAAGNTNTGCCTTTNTA 

AACATGCTATTAAATGTGGCANTGAAGGANTGTTTTTANTGTNATCTTGCTATTAAGTGG 

TAATGAATGTTCCCAGGATGAGGATGTTACCCAAAGCAAAAATCAAGAGTAGCCAAAGAA 

TCAACATGAAATATATTAACTACTTCCTCTGACCATACTAAAGAATTCAGAATACACAGT 

GACCAATGTGCCTNAATATCTTATTGTTCAACTTGACATTTTCTAGGACTGTACCT 

Sequence 271 

TTGGAGCTCCCCGCGGTGGCGGCCGNGGTACATCCACAGGAGGAATCGGACGAGAGGATG 

TGACATTCGGTCCAGGAGA GAAA GAGCAGTTTCTGTTAAAGATGTAACAAATGGATTTCC 

AAAGTCTACATGACATTCACTTTTCAAACTTCCCACCAGTTGAATTTCl I I l k I ICCTTA 

AGAAACAGGTGATGTCTTGGAAAACAGCTCCTTATGTCTCTCTGTGCATCTCCATTTTCC 

TAGTCTCTGGAGTCTCAAAAAGAGTGGCAAAGCACTTTACAGTAGTAACTGAGGAATCAG 

AGTCTCTGCTTCAGCGATATCTAGTTTGTACCTGCCCG 

Sequence 272 
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CCGCGGTGGCGGCCGAGGTACTTGTCAAATGAAAGAACAGGGATTGCCAGACCTTCAAGG 

CAATGGGAAAGGAGCAAATCTGCAAAGGTAGGATCTCTTTGGAAGGCAGGTATTGGCCAC 

CAAGTCAAACTCCTTGAGTCTTATATTCTGATTGGGATGATCTCACATGGATGTTCATCT 

CTTATATGTGMTGCTCATTTGTGAAAAATAGTAAGAGCCAGCTAGGATATTTGGATTCA 

GTCAGGCACCATCAGAATAGTGCAGTGAAAGGCCAAACTGGCCACAAGACAGAGGAATGT 

TTTCAGTTTTCTGGTTTTCCTCTGGTCCATGATAAAGCTCGGAGTAACTCTTCTATCAAG 

ATGGGGCTATACCTTCNCATGACAGAGGCTGGCAATTGAGCTACCCAGCAGAACGTGTGC 

TCTCAAAAGGGGAAGTCAAGGGAAC 

Sequence 273 

CCGCGGTGGCGGCCGCCCGGGCAGGTCAGGCAAGGAGAACTGTGCTATATTGTTGGTGCT 
TGCTG TTTTGCTGGACATCTTCMTTATTGTTTTTACC TGAAATCAGTAAGACTTTGACA 
G G ATATC ACCTG AATTATTAATGAATGCCC AGGAAG TAATTTTCTTCTC ATTCTTCTAAA 
ACTACTGCCTTTCAAAGTGCACACACACGCGTCCACATACACTGCATTCGTTGCTCCAGT 
ATAMTTACATGCATGAGCACCTTTCTGGCTTTTAAGCCAATATAATGGGCTGCAAAATG 
AAGACACCAGAGTGTATGCATACAAATCTCACTGTATTAAAGATGCAGGTTTTCTAATTG 
TACCTCGGCCGCTCTAGAACTAGGTG 
Sequence 274 . 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGG I I I M 11 M IATCAAAAAAT 

ATTTTATTACCAAAAAATTAATTATCCTATACATCCTATACTGGGAAATCCATTGGAATA 

ATTGGCTAAMTAAATACAGGGCAATTAAATTCAACCTTTTTATAAGNAATGGAGAGNAA 

TTTGACATTTGNAATGTTATCAMGCTTTMCTTAGAACATAAATAGTTAAAAAGGGCAA 

ACTCAAGTTTTCATCTCATTTATTGGCATGGTC 

TCAGTGAGGAGTCTTGGAGTTTCAAGAATGTACCCTGCCCGGGGCCGGCCGCTCTAGAAC 

CTAGGTGGGATCCCCCCGGGGCTGCAGGGAAT 

Sequence 275 

GCGGGCCCGAGGGTACACACCGAAAAGGTGGTGGTATTTTTACCTTCCCAAAGGTGGTGG 

ATAGGGTmATTGGGCCCTTmCNTAAAAAAGGAAAATANAAAMCCAAAAAAGGm 

AAAAAATAATTGGAAMGGTTTTAAAAATACTTTC NTTTTCACC AGG AAAA ATAAA AACC 

TTATTTCCAATTTTTTAAAAAANGGGTTAGGAGGAGGAGGANGAAM 

CCTmCTAATCAATTGGAAACCATCCAGGTTCAAAACCCA ANGCCTTTC NACAATrm 

TGGGGGGCCAGGGGNA 1 1 1 I I 1 I I 1 AAAGGTTANTTAAAAA I l i 11 1 i 1 1 i AAAAGGAAA 

ACCTTTAATTTTTTTAATTTTTTAAAAAAAAAANG 

AN N M 1GGGCCAAGGAAAAAAANGGTAAAAAAAATAAGGAAGGAAATT AATGG CCCAA 

AAAAT7T7TGGTCCTTTTTAAAAGGAATTN GGGGCCATTAACC AAAAAAAATTTTAAAAA 

AAAAACCAACCAAGGTTACCCCTTGGCCCCCGGGGGCCGGGCCCCGCT TCTTA AGGAAAC 

CTAAGGGTGGGGGATTCCCCCCCCGGGGGGCCTTGGCCAAGGGGAAAATTTTCCGGANTA 

TTCCAAAGGGCCTTTAATCCGGATTACCCCGGTCCGGACCCCTTCGGAGGGGGGGGGGGG 

CCCCCGGGTTACCCCCAAGCCl (1(11 1 GGTTTCCC 

Sequence 276 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAACTATAT 

ATACATGTGCATATATATGTAMTTTATATTTATATATAATATCTTTATATAGATAGATA 

TCTGCAGACAGGTTATTGATTATAGAGACCCAAGAAAGCAACTCAATAATTGTTCAAAGT 

TTTCCTCACTGACTGCTGGTGTGTAGTTTAAGAAGCCCCCATTTGTTTGCACTCAGAATG 

CCTTATCTTCTTTTAATGGAACACTTGATGTGGAATTTTAAGTCTGAGAAGTGAGGTGCC 

TTCTGAAGAAGAGATTTTAGAGACTCCCTTCCTCTATAAGTTGGAAAT GACCA GAAGTCT 

TAAGTAGACCACAGTTAGCTGACTTTGACATTGTAGGACGTAATCAGCTTTTAACCAAAT 

TATAGAACTGGTAAAGGGTAAGATCAAATTTGCCAAAAAA 

Sequence 277 

CCGCGGTGGCGGCCGAGGTACAGCACTGGGCTTTATAAAGACTGCACTCAGAACCACACT 

GCACAGTCCAGTTTTTTAAAAAGCTGCTACATGACAGACAGGTAATCCCACTGAGTGAGT 

TTTGAGAAACAAATCAAACGAAGTAAACAAGAAACATAAAAACCAAATAGCAAA TGAATA 

AAAGCCTGTTCTTGTAACTTATTCAACTTTTGCCAAATTCCTACCAATCACTTGCTTT^ 

AAAAGAAATGTATAATAGCCAAAAGAGAAATTATGTCCCTGTTGTACCTGCCCGGGCGGC 

CGCTCTAGAACTAGGTG 

Sequence 278 

GAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGI 11 I I I I 1 I M I I AAGTTT AAAATTTT A 
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ATATTTATAAATTTAAAAAATTTTACACGTGCTGAGTGGTAGCAGTGCTAACATTT^GT 

GACCATTAAACCACAAACTCACTGCATAGAGCTTCTTTCTTTGTACCT 

Sequence 279 

AGGTACCTAACCTACCTTTAAGACTGGGATAACTATTGGAAACAATAGCTAATACCGGAT 

ATAGTTATTTATCGCATGATGAGTAATAGAAAGGAGCTTCACAGCTTCACTTAAAAATGG 

GGGTGCGGAACATTAGTTAGTTGGTAGGGTAATGGCCTACCAAGACGATGATGTTTAGCC 

GGGCCGAGAGGCTGTACCTGCCCG 

Sequence 280 

AGG TACAG TGCCAGACCATGACTGTCAATCGTCAGATGAAGCGCTACAACGTTCCGTTTC 

TAACTTTTATTAACAAATTGGACCGAATGGGCTCCAACCCAGCCAGGGCCCTGCAGCAAA 

T GAGG TCTAAACTAMTCATAATGCAGCGTTTATGCAGATACCCATGGGTTTGGAGGGTA 

ATTTTAAAGGTATTATAGATCTTATTGAGGAACGAGCCATCTATTTTGATGGAGACTTTG 

GTCAGATTGTTCiGATATGGTGAGATTCCAGCTGAATTAAGGGCGGCGGCCACTGACCACC 

GGCAGGAGCTAATTGAATGTGTTGCCAATTCAGATGAACAGCTTGGTGAG 

Sequence 281 

CCGGGCAGGTACTTGTTTATAGACTCAAAACATGATGAAAATTCATTTCATTGTCTTCAC 

TGMTAGACTTAACCTTGAC AGTA AGCTTAAAGCATTAGTGTCATTTAATTACATGTCTG 

AGGAATTTATGCTAACTAGATTTTGGGCTTTTGCAGCGAATTTTAGTCGGGGCCTCATTC 

CTGAGCCCAAGTGACCCTTTCACCTCAGCTTCCCAAGTAGCTGGGATTACAGGTGCACAC 

CAACTGTGCTTTGCAGTTTTGT7TGTGTGTGTGTC 

TTACAAGCCCACAACATCATAGCTTTGCAGTT 

Sequence 282 

AGGTACAGCACTGGGCTTTATAAAGACTGCACTCAGAACCACACTGCACAGTCCAGTTTT 

TTAAAAAGCTGCTACATGACAGACAGGTAATCCCACTGAGTGAGTTTTGAGAAACAAATC 

AAACGAAGTAA ACAAG AAACATAAAAACCAAATAGCAAATGAATAAAAGCCTGTTCTTGT 

AACTTATTCAACTTTTGCCAAATTCCTACCAATCACTTGCTTTTTAAAAGAMTGTA 

CAGCCAAAAGAGAAATTATGTCCCTGTTGTACCTGCCCG 

Sequence 283 

ATTGGAGCTCCCCGTGGTGGCGGCCGCCCGGNCAGGTACGGATACAATTCCGCTGAGTTA 
AG ATTCC AAATTCTAACCTCTCCATCACAC G CCCCAGAAAGGACAGTAGCCAGCTTCTCT 
GGATGCTTTGCCAAGCAATTGACTCCATCACGGNGACCATCCAGCGAAGCAAGGAANGGT 
TTTGCAAATACTCG NTCC AGTTTGGTAGCATTTAAAGCTCTTATATATTCTCGN GGGACC 
TCAAAAGGATGTAAAACCT 
Sequence 284 

AGCTCNCCGCGGTGGCGGCCGAGGTACTTACATGGGTGTTTTGATCTCTGTTCTTTCATA 

CTACATTTGAACAGGGCAAAATGAACTAACTGCCATGTAGGCTAAGAAAGAAATGCTAAC 

CTGTGGAAAGTTGGTTTTGTAAAATTCCATGGATCTTGCTGGAGAAGCATCCAAGGAACT 

TCATGCTTGATTTGACCACTGACAGCCTCCACGTTGAGCACTATTCTAAGGAGCAAATAC 

CTTAGCT CCCT TGAGCTGGTTTTCTCTGATGGCACTTTTGAGCTCCTAAGCTGCCAGCCT 

TCCCTTCTTTTCCTGGGTGCTCAGGGCATGCTTATTAGCAGCTGGG 

Sequence 285 

TTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTACATGGGTGTT 

TTGATCTCTGTTCTTTCATACTACATTTGAACAGGGCAAAATGAACTAACTGCCATGTAG 

GCTAAGAAAGAAATGCTAACCTGTGGAAAGTTGGTTTTGTAAAATTCCATGGATCTTGCT 

GGAGAAGCATCCAAGGAACTTCATGCTTGATTTGACCACTGACAGCCTCCACCTTGAGCA 

CTATTCTAAGGAGCAAATACCTTAGCTCCCTTGAGCTGGTTTTCTCTGATGGCACTTTTG 

AGCTCCTAAGCTGCCAGCCTTCCCTTCTTTTCCTGGGTGCTCAGGGCATGCTTATTAGCA 

GCTGGGTTGG TATG 

Sequence 286 

CCGCGGTGGCGGCCGAGGTACTTACATGGGTGTTTTGATCTCTGTTCTTTCATACTACAT 

TTGAACAG GGCA AAATGAACTAACTGCCATGTAGGCTAAGAAAGAAATGCTAACCTGTGG 

AMGTTGGTTTTGTAAAATTCCATGGATCTTGCTGGANAAGCATCCAAGGAACTTCATGC 

TTGATTTGACCAC TGACA GCCTNCACCTTGAGCACTATTCTAAGGAGCAAATACCTTAAC 

TCCCTTGAGCTGGTTTTNTNTGATGGCACTTTTGAGCTCCTAAGCTGCCAGCCTTCCCTT 

CTTTTCCTGGGTGCTCAGGGCATGCTTATTAGCAGCTGGG 

Sequence 287 
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AGGTACTTACATGGGTGTTTTGATCTCTGTTCTTTCATACTACATTTGAACAGGGCAAAA 

TGAACTMCTGCCATGTAGGCTMGAAAGAMTGCTMCCTGTGGAAAGTTGGTTTTGTA 

AAATTCCATGGATCTTGCTGGAGMGCATCCMGGMCTTCATGCTTGATTTGACCACTG 

ACAGCCTCCACCTTGAGCACTATTCTAAGGAGCAAATACCTTAGCTCCCTTGAGCTGGTT 

TTCTCTGATGGCACTTTTGAGCTCCTAAGCTGCCAGCCTTCCCTTCTTTTCCTGGGTGCT 

CAGGGCATGCTTATTAGCAGCTG 

Sequence 288 

AGGTACAGTAAACATTATTCAGTGAGAGATCAGAAGGAAACAAATGGCATCTTTTCAGAA 

CGCTGTCAACTGTTCCCACAACCCAAGTCTGTTTTTCCAAGTTGCACAAGTGCTTGGAAT 

MCTCTGAAACAATTCTCTTCAGAGTTTAAAGGCTTCAGAGTATAGGTGATGCTTCCTAA 

AACAAGMGCCTGTATTAACATCACAAATTGGAACTCATTCCAAAGCCTCTTCTTAGAGA 

AAAAAAAATAAAGAAATAAAAAATAAAAAATGTAAAGMGGTC^ 

AG CTAATTATCGTAAAAAATATC AAACAC 

Sequence 289 

AGGTACAGCATTGATGAACACTTTCAGCCGAAGCAGATTGTCAAGTCTCTTATCCCTTCG 

TGGAACAAACTGGTTTrCTTTGAAGTATCTCCTGTGTCCTTTCACCAGGTGTCTGAAGTG 

CTGTCTGAAGAAAAGTCACGTTTGTCTATAAGTGGCTGGTTTCATGGTCCATCATTGACT 

CGGCCTCCCAACTACTTTGAACCCCCCATACCTNGGAGCCCTCACATCCCACAAGATCAT 

GAGATTTTGTATGATTGGATCAACCCTACTTATCTGGACATGGATTACCAAGTTCAAATT 

CAAGAAGAGTTTGAAGAAAGTTCTGA 

Sequence 290 

CCGCGGTGGCGGCCGCCCGGGCAGGTACATTGTGGTAGGTCCAGGAAATATGACATTTTC 
CCCCTTGATGTGTTATTGTTGTTGTTGGGTGGGGTGGGCATTTTGTTTATTTGTTT 
GCAATCAGTGGTAGTAGGGAGTGGGAGGGCTTATATTGGTTTTTCCAGCTATTAAGGGGA 
CATATTGTGTCGTTGTGCTTTTCACGTTATAAAATGTTTATATTTACCAGTACCT 
Sequence 291 

GAATACGACTCACTATANGCGNNAATCTGANAGCTCCACCGNGGTGGGCGAGCCCGCCCG 

GGGCAGGGG I I I H i III I I I I I I I I I G G TAAATTTTC AATAAG TTTAATTTTTAAATAA 

ACCCMGGGTTTTACAATTAAACCAGGTCAACGGTGGAATGGAAACC FTTTTTn CTTTT 

AAATGGTCAAGCNTNAAAACCTTCAAAAAACCACCMGTTTTTGGTTTCACGGGTTTCAA 

AAACCCAAAAACCAGGCTTCNTTTCAACGGTTTTCCCAAGAAAAGCCTGGCCCTTCAACA 

GGCTTAGGCCACCAAGAATCAACAAGGGAAGAATTTAACTGGTCNTTGGTCCCAT TAACC 

CCAACCCAAGAACCACCAGGGAAACCTTGGAAACCACCCCCAACACCACCCCAAGTTTTT 

TTCAAAAAAGGGAAGGGGGGAAACCTTTACCAAAATGGAAAATGGGCTTGGGCCTTGGCC 

CCAAGGGGGCCAAGGCCCAATTGGAAGTTGGGTAATTCNTGGGGGGGACCTTCAAAAGGC 

CTTGGGGGAAGG I IN I II ICCCAAGGGGGGGGGAGGNAAAAGGGCCCTTGGGGGGAAAG 

GCCTTTTGGTGGGGCCAAANGGGGGAAAAGNTTTGGGGGNAAA 

Sequence 292 

CCCGCGGTGGCGGCCGAGGNACNGANTNATTTTCNGNATTACTGAACNCCTTTGAACTAT 

TTTGAACTNTGMTCTCTTCTCTGAATAACTGCCGAAACTCAAAGATAATAATNAGTTGA 

TGAAGGTTATCAATTAATAAAATAACNCAGACCANTCTTACCAAACTCTAAATACATTNT 

AAAAAATTTTAACTGGCAATGATAAAAAGAATGTTGAGTACCTGCCCG 

Sequence 293 

CCGGGCAGGTACAATAAAGTTCACCCACCCTGCACTTTGGCCCTTAGATCAATCCTAAGT 

AGCCATTGCCAGTAGGCCAAGTTTAATCAGAGGACAGTGCCTACCAGTAAATACTGAATA 

GTTACAATAGTTATGTCCATCCAACCAGTAGCAGATGAACAGCTAATACATCATGATGCT 

ATGCTCTCCTAACAGGGTCCCCTCAGATCCTCAGTGAGCACATAAAGAAAGGGAGGTCAT 

ATCCCTTACATCTCTACCAGGTATTAACACCTAACTACTCTCTAGCCAGAGGCAATTCCC 

TTTATTTCCTTACTCTCGTCGTCTTCTCT 

Sequence 294 

CCGGGCAGGTACATAACATTTCAAATATAAGTGGAAGGATCATCAGTAGTGTTATCAAAA 

TGCATAATACAGAAACTTTTTAAGAAAGGATAAAAAATT ACACTC AGGACCCATAACTCT 

TCCTCATTATAAGCATATGTAGTGATTCATTCATGCAGGTTTTTATATGTAGATAGGATT 

TTTTTTTCCTTTTCAAGAATTCCATTGTAGCCATGAGATGAAAMTGTAT^ 

GTATAGCTTrCTTCTAT rrrGC TTTTAGTGTTAGGTTTGCTAAAAGCTTA 

CCAACTGACATAATGTGTTTTCAATAAGGAGGACGCT 
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Sequence 295 

AGGTACTGATTTAACACTCTGTATTACTGAACTTCTTTGAACTATTTTGAACTTTGAATC 

TCTTCTCTGAATAACTGCCGAAACTCAAAGATAATAATTAGTTGATGAAGGTTATCAATT 

AATAAAATAACACAGACCAGTCTTACCAAACTCTAAATACATTTTAAAAAATAGGAACTG 

GCAATGATAAAAAGAATGTTGAGTACCTGCCCG 

Sequence 296 

CCGCGGTGGCGGCCGAGGTACAGCACTGGGCTTTATAAAGACTGCACTCAGAACCACACT 

GCACAGTCCAGTTTTTTAAAAAGCTGCTACATGACAGACAGGTAATCCCACTGAGTGAGT 

TTTGAGAAACAAATCAAACGAAGTAAACAAGAAACATAAAAACCAAATAGCAAATGAATA 

AAAGCCTGTTCTTGTAACTTATTCAACTTTTGCCAAATTCCTACCMTCACTTGCTTT^ 

AAAAGAAATGTATAACAGCCAAAAGAGAAATTATGTCCCTGTTGTACCTGCCCG 

Sequence 297 

CNCCCCGCGGTGGCGGCCGAGGNACAGNACTGTTTTTTNNAAAGACTGCACCCCAGAACC 

ACACT GCAC AGTCCAGTTTTTTAAAAAGCTGCTACATGACAGACAGGTAATCCCACTGAG 

ACGAGTTTTGAGAAACAAATCAAACCGAAGTGNACNNGAANCATAAAAACCAANTAGCAA 

ATGAATAAAAGCCTGTTNTTGNAACTTATTCAACTTTTGCCAAANTCCTACCAATCACTT 

GCl I 1 1 1 AAA 

Sequence 298 

CCACCGCGGTGGCGGCCGAGGTACTATGTCAGNTTTGTATGTAACTAGCTGTCAGGTCTT 

TCCCCGTTGCCTCAGCCTTTCTACACAGACTGGCCTTCAACTTCCCCTGAGTCCAGAAGT 

AGACTCTTTCAGCAACTCTATTCAGGAATCTGCAGCAGGAAAACTGCTTCCTCTATTAAC 

ATCTATGACTGAAGCACAGATGTGTCTAATAGAAATCACCCTTCACCCAAAAGCTGGGTG 

CAGAAAGGGAAGCCCTTAGCTGACTATAGGAGGTGCCTCTTGTGGCTCCACGTGCTTCTT 

ACACACCACCCCCCAGCTTGAGCGATGCCTCAGCCAGCTCACCCTCATCCACACAATCGC 

TAGAAA 

Sequence 299 

CCGGGCAGGTACCATATCTCCTGGCTTCCTTCTACATGGGTCACTTAGTTAAGAGGGAGG 

CCAAGGGAGTTCCGATTTCAGGCAGTGTGTGGCAGGGTTACTGTCCTAGCAACCTGGCTA 

CTCCTCACTGTGAACGTTTCTCATAGGTGTCATATGGCAGGATGAAAAACATATTTGCCT 

CCCAGTGAAAGATGGCACAGGCTTTTGCCCAGCCAGGTTGGCAAGAGAACAGAACTCTTA 

ACCCCTTGCTCGACAGGTTTGAGTTCAAGGGGTTGGATGCTCCAAGCAGAGGGCCAAACC 

CTGATTTATGAAGCATGCTAGGTCAACAGCCAGTCAGACCA 

Sequence 300 

CCGGGCAGGTACCTAAGGGGTTACTTGTTTAATGGGATGGCATTGACTTTTTGAAAATCA 

AGTGGACTGAGTCATTGATAAAACATTTCTAAGAGTGGGGCTAGAGAACATACTTTACAT 

CTGACATCCTTTGGCCTAACAACATCTATTATTATAGTGCTCAGCAGTGTGGGCATTGAA 

GAGGCGCAGAATGCTTTGAAAGAAACTAATCAGAATCTTGGAACATCATGATCATGCCAT 

TCTTAAGTAAATCAACTATTTTCAACACTGAAGAAAMTGAAACATTATTTAGAAAACAA 

TGAGATTACAAGTTCCAAACTCAGCCA 

Sequence 301 

CCGGGCAGGTACCCTCTACCTCCTTCCCTGTCGGAAAAGTCACCACTTTAGTCCCTGGCC 

ACCAGCACACCCCAGGAGGGTGAGTGGCCTGAGGTAGTTACCGGCACTTAAAACTCCCTT 

GCTACCGATCTGGAACTCAAGCCCCAAGACATCCCCTTAGATGATCTGAATACGCATTCA 

GGGACAGATCTAGGCAGTTTCTAAACAACACTTAGACTGGGGTCTAACGTTGACAAATCC 

TTCTAGAATTTGCCTCTTTGGGACTGAAGTCTAAGGGGCTGAGACCAAGAAGGGAGAGCA 

CAAGACTAACTTTGGTCTCTTGACCTTTT 

Sequence 302 

AGGTACCTTCTAAGGTAGTTAACACCACACAGGCTGATAACTCAGGGCAAGAGAGAGGCC 

TGGAGAGTTGACAAGAGTGTGAAGATACTGGAGAGGGGTTCAAAATAGGATCAATTCTTC 

TAATACTATTCTTATTTCTTGCTMTAATTTGTCCTTGTAATTATAGTGAAGTAATGGTC 

TGGATATAAATGAGATATGGTTTTGTGAGGAAGAAACATAGAGGATTATTCTTAACAACT 

GTGGCAGCCTTGCCAATATGGAACCCAATTTCAGAGTTTGTGAAATGGTACCTGCCCCGG 

GCGGNCCGCCCGGGCAGGTACAGTGACCTAAAGT 

Sequence 303 

AGGTACTCAAATGCATTGATTTTTCCAGCTGGCTTTTCAGAGCAGTGCTAGTGAGAGGCA 
CCAGTGTAACTTGAAGCAGGAATAGTGATGGCTCAGGTCCCCACAGGTAGCTCCTCCATA 
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CCTGTGGGACTCAAAGGCTAAGGGCACTAACTCATAG6GCTCAAAGGCTGGATGAAGAGA 
ACAATCAATGTGTTAAATGTCCTTCAGACCTAGGTCAGAGGACAMTTTTGACTAATTCT 
TTTGACTAA7TCTTTTTGCAACTTCTCTCAACAGGTACCTGCCCG 
Sequence 304 

CCGGGCAGGTACCTATTCACCATTCCAACGTGAAGAAGCTCTGCAGTAGGAAAAATAATT 

AACACACTTATAGTCTACTGCCTATGTAAGGATCAGCTCCGGCTAAGAGGCCAAAGATGG 

GTGACATCGTTATGCTCTGCCTTTAT f I I I ! CTTTCTTACCCACTTAGCTTCCTAATTGG 

AGGAAGGAGGCGTGGTAAAGGTATATGAAGACTATGGTTTAATTAGACCAGAAAACACTG 

TCATMTCTCTGGGGTCATCAGAATGTCCAGTT7TGTCTTTGGGCCAAGATAAGGGCAGT 

GG GATTTATGATGTGTTGTTTATAGTCTG AAACTACTC 

Sequence 305 

AGGTACGCATGTCGCAGGGTTAAGTATGATGCAGAGGTTAAAGTCTGTTTGAACAAAAAC 

AAA TGC C C GGGG AAATTrCATAGCTATAAAGTTAATMCTAAATTTTGTTCACTAAG AGG 

ACCTTTTCTATGGATTTCCTTCATCTCTCAGTCACACTGCAAACTTATCTGAAGTGCACT 

TCCCAAGTGTAGTTAGAGAATAGGAAGGAATGTAAAA I I I It I M I I AATTATTTGTTTT 

GTTTGTTTGTTTGTTTTTTTAAAAACAGCCCTTATTAACTCT^ 

TGTAMCTCATAAAAGCCTTTTATAGAAAGCAAATTC 

Sequence 306 

CCGGGCAGGACTAGGGTGGACTAATGTCTCACTTCAAAAAAAATATGAGTGTAGTATACT 

TGGCCTAAAATTGTTACTCAGTGTTACTAAAGTGTAATCATTTTCCACATCTCCAAATAT 

ATTCCMGTCTCTGAAATACTAGTGCATTGGGAGGTAGAACATTTTCGAGCTTCATGTCT 

GTAAATGGTTTCTCTTTACTGTCATTCAGTAAAATCAGAATAAATGATTGATGTAATACT 

TAAAAAAAAAAAAAAAAAAAAAAAACCT 

Sequence 307 

CCGGGCAGGTACATTCAAATCAGGACACCTGGGTATATGGACAAAGATATCCTACAATTA 

AAATGGAATAGAAGTTAAAAAAAAAATTCAGTCCCCTACACCACAGGCCTCAAACATGCT 

TGCTGATGGATTCTGGGAAGGTTTTGGATGAAGGACGGAGATATGGGAGGAAAGTGAGAA 

AACAGTGGATTCCCTTTGAAAAGTATGCTAGCAGACMGCATTTGGTTTTATGAAAGAGG 

CACTCTTATAGAGAAAGAAGCTAGTATGTGGTGTATAAAAAGCCCCTAAATCATCACCAG 

AATGTCTATCCATGATTGTACCTCGGCCGCT 

Sequence 308 

AGGTACAGCCTCTCGGCCCGGCTAAACATCATCGTCTTGGTAGGCCATTACCCTACCAAC 

TAACTAATGTTCCGCACCCCCATTTTTAAGTGAAGCTGTGAAGCTCCTTTCTATTACTCA 

TCATGCGATAAATAACTATATCCGGTATTAGCTATTGTTTCCAATAGTTATCCCAGTCTT 

AAAGGTAGGTTAGGGTACCTGCCCGGGCGGNCGNCCGGNCAGGNCATAATACATGATTGA 

ATACATGANCGNATNTAACATGNN I I INK I CTGNNGNGGNCAAATAAACATIMCTNAAA 

G NNGCANCTGC N NATCAG NN ACCCTTAG AAAA 

Sequence 309 

ANCCCCCCGCGGNGGCGGCCGAGGTACGCATGTCNTTTGGNNNAGTATGAANCACNGGTT 

AAA GTCTG TTTGAACAAAAACAAATGCCCGGGGTAATTTCATAGCTATAAAGTTAACAAC 

TAAATTTTGTTCACTAAGAGGACCTTTTCTATGGATTTCCTTCATCTCTCAGTCACACTG 

CAAACTTATCTGAAGTGCACTTCCCAAGTGTAGNTAGAGAATAGGAAGGAATGTAAAATT 

I I 11 i I i I AATTATTTGGTTTGGTNGGTNGGTTGG \ WW I'AAAAACAGCCCTTATTAAC 

TCTTCCTTCCACTGATTCTACTGGAAACTCATAAAAGCCTTTT 

Sequence 310 

CCGGGCAGGACTAGGGTGGACTAATGTCTCACTTCAAAAAAAATATGAGTGTAGTATACT 

TGGCCTAAAATTGTTACTCAGTGTTACTAAAGTGTAATCATTTTCCACATCTCCAAATAT 

ATTCCAAGTCTCTGAAATACTAGTGCATTGGGAGGTAGMCATTTTCGAGCTTCATGTCT 

GTAAATGGTTTCTCTTTACTGTCATTCAGTAAAATCAGAATAAATGATTGATGTAATACT 

TAAAAAAAAAAAAAAAAAAAAAAAACCT 

Sequence 311 

CCGGGCAGGTACATTCAAATCAGGACACCTGGGTATATGGACAAAGATATCCTACAATTA 

AAATGGAATAGAAGTTAAAAAAAAAATTCAGTCCCCTACACCACAGGCCTCAAACATGCT 

TGCTGATGGATTCTGGGAAGGTTTTGGATGAAGGACGGAGATATGGGAGGAAAGTGAGAA 

AACAGTGGATTCCCTTTGAAAAGTATGCTAGCAGACAAGCATTTGGTTTTATGAAAGAGG 

CACTCTTATAGAGAAAGAAGCTAGTATGTGGTGTATAAAAAGCCCCTAAATCATCACCAG 
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AATG TC TATCC ATG ATT 
Sequence 312 

AGGTACAGCCTCTCGGCCCGGCTAAACATCATCGTCTTGGTAGGCCATTACCCTACCAAC 

TAACTAATGTTCCGCACCCCCATTTTTMGTGAAGCTGTGAAGCTCCTTTCTATTAGTCA 

TCATGCGATAAATAACTATATCCGGTATTAGCTATTGTTTCCAATAGTTATCCCAGTCTT 

AAAGGTAGGTTAGGGTACCTGCCCGGGCGGNCGNCCGGNCNGGNCATAATACATGATTGA 
ATA 

Sequence 313 

AGGTACAAACAGTAGAAC MTACTG ACAAATGCAAACTTAGTCACATGTGCTTTAATAAC 

TGACAATACATTCAAGCAGGTTTTTCTAATTCAAGTGTATAGCACATATTCAAATAATAG 

GAAACAAATCTCAT GCAAA GTAAATAAATCTTGATGTNTAAAAACTGATTAGAACTAGAA 

AAGMGTGGACATGTTTTATTATCTTTGCATTACGTTCTAACAAAAGCAGATTATCAGGG 

GCTCTTACTCACTTGCCATTCCTGACATGAGCACTATAAGTGAATACTATGAGTTCTACA 

AACAGAACATTTTTCCACATGAATTTGACTTGCAA 

Sequence 314 

CCGGGCAGGTACGCCGTTTCTTGTAAACACGAGGCACCCCAAGATAAGAAGACAGATAGA 

GCAA GGGATGGACATGGTCATCTCCTCAGTGATTGGAGAAAGTTACCGGCTTCAGTTTGA 

TmCAAGAGGCAGTGAAGAATTTCTTCCCCCCAGGAAATGAAGTGGTTAATGGAGAAAA 

TTTAAGCTTTGCATATGAATTCAAAGCTGATGCATTATTTGAT^ 

GCTCAGTAATTCCGTTGTAAAAGTAAATGGAAAAGTTCTGAATTTGTCAAGTACCT 

Sequence 315 

AGGTACATTTTGTATTAACTGMCTCAGCTAAACAGTAAAACCTGTTTTACTTAATCCCT 

GTCTTGACTACCAGACTATCATGATGTTTGTTGGAACTGTAATTCCTGCTCTCCATTTCT 

CTCTGTCCCCCAAGTTGAAAATATAACCCAAATCTrrAGAATTTTTGCCTCTCATTCCTT 

GCAACTCCAGTGGGCTAGAATCTTGATCCAGTTTTCTCCAGACTAGATTTCAAGCTACTC 

TGCATGATGTAGAACATGTAACCATGTAACCTCATCAGCTACCTTTTCCCTTTTGACTTT 

TCCTGTTGCACACAGTTCAGTCTGACTATC 

Sequence 316 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGAATGCGATCACT 

GTGGGAAGGCCTTCAGCATAGGCTCCAACCTGAATGTGCACAGGCGGATCCACACCGGGG 

AGAAGCCCTACGAATGCCTTGTCTGCGGGAAAGCCTTCAGCGACCACTCATCCCTGAGGA 

GCCACGTGAAAACTCACCGGGGAGAGAAGCTCTTTGTGTCATCCGTGTGGAAAAGGCTCC 

AGTGAGCGCGCCTGCTTTAGAGACACAGGATGATTCAGACCGGAAACAGACCTCGTGGGT 

GTAAGAGGAAGCCTCTGTGAGCTCGCACCTTACTGGGTGCAAAAGAATCCACGGAACTTG 

GGAGAAGTCCAGTTCCTGTAAAAACTGGGAAGACGAGGCGTTCTCATCCCATAGGAGGTT 

TGTGAGAACTCACGCCGGGGGTGAAAATGTACCT 

Sequence 317 

GGAGCTCCCCGCGGNGGCGGCCGAGGTN I ll 1 I I I 11 I AGTTTAGTTTATTTCTAGTTCA 

AAAATAATCTGTAATTGCTGTAAGAAATGTCAACCACTTACCTAGGATGTTTGACAATTG 

GGATGAAGTCTACATATACTMGTMTGGCNAGACAATTATTTTATTGCTCAAAAGAAAG 

TCAAAAAAATTCCATATTCCCTTTGGGGAAAATTGGCAGGATTTCAAGTATGACCTTTAA 

GMTCAGGAAAAGACTAACmTGCTTTAGGATTAAAACAATCAAATAATTAMTTAGTT 

C 

Sequence 318 

TCTGGCGAACATACATGGCATATAGAATATCNCCTTCTTCCACATGCAGCCTGAACTTCT 

TCACCTTCTCAAATAAGGCCTTAGCCTGCTCACCTTCCTTTTTATCCAGAGCCCCTGCAG 

TGGATTTATCAACTACAAACTTCTCAGGAATGGACTTTAAAGACTGAGAGTTGACCAGGC 

TGCCTFCTGGACCAGATTGGATGGTGTTGGACCTGTTCATGTTGTTCTTCCTGTTGTCCT 

TTTCTTCTGAGTCCTCCCCAGACACTTCAGATAAAGCCCCGTGTGGTCCAAGGGAGAGTC 

AAAACACTTCCCCAGAATGGGAGATGAAAATGTTCTATTTTGGGAGCTGGGACCAGGGGA 

ATTTGAACCAAAAAGTTACTGGCAGAGCATAAANACCCAGGGTNTNGATGAAGGCCAAGG 

TAAGGGNAATCTTGGCTTACCAGTGGAGGGCTCGCTTATACACATTTTGNTNANAACGGN 

NNCCTTCCCCGGGCCGGCCCCCCGGCAG 

Sequence 319 

TNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTAAACAT 
CTCTTAGTAATTGAGAAAATTGAAAGAAAAGAAAAAAGAGAAAGGGAGAAAGAGAAACAG 
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CGAAAGGGATAATGAAGGCGAGAAAGAAGAAGAGAAAGGAAGAGGAAGAA AAGTA AAAAG 
GAGGAGGGGGAAGGAAGAAGGAAGAMGGTGAAAAGAAAGAATGGTAAACTTTTTAACAA 
CATAATTTATCCTTCTAGAATATGAATGTTGGTCTATTTGATGATGTCCCACAAATTCAT 
TAGTCTCTGCTCATTGTTTA'I III! I ATTCTTTCTGTTTCTCAGAGACAGTATTTTCCAT 
TTTCTTCTCTTC AAGTTCATGG CTTCCTCTGTGTGTGCAAATATACTCTTAAATCC CTCT 
GGTGAT I I I I AAA I I I I I ATCATTGNAGTTTTCCACTCCAGAATTTGGTATCCTCTTTGC 
TGATATTCCTAC I I I I IAATAI Mil I CTGATTCCCTTATTTCTTTGGTTATGGNTTC 
Sequence 320 

NCCGCGGTGGCGGCCGAGGTACCGCTTATGCCAGGCCGATCCCGACCTGGACTCAGACAA 

GGATGAAACAGGCGCCTATTTAATCGACAGAGACCCCACCTACTTTGGGCCTGTGCTGAA 

CTACCTGAGACACGGCAAGCTGGTGATTAACAAAGACCTCGCGGAGGAAGGAGTGTTGGA 

GGAAGCAGAATTTTACAATATCACCTCATTAATAAAACTTGTAAAGG 

Sequence 321 

GGGCNATTGGAGCTCCCCGCGGTGGCGGCCGAGGTTTTTATTTCTAGTTTAGTTTATTTC 

TAGTTCAAAAATAATCTGTAATTGCTGTAAGAAATGTCAACCACTTACCTAGGATGTTTG 

ACAATTGGGATGAAGTCTACATATACTAAGTAATGGCAAGACMTTATTTTATTGCTCAA 

MGAMGTCAAAAAAATTCCATATTCCCTTTGGGGAAAATTGGCAGGATTTCAAGTATGA 

CCTTTMGAATCAGGAAAAGACTAACTTATGCTTTAGGATTAAAACAATCAAATAATTA^ 

ATTAGTTCAATTTTCTAACATAGTCTCTATCTTCAGTTAAAGTGCATCATTGCATGTTAT 

ACATTACTAAAATTCCACAGTGCATAATTGTTACCCATGTGACTATTTAATTCAGGGTCA 

ACTGTCTAAAGGCTCAAGGTGTCACATTAAGGGTATAATCTAGAATGAATTGGACNA 

Sequence 322 

AGGTACATACTCAGACATGTCTAATACAATTCCAGTGTTTTATAAACTGGCAGAATTGAT 

GTGTTTTATAACAATTCAGTATCCAAAAGCCTGAGGAGACTTGGACCATTTTCCTAAGGG 

ATGAAGCGGGAAACGTTTATTAACCAGAGCCAGATTGAGACCTTAATAATAAAAAGACAT 

GGGTCCAACCCATAGGTCACTCCAAATGGCATCAACACTAAACCATAAATAAGCGTTAAC 

AAAGTGCAAAGTTCTAAATTTTCGCATAGCTACTTCMGTCCTCTCTTTTTTAAACATTA 

TTCTTATGTGGCTCACCCGTCACTAATGCTCAGTGTGCCTCTTGGGTGGCTCAGCAGTGG 

GAAGTCCACTTTCAGCCTAACTACTCCTAACAATGAAGCTGTTCTCCAGAGAGCAAGAAT 

AGCCAGCCACATGCCCCCTTCTCTGC I I I I \ CATCCCCCTCCATCTG ATTC AATGGCGGT 

GTCCCTGGCTTGCAGGCTTCAGTATGCTTCA 

Sequence 323 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATTGAATCATCATGTAAGG 

AG I II I I AAAACATTGTTGCCAGGGCCCCTTTCTAGACCAAGTTAGTCAGAATGTTGGAC 

MTGAGGCCCATGCATGGGTATTTTTACAAAGCTCTCTGGGAGATTCTAATGCTTAACCA 

AATTGAGAAGCACTGAATAAGAATATCCTGGGCCGGGCGCACTGGCTCATGCCTGTAATC 

CCAGCATTTTGGAAGGCCGAGGCGGGTGGATCACTTGGGGTCAGGAGTTCGAGACCAGCC 

TGGCCAACGTGGTGAAACACCGTCTCTACTGAAAATACAAAAAATTAGGTGTGGTGGTGC 

GTGCCTGTGTTCCCAGCCACTCGGGAGGCTGAGGCAGGAGAATCGCTGGAACCTGGGATG 

TGGAGGTTGCAGTGAGCCAAGATTGCACCACTGTACTCCAGCCCG 

Sequence 324 

CCGCGGTGGCGGCCGAGGTACATTGMTCATCATGTAAGGAGTTTTTAAAACATTGTTGC 

CAGGGCCCCTTTCTAGACCAAGTTAGTCAGAATGTTGGACAATGAGGCCCATGCATGGGT 

A I I I I IA CAAAGCTCTCTGGGAGATTCTAATGCTTAACCAAATTGAGAAGCACTGAATAA 

GAATATCCTGGGCCGGGCGCACTGGCTCATGCCTGTAATCCCAGCATTTTGGAAGGCCGA 

GGCGGGTGGATCACTTGGGGTCAGGAGTTCGAGACCAGCCTGGCCAACGTGGTGAAACAC 

CGTCTCTACTGAAAATACAAAAAATTAGGTGTGGTGGTGCGTGCCTGTGTTCCCAGCCAC 

TCGGGAGGCTGAGGCAGGAGAATCGCTGGAACCTGGGATGTGGGAGGTTGCAGTGAGCCA 

AGATTGCACCACTGTACTCCAGCCCG 

Sequence 325 

CCGCGGTGGCGGCCGAGGTACTGTCCTTACMCAGTTTGGACTGAATATTCTTGTTTCCA 
TTCACAAACATTTCTTTCTGAACTAGCAACAGCACTAATCAATAGCTACACTTAAGCCAT 
TTAAGACTAAGCTTTAGATGTGTGTCTAATTATCTGGCAAA AATAAA AGAACCAAAGAGT 
TTCATGTTAGCAGCAAATTAACTAGATTTAAGGGTCTTGA I I I I II I CATAAGCCTTTGG 
AAGCCTGCATATTAAACACTGGTGAGTTTTAAATTCT 

ACTGAACCAAGCCAGCTCATATTACAACG ITTTTTTTTTTTTrm CGTGGTTCTGAGTG 
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CAGTCTTTATAAAGCCCAGTGCTGTACCTGCCCGGGCGGCCCaCTCTAGAACTAnnnnnnn 
nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCGTCCACCTCGAGGGGGGG 
CCCGGNCCCAACTTTTGGTNCC 
Sequence 326 

CCGGGCAGGTACCGCCATGATGAGAACATCTTGGAGTCCGAGCCCATTGTCTATCGACGG 

ATCATAGTGCCCCACAACGTGAGCAAGGAGTTCTGCACGGACTCTGGCCTGGTTGTCCCA 

AGTATTTCCTATGAGCTGCATAAAAAGCTGTTGTCCGTGGCTGANAAGCATGGGCTGACC 

CTTGAGCGGAGACTGGAGATGACAGGTGTGTGTGCCAGTCAGATGGCACTGACCCTCCTN 

GGAGGACCTAACAGGTTGAATCCCAAAAATGTTCACCAGAGGCCTACAGTGGCTCTACTG 

TGTGGACCTNATGTGAAGGGGGCTCAGGGTATCAGCTGTGGAAGGCACCTAGCCAACCAT 

GATGTCCAGGTCATCCTTTTCCTGCCCAATTTTGTCAAGATGTTGGAATNTATCACCAAT 

GAGCTGTCGNTCTTCAGCAAGACCCAAGGCCAACAAGTGTCTAGCCTCAAAGATCTGCCC 

ACTAGCCCTGTGGACCTGGTCATCAACTGCCTGGATTGCCCTGAGAACGTCTTCCTGCGC 

GAT 

Sequence 327 

CCGCGGTGGCGGCCGAGGTACACATGCATGCATACACACCCATACAAACATCTGTGTGAG 

GGCAGTTCTGGAGATGAGCAGAGAGAGACCGGAATAAACTCMTCTTTTCTTTCCCAAGC 

TCCTAGCCAACACTATCCTTGGGAGAAAGAAATTTGCAGAAACTGCTAAGACCAAGTGTG 

GAGATGTCAAGCTAGTTCACACTCTGAGGCTCAGAATATGTAGGACATGCACAATTGTGC 

AGTCCTTTGGGATTGGAAGTGAAACAGTCTGTGATCCCCTACCTTCtAGGGAACTAGGAC 

NTAGGAAGAGGTAAAGATTATCAGGTATGCAAAGCGCCCCAATTNTTCTGCTGCCATGGG 

GGATTTTACCCCA 

Sequence 328 

AGGTACATATTTGTCCTTCTAGTTTGTTACTATCCTTCCCTGAAAGAGCGGAGCTGTTTA 

TAGGAAGCACAACATTTGAGTCCCA7TCACTGCTGTTTGTATTACATTTTCACAAAGCCT 

GCTTTGAAAGCTGGCAAACACTGCAAGTTACATGTTACCATATTACACATGGGAGGACCC 

ATACACATTGTTATTTTTAAATTTAACATTGAACAAAAAAAAGGAGGAATCAC 

GGAAACAAAACACCAAAGTTAGTCAGTGGAATGGTCAAGAGCAGGACAAGCCTGCCAGGG 

CC AC TG AACTCC AGTCAC AAGTGGTCAAAAGTATG AC CG AG AAC AACTGGATGCAGG AAC 

ATCAGCACAAAATACAGATGGGAATATTCTCGGGAGTGGTACCTGCCCG 

Sequence 329 

CCGGGCAGGTGTGAAAGTTTAACGTTTGCGATAAACTGCCGTMTTTTGATACATCTGTG 
ATTTAGGTCATTMTTTAGATAAACTAGCTC^ 

AAAAACTTCTTTAGGCATTTGCCTAAGTTTCTTTAATTAGACTTGTAGGCACTCTTCACT 

TAAATACCTCAGTTCTTCTTTTCTTTTGCATGCA'i I 1 1 I CCCCTGTTTGGTGCTATGTTT 

ATGTATTATGCTTGAAATTTTAA I UHHIli I GC ACTGTAACTATAATACCTCTTAAT 

TTACCTTTTTAAAAGCTGTGGGTCAGTCTTGCACTCCCATCAACATACCAGTAGAGGTTT 

GCTGCAATTTGCCCCGTTAATTATGCTTGAAGTTTAAGAAAGCTGAGCAGAGGTGTCTCA 

TATTTCCCAGCACATGATTCTGAACTTGATGCTTCATGGAATGCTGCATTTATATGTAAG 

TGACATTTGAATACTGTCCTTCCTGCTTTATCTGCATCATCCACCCACAGAGAAATiSCCT 

CTGTGCGAGTGCACCGACAGAAAACTGTCAGCTCTGCTTTCTAAGGAACCCTGAGTGAGG 

GGGGTATTAAGCTTCTCCAGTG I I I I 1 1 GTTGGCTCCAATCTTAAACTTAA 

Sequence 330 

AATNGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGATGATGCCAGTGTTCCGGGCTGTGA 

TGGGTGGTGAATCAATGTCCAGGCGGCACATGTGCTCCAGGAATGTGTCAGCCATGGCTG 

CGTGCAGCTGCTGGGTCTGAATGAAGGCAGTCCCGGCTTCACTATGGGGCTTCGACATGG 

CTGCTGAGGTCCTCCGGCGCTGACCGACTCGACAATTAGAAAACCTGCATNTTTAATACA 

GTGAGATTTGTATGCATACACTCTGGTGTCTTCATTTTGCAGCCCATTATATTGGCTTAA 

AAGCCAGAAAGGTGCTCATGCATGTAATTTATACTGGAGCAACGAATGCAGTGTATGTGG 

ACGCGTGTGTGTGCACTTTGAAAGGCAGTAGTTTTAGAAGAATGAGAAGAAMTTACTTC 

CTGGGCATTCATTAATAATTCAGGTGATATCCTGTACCTGCCGGGCGGCCGCTCTA 

Sequence 331 

ANGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAGACTCTTAT7TNCTGCAGC 
TNTTGAGAATACTAAAGATACCTGAGTNATTrTAAATAAATTTCAGGTTNCTTTACCAAA 
CTACCCTTAACCGGATTCTCCTTGAAATGACATCACCTGACACCCATGGCATGCTGNATG 
CCCACAGNTAGATTACTTTTTAGTGCGCCCACCCNCTTCTGCTTTTATTAGGNGAGGCAN 
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MTCAAGATTCCCTTTTGNTGNATCTTGNACCTGTTCATGCCACTTTGATATTCTAAATT 
CATACATAAACCAAT 

Sequence 332 

TATACGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTAGTGTTTTT TAGA TACA 

GAGACTTGGGGAMTTGCTTTTCCTCTTGAACCACAGTTCTACCCCTGGGATGTTTTGAG 

GGTCTTTGCAAGAATCATTAATACAAAGAA 1 1 I 1 1 I I I AACATTCCAAT GCATTG CTAAA 

ATATTATTGTGGAMTGAATATTTTGTMCTATTACACCAAATAAATATATTTTTACCTG 

CCCGGGCGGCCGCCCGGGCAGGTACATAATACATGATTGAATACATGATCGTATTTAACA 

TG TTTTTTTT1 I CTGCAGTGGACAAATAAACATCCTCAAAGTAGCAACTGCAAATCAGTT 

ACCCTTAGAAAAGCAAGACCAAACACTGTAGTTACACTATTAAGCAGTGACCAAAAAGGG 

CTAATATTTTCTAAGAATAGTTTAMTTACAGACATTTGGTATATTTACCTTATGTGAAA 

TACATCACTATTTAATTACATTAMTTTTAACATCTGTTGNGTGGAGGTTGJATAGGTTC 

ATGCCAAAGCCTGGGGGTTG 

Sequence 333 

GCGAATTGGAGCTCCACCGCGGGTGGCGGCCGAGGTACTCTTTAAAAAAAAAAMGTTTT 
AGCATGTTACATATTCCTMTAAGTGAAATTTTATGTCTGNNCTAAM TTTAC TTCACTT 
CCTCGNGTAAGGAATGCCTTGCTTAMCJCAATATATAAAMTCAGAATTTTAAAAAAAT 
TAAAMCCAATTAGAAAATGGAAAGTTTATTCACT^ 

TTCTAAAAl WW I MGTTATTCTTTTACNTTTTTCACACAATTTTTATGGTTACCCTGG 

GCTTTGCCCAATGGGTGGGGGGCCCTTTTGGAAAGGTTTCCATTATTCNCCCCTGGACCA 

AGGNAAAAGGGGGAAGGMTCCTTGCTTCTGGAAGGTGGTTTGCCCTTAAAAGGACCATT 

CTACCAAAAACCAAAMCCCCTGGGGNGGTNTTTNAATTCCCA GGTGTGG AAAAAAAAAA 

ATGGAATAAATTTCTNGGTGAAAGGCAATC NTNGGGGTCTTTNTTTTTTTTAC CTAAAGC 

CAAGGCCTCTTAGGNGGTNTTGGAGGCCCTAATCCGTTTNGGGGAAACCTGGGNGGGAAA 

AAATTAATTTTCTTCTTAAAGTAAGCCTTCTTTTAA AAAAT NGGGGANTCCCCTTGNCTA 

ACAGGAGGANGMCTTAMNGCTAATNGGGCTTCTTTTCCTTCAGGGGGGCCTTTAGGCT 

CTAAAAAAATGGGAAAGTTrCTTTTNTTNGGACTGGAAAACTANGAGGAAAAAGGTAAAA 

Sequence 334 

GGCCNATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTGG CTTGCC CCGGAC 

CACANGCCTCGTAACGGTAACCCCTGCTTTCCAGGGGCCTGGCACCCCCATTTTTAAGNG 

AAGCTGTGAAGCTCCTTTCTATTACTCATCATGCGATAAATAACTATATCCGGTATTAGC 

TATTGTTTCCAATAGTTATCCCAGTCTTAAAGGTAGGTTAGGTACTCAGCAGGAAAAGAA 

AAAGCAGTTCCTGGTTGTTATTTCACCTCGGCCGCCCGGGCAGGTACTTGTATCTGGGCC 

AGCTTGATAGGGAGAGAAATCCTAAGGAATGTAGAGATGATGACCCTACCAAATTTGGAA 

CTAGACTCCAATGAGTTTAGTTAAAACAAACACTATGGTGAAGAACTTTCTTCATTAGTA 

TCTTGTCACTTGGGGGAGTGAGGGGGGACAGAATTTGCTCTTAAAAACCCAGGGGTTGCT 

TTCATGTGTTATTACCNATGGTCTTTGTAGGCCTTCCATTTNGGATTGAGTGGGTNAAA 

Sequence 335 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAAGGTACGGGGGCGCAGG 

TCCAGGTGGCGGGGGATATGCTGCCCAACTCCACCGAGCGGGCCATCACNATCGCTGGCG 

TGCCGCAGTCTGTCACCGAGTGTGTCAAGCAGATTTGCCTGGTCATGCTGGAGACGCTCT 

CCCAGTCTCCGCAAGGGAGAGTCATGACCATTCCGTACGGCATGATGAGTTCTGAGCTGC 

GGAGGAACCCTCATTTCCTCAAAAGTAATTTATTTTTACAGCTTCTGGTTTCACATGAAA 

TTGTTTGCGCTACTGAGACTGTTACTACAAACTTTTTAAGACATGAAAAGGCGTAATG^ 

AACCATCCCGTCCCCATTCCTCCTCCTCTCTGAGGGACTGGAGGGAAGCCGTGCTTCTGA 

GGAACAACTCTAATTAAGTACCTCGGCCGAGGTACCGGATTCTCTTCTTTAACCCTCCCC 

TTTCGTGGTTTCCCCCAATGGTTAAAATGTTTGGATGGT 

Sequence 336 

ATANGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTGG CTTGC CCC 

GGACCACAGCCTCGTAACGGTAACCCCTGCTTTCCAGGGGCCTGGCACNGNCATTTTTAA 

GTGAAGCTGTGAAGCTCCTTTCTATTACTCATCATGCNATAAATAACTATATCCGGNATT 

AGCTATTGNTTCCAATAGTTATCCCAGTCTTAAAGGTAGGTTAGGTACTCAGCAGGAAAA 

GAAAAAGCAGTTCCTGGTTGTTATTTCACCTCGGCCGCCCGGGCAGGTACTTGTATCTGG 

GCCAGCTTGATAGGGAGAGAAATCCTAAGGAATGTAGAGATGATGACCCTACCAAATTTG 

GAACTAGACTCCAATGAGTTTAGTTAAAACAAACACTATGGTGAAGAACTTTCTTCATTA 
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GTATCTTGTCACTTGGGGGGAATTGAGGGGGGACAGAATTTGCTCTTAAAAACCCANGGG 

TTTGCTTCCATGTNTTATTACCATNTTNTTTTGTAGGCCTCCAT7TGGGATTGANNGG7T 

AAAAAA 

Sequence 337 

CCCNCCCCCCCATNANTANTTNNATANTATTTNNTNGTNGNNGGGGGTTGCACATTTTAN 

GGCGAATGGAGCTCCACCGNGGTGGCGGCCGACCACTGACTTTGTGACTTAGGCGGCTGT 

GTTGCCTATGTAGAGAACACGCTTNACCCCCACTCCCCGTACCTGCCCGGGCGGCCGGGG 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTTGANNNCNANCNTATTNGNTTCCGT 

CNCCCTCGAGGGGGGGCCCGGGGCCCNCCNCTTTGTTCCCTTTAGCNNCGGTTAATTGCN 

GCGCTTGGCGTMTTCATGGTCATAGCTGTTTTCCTGTGTGAAAATTGGTTATCnnnnnn 

nnnnnnnnnnnnnnnnnnnCGAGCCAGGGAGCATAATAGTGTAAAAGCNCTGGNnnnnnn 

nnnnnnnnnnnnnnnnnnnnnATTAAATTTGCNGTNGCGCTTCACNTGGNNCGCCTTTTT 

CAAGTCGGGGAAAACCTTNTTCGTGCCAGCTGCCATTAATTGAATTTCGGCCAACCTCGC 

CNGGGGAAGAAGGCCGGTTTTTGNCGGTATTGGGGCCGCCTTNTTTCCCG 

Sequence 338 

AATTGGAGCTCCCCGCGGTGGCGGGCGNGGTACTTTGGTCCAGATAACACTGGTGATATC 

ATGACCCTGAAATTCTTGAC TGGAC TTCAGAATTTCATAAGNGGCATGGNTACTGTTCAC 

TTGGTCACTGCAGATGGGAGTTTTGATTGCCAAGGAAACCCAN GCTGAACAAGAAG CTTT 

AG TTTCT TCTTTGCATTACTGTGAAGTTGNCACTGCTCTGACCACTCTTGGAAACGGAGG 

CTCTTTTGTTCTAAAGATGTTTACTATGTTTGAACATTGTTCCATAAACTTGATGTACCT 

GCCCG 

Sequence 339 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCCCTTCATCCA 

CCAACCTCCAGAGAATAACCTGGCTGCCCTCACTTCTCCAGCTTTTCTCACAAGGCACTT 

GTTACAGGGCTGCTGATGGGAGAGGGAGGAAGGCCTGGGAGGGGGATCTGGTGGTGGCGC 

CCAATCTACAGGGTCCGGAATCAGGGTGGAAGTTGTCAGGGACTAGCGGCTGCCATAGTA 

CCTCGGCCGCCCGGGCAGGACGTGTAGTGGGCCGGAATGGAGAATCCAGTGAACTGGACC 

TACAAGGCATCCGAATCGACTCAGATATTAGCGGCACCCTCAAGTTTGCGTGTGAGAGCA 

TTGTGGAGGAATACGAGGATGAACTCATTGAATTCTTTTCCCGAGAGGCTGACAATGTTA 

AAGACAAACTTTGCAGTAAGCGAACAGATCTTTGTGACCATGCCCTGCACATATCGCATG 

ATGAGCTATGAACCACTGGAGCAGCCCACACTGGCTTGATGGATCACCCCCAGGANGGGA 

AAATGGTGGCAATGCCCTTTATATATTATGTTTTTTACTGGAATTAACTGGAAAA 

Sequence 340 

GGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCGACCGCTCCGAGATCTGTATGAG 

TTGGAGGCAGGCCAGTGTGAAGGTGTGGGAGGAATCCGCCCACACNCAGCTTCATACAGC 

ACCCTGAGGACAATGTGGCTTCCCTGATTCACACCTACTGAGCCAAGGCCCCCTCTGAAG 

TTAGGTCAAGAGGGCCCACCCTAGCCGGGCCAACTCATTCCTTTGAGGACCTGNTACTCA 

ATGAACTGCTTACTACCAAAAAMTGCTNAGGATTCAGCAAAAAGTGCMTTTCCATTTG 

AGAAGTAAGATTCTNCCCAAAACATGACACCACCTTTTCATCCTTTCAGTACCTNGGCCG 

AGGTGGACTAACTGCAACGGAGAGACTCAAGATGATTCCCTTTTTACCCATGTTTTCTCT 

ACTATTGCTGCTTATTGGTAACCCTATAAACGCCAACA 

Sequence 341 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTCACTCAGAT 

ACTA AATAGTAGTTTATTCTTTAATATAAGTTACATTCTGCTCCTCAACCAAATGCAATT 

TTTTGTGTGTGTTTGAAAGC TAAT T TGAGAAA ATTTCATAGGTTACATTTCCTGCAGCCT 

ATCTTTATCCA CAGAAA GTGTTTTCTTTTTTTAAATCAAGACTTTTA^ 

TCCCATCACTGTTTTTTGAAGGTCCTCCAAGTCCGTGTTAAGGTAAATATCTGTTTTCTT 

CCTGATGTCACAGCCTGAGCATACTCTGTGCATTAGGAAGACCTGAGTGCATTTCCCACC 

ATTGTCCTTTCCACATTATGTTGTAGCTGGCTGGCTGTCAGGCGACTACAAGACTGAGGG 

TCTTGTGCCTTATAGATCTTTGTATCCCCCATGGCTGACACATAGTAGGTACCT 

Sequence 342 

NATTG GAGC TCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGCATGTTACATATTCTTCT 

TTAAAATTTTGTAATAAACATTGACAGTGTTTGGTAGGCACAGGGAAACAGGATAACGTA 

GAGTCATTACAGAAGAAAAAAACTTATTGCTAACATTGCAGTATTCCTTTTATCAGAATT 

AGGTGAGTATTGATTGTAAAAGCTCTATCAACTCTTGCTCTTATTTGATGACTTTGAGAC 

TTTTTTACTCTTGCTATAAAAAGAAGGCTACTTrCTTTCCCTAATATATTTCTACCAATG 
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CGAATAATTCAGGAAACAATGAGAGAAAAGTAATTCACACTTAATGTGTTGTTACTTAAG 
AGATTTGACGGATAAAAATAAAATCAAATTAAATCATTTGAAAAGGCANGGCTTAGACGC 
CCTTAAAAACACCGNTGGTGCCTACCATAATTTGAGCACTTTTTCATGAAAAAAAAAAAA 
AAC 

Sequence 343 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGNNGGGAACNANAGNANAACTAAACGAT 

GATGAATGCNNAANNTNCANNAAACGGNGGANNGGNTGNANNGATGNGGNNCCCACTGTN 

N N CAN C N N N ATTACATGACATNAAATCAGANACN ANGGAGGCCCTGCTACAAAGAAGC AC 

AAGATGTGGTGTTGCTTAAAMGTCCTGTATGTGAGGAACTCTTTCATTTTCTTGGGGAT 

TGGCAGTGCTAGGACTTGGTAAGTTGTTAGGAAACTTCGAAGGGCTTTCCGGCATAAGTG 

CTTCAAGTGAGGACAGGACCCTAGGAGCTGTCCAAGAACTGGA 

Sequence 344 

AGGTACCAGACTCTTATTTCCTGCAGCTTTTGAGAATTCTAAAGATACCTGAGTCATTTT 

AAATAAATTTCAGCTTTCTTTACCAMCTACCCTTAACCGGATTCTCCTTGAAATGACAT 

CACCTGACACCCATGGCATGCTGCATGCCCACAGCTAGATTACTTTTTAGTGCGCCCACG 

CACTTCTGCTTTTATTTAGGTGGAGGGCAGNAATCANAGATTCCCTCTTTGTTGGGATCN 

TTGAAC C CTN GATTC ATGCCACTTTTTGGATAATTTCTAAAAATN CCATTACC ATAAAAC 

C CAAATTG AAAATTATATTG NTGTTTANGAAAAAAATTTGGCTTATNTT 

Sequence 345 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACNCCGGCCTGCANAAG 

CCCTCGTGGCACCCGCCCCACTGNGTGCTGGGCCCTGTCTGGGGCACGCTCTACTCAGCC 

ATGGGGTACTATATNTTTAGCTTGCAATGACATGACTCCAGAGCAAATGGCTACAAATGT 

GAACTGTTCCAGCCCTGAGCGACACACAAGAAGTTATGATTACATGGAAGGAGGGGATAT 

AAGAGTGAGAAGACTCTTCTGTCGAACACAGTGGTACCTGCCCGGGCGGTCGNCCGGGCA 

GGTCTCCAGAGCCTTCTCTCTCCTGNGCAAAATGGNAACTTTTAAGGAA 

Sequence 346 

ATGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCCAACTTCCATTCCTCGCCC 

TGCCCCCGGAGCCGAGTCCTGTATCAGCCCTTTATCCTCACACGCTTTTCTACAATGGCA 

TTCAATAMGTGCACGTGTTTCTGGAGAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

CCCGGGCGGCCGCCCGGGCAGGTACAAGTATACATAAAAATGGCATAAATGGCATAATTG 

AACCAATTACTGGATTCAACTATATTAAGACTATTTCCTTAAATCCTACTTCAGACTAAA 

TTATTTTACCTACATTCTTTTCCATATTTTGGAACTTCTGAGTCATTATTTTCC . 

CACATTAAAATAATTTAAAATTACATGTATCCCTTCTCAATAAGTTTAATCAGCTAACCC 

TAAGCTAGAGGTCAAAATCTACTTCCTCTAATATCAAAACGAAAATT 

Sequence 347 

GGCNATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAAGGTACCGTTTCCTCAGCGGCG 

GACTGCTGCAGTAAGAATGTCTTTTCCACCTCATTTGAATCGCCCTCCCATGGGAATCCC 

AGCACTCCCACCAGGGATCCCACCCCCGCAGTTTCCAGGATTTCCTCCACCTGTACCTCG 

GCCGCCCGGGCAGGTACATTTMTCAGTAAATCAGTTTCACATCATGTATTGTGATGTTT 

CAATGTGAGACACAAAMCMTGGCTTGAAACTTGTGTATCATATGTGATTTTGAAATGA 

ACACCTTGAATAGCACTAATTTTTATTTGTGGNATTTTTCTATMCAAMCMGTAGCTC 

TAGGAAAAGAGGTTTTATTTTGTAAACGATCATTTGTGACCTCAGACACTCTCTGGCTAA 

T 

Sequence 348 

AATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAACCAGCTCCTCTTTTGCGT 

CCACACGGATCTGAGAAAGTGCACTGTAGGCATAAGCAGCTATCCTGGAGACCTGCTGCA 

AATCAGAGAACGGGATGGGCTCACTGGCCAGCACTTGGTGGGGGCTGGCTGGTAAGAGAC 

GGCAGCGGTGGCAGCTTCTTCCAATGGGTCAGGCTGCTGCTCAGCACAGCCAAGCGGGTG 

CTGTACTCATGACGGAAGTTGCCGCTCGCCCACTTGTGCAGCAGCGTACATCATTTAAAT 

AACATAMTGACTTTTACACAGCTTGACCTAGGAAAAAATAAAATCCATCATAGCCACAG 

CTAAAAAGCATGTTAAGATTCACAATAAGAATTTGTTTCTCTTATTATAAAGAGAAGAGC 

AATCATATAACCTCCTGGGGGTGGGGGGAGACCTCATAAATATTTTATATTGATTGACAA 

AACAGCATGCT 

Sequence 349 

GGGCNAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGATTTAACACTCTGTATTACT 
GAACTTCTrFGMCTATTTTGMCTTTGAATCTCTTCTCTGAATAACTGCCGAAACTCAA 
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AGATAATAATTAG TTGA TGAAGGTTATCAATTAATAAAATAACACAGACCAGTCTTACCA . 

AACTCTAAATACATTTTAAAAAATAGAAACTGGCAATGATAAAMGAATGTTGAGTACCT 

GCCCGGGCGGCCGCCCGGGCAGGTACTGAAGATTCTTCTTTCGGTGGAAGACAATGGATT 

TCGCCTTCACTTTCCTGTCCTTAATATCCACTTTGTTGCCACACAACACAATGGGGATGT • 

TTTCACACACTCGTACCCATAGC 

Sequence 350 

CCGGGCAGGTACAACTT AAATA GCATNCTAGGGTAAAGAGTAACATATTCCCCAAGAAAC 

AGAACTAAAATAT7TCCTATTTTATGAGAAGAGTGAGTNAGAACAAACAGGGATACCTNN 

CTCACCAGCNNCACTTAATAAAACATTTNATTTCCATACNAAATCCACAAGCCTTTTCGG 

TCAGACTTTAMNGAATGTATNATTCGAAAAAACAATNAGTTTATCTTCAATTTNAAAAA 

ATNTCANTTGGAAAGACNTAANTAAACCNAGTGGAATTCTTTTGGCTTNAACAAATTATT 

GGTTACAAATTTNCCCAAAMGGGGTTTNAAATTAAACCTTTANTNAAGNGTTATAAAAN 

TANACNCNGTTAAAAATTAANTTGGGNAAAATAACNTATTNTTTTAGNTTCCCTTGCGGG 

GCCCGGTTTCTNAANGAAACTTAAGNTTGGGAATCNCCCCNCCGGGGNCTNGCTAAGGGG 

AAATTTNCAATTATTTAAAGCCTTTAATTCGANTTCCCCGNCNCGAACCCTTNGAGGGGG 
GGGGGGGCC 

Sequence 351 _ 

TTAGGGCGAA TTGG AGCTCCCCGCGGTGGCGGCCGAGGTACATGCAGGGAATATTACAAT 

TGCAATCATTATTTTAACACCATCTCCTCCCCCTCTTCCCTTCCCAAAAGATATAAAGAC 

ATAAACATCATCTGCTGCCAAATAAATCCTGAAATTCGAGGGGCGATGCTGGGAAGGTAT 

GAAATTAGAGGGAGGCATGTTGCTTTTGCACTTTCAGATCAATGGCTGAATTCCAGACCC 

GGGGAAGCCTTTCAGATCCGCATGTAATTTGTTTAAACAATTGCTGTTCATCTGCCACAG 

CAGCACTTGTAATGCCGCAGGGAGGAACTTGCAGCATTTCTTGCTCACTTCAAGAAATCA 

ACTATAACAAACCTNTTNCAGAAGACTACCGACATNCATATTTCTGCAATAGTTAGATAA 

AAGTTAAAGAACTGGTCCTTTTTGTGAATATTCACAGTCCACTGCATATAAATCGGTTTA 
TGA 

Sequence 352 

GGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGCACGCTAAGTGATGCTCTCTTG 

TAATTGAAAGAGAATGGGTTTCCCTTCAAACCATATGCTATTACTAGTCTCTGAGTGCCT 

TCAAA GCAC ATACTCTTCCTTTAAAATCCAAGGATCCTTGCCTTCAAAAGACACAAATGC 

AGTAGTTTTCATTTCTTACATTCACTTAAAAATGGATTATGATTCCAGCAGGCAAATCTG 

CATAT TAAAT ATAGACAACCCAACCACAGTATCTGAACGTTAAAAGAAATCTATCACGAC 

AGACATTTTACCTAGTTAT AAATA ACGTAACAATAGATCACACTCAGATTTCTTAAACAG 

CTCACTCAGGAGTTTCCTC7TTTAAAAATTAAATAACTTATGGTACAACAGGCAGCAAGC 

CATGTATTTACCATGGCGAGGGTCACATTTCACCATCTGGTTGGCTGGCTCAAA 

Sequence 353 

CCGCGGTGGCGGCCGAGGTTTTGCTTTTTTTGAAAAAGGTAAGTTGCTGATTAAGTCTM 

TTGGAATTGATAATTCCATAGTCTTAGATTAAAATGAGGATATTTTCTCCTAGATTTrCT 

CATGTTATGCCATGCATTTATATATCTAACCATTAATTTCACACTAAGGATGCTTCACCA 

TATAATAAAAGGAGCAAGATGGMGCACTTTGAATTTTCTTTCATTGAGMTAACTGTTT 

TATGTAAGAATCTGTATTTATAAC ACCAG ATATTAAGATAGGCTTCCATTTTTTAATGCA 

AGCCACTTACTTAATCTTGTATTCTTTTTCAGGACTCAAATAACTAGCTTTGAACATAAT 

ATTAAAACACTACTTATAGAATAGATTTATTAATGTTAATACCTAGTGAATATCCATGTG 

GCATCCTGGTTATGTTATCGGTTCAGCGTTAATCCTATAGAAAAGTGGTTTGGAGG 

Sequence 354 

AGGTACTTGTTAGTTGTTGAGAAACATTTTAGGCAGTAAATAAAATAGTAMTATTATGT 

GTCCTATAATTTGACCTAAGCTGTGTGCCTTTGGAATCAGCTCTAAGCCAGGGCCCCCCG 

GGAGTCATGTTAAGTAACACGATCATAAAGTGACATACATCAAGGTTATCTTATATAGTC 

ATAACAGTATTATTGGGTCTTTCAAAAACAGCTGGCAGATTGAGCTTTTAATGGCTATAT 

TGATACAGTAGATAGAGCACCTCTAAGGAGTCCTGCCCGGGCGGC 

Sequence 355 

GCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCAGAGACTATGATTTATATTGAT 

TGCACTTG CCTG CCATGATTTAGATAAGATTTTTTTGCATGGTTTTTATTCTTTCCT 

GGATCCTGTTTTATAATACTTCCAAGCCTGTCCATGGATATATCAAATGTCTTCACTTGT 

ATATTTTCATGGCTAGGTATTTCTAATGTTTATTCTTCCCTGTGTACTGCAGGGGATGCT 

CTGCCGGTCCAGGGACAACAGCCTTTCTCTTGCTACTGCTCTGATTCTTGTCTCGTTCCG 
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111 I I I CCTTCTCACCATCTTTCTGTGTGCTGTTTTCTTCATTCTGATCATGGTCCCGAC 
TGTCATCATCTTTCAAAGCGTCGMCTTGTTGTTCATCCTCTCGCCGCCGCTGTCGCCCC 
CGCGTACCTGCCCGGGCGGCCG 
Sequence 356 

GCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGCATGTTACATATTCTTCTTTAAA 

ATTTTGTAATAAACATTGACAGTGTTTGGTAGGCACAGGGAAACAGGATAACGTAGAGTC 

ATTACAGAAGAAAAAAACTTATTGCTMCATTGCAGTATTCCTTTTATCAGAATT AGGTG 

AGTATTGATTGTAAMGCTCTATCMCTCTTGCTCTTATTTGATGACTTTGAGACTTTTT 

TACTCTTGCTATAAAMGMGGCTACTTTCTTTCCCTAATATATTTCTACCAATGCGAAT 

AATTCAGGAAACAATGAGAGAAMGTAATTCACACTTAATGTGTTGTTACTTAAGAGATT 

TGACGGATAAAATAAMTCAAATTAAATCATTGAAMGGCAGGCTTAGACCCCCTTAAAA 

AC 

Sequence 357 

CCGCGGTGGCGGCCGAGGTACCTCTTTGCCTTAAAT TGCTT TTTAGTTCTAAGATTGTAG 

MTGATCCTrrCAMTTGTAATCTmCTAACAGAGATATmAATATACTTGCTTrC^ 

AAAAAACAAAAAAACTACTGTCAGTATTAATACTGAGCCAGACTGGCATCTACAGATTTC 

AGATCTATCATTTTATTGATTCTTMGCTXGTATTAAAAA CTAG GCAATATCATCATGGA 

TACATAGGAGAAGACACATTTACMTCATTCATTGGGCCTTTTATCTGTCTATCCATCCA 

TCATCATTTGAAGGCCTAATATATGCCAAGTACCTGCCCG 

Sequence 358 

CCGCGGTGGCGGCCGAGGTACTTCTTTTCCAGAGCAGGTGGCACAAAACGACCCCCCAGG 

TAATGGTAGCGACCGGTAMCTGGGTTGCAGATTTTTTGGGGGCTGTGAGGGAGATGAGC 

AAGTCTGGCTGGATCCCTCCAGCATTTCCCTTCTCCACGTCCCATCCTGAGGGAATGTCG 

ATGCTGGCAATGGGCACAGTGAGTCCCTTCAGGACACTCAGGATGCTGTGGAACGGTTCC 

CGAACATCGCCCTTGAAGCTGAAGCCAAAGATGGCATCCACCACCAGCT CATA CAGTTCA 

TCAATCGTCATGGGCTCTGCGGGCATTTCCCCAAGGA AAGG GATGTCCATTTTCTGACAC 

TGGGTCACCAATGCAGTGAAGAGGGGCTTGTTAGGCCTTTTGGGGTAATAGATGGTTGGC 

TCGTAGCCAAAGAGTTTGAGGTGTCGAGCACAGACCAGACCATCTTCTTCATTATTCCCC 

G 

Sequence 359 

CCGCGGTGGCGGCCGAGGTTTTCATGC ATGCA G ACTTTT ATTTAACCTCATTGAMTTAA 

AATTAGCAATTTAAAATACAAMTATAGCATTTTGGTTTTTAAAAATCA 

CTTTAAGATAAAAACAAATAGGGTTGGTCACCCTTATATGTCAA CCCAA AAAGTATCAAA 

AATACCTTGACTGCATGAAAGMCAGCTTCTTTAGGNCTATGCATTTTGACTTGGGCTTC 

TGCCAGATGATACCATCCACTTGGGCAGACAGGGCTTTCCTTTAACCCT 

Sequence 360 

CCGGGCAGGTACCAGCGCGTCCCATACGTGGAGCCGCCCGAATACGAGTTCTTTTGGGGC 

TCCCGGGCCAGCCGCGAAATCACCAAGATGCAAATCATGGAGTTCCTGGCCAGGGTCTTT 

AAGAAAGACCCCCAGGCCTGGCCCTCCCGATACAGAGAAGCTCTGGAGGAGGCCAGAAGC 

TCTGCGGGAGGCTAATCCCACTGCCCACTACC CTCG CAGCAAGTTGTCTTCTGAGGACTA 

GCCAAAGTTCTGGGAGGNCMGATTGAATGGGTTTTCNTGACCCCTCAACNC ANGG GCTT 

GTTGGGAAAGGGGNAGGGGTGGCTGGCGTTCATTAATAGGTATTTCANGGGATTTTAACA 

GGCTGGCAANTMmNCACNGmrTAAACCTTTTTTAMGTAT^ 

CTTTNCCGCNTCTTANGAAACCTAAGGTGGGGAATACCCCCNCGNGGGCTTTGCTAAGGG 
GAAATTTTCCGANNAATTCAMGNCTTTNATTTTGANT ACCCGG CTNCGTANCCCTTCCC 
AGGGGGGGGGGGGGGCCCCCGGGTTAACCCCMGCTTTTTTNGTTT 
Sequence 361 

CCGGGCAGGTGTGTGTATTCACGCAGTTACTCGCTTCCATTTTTATGACCTTTCAACTAT 

AGGTAATAACTCTTAGAGAAATTAATTTAATATTAGAATTTNTATTATGAATCATGTGAA 

AGCATGAC^TTCGTTCACAATAGCACTATTTTAMTAAATTATAAGCT TTAAG GTCACTT 

ATACTACCAATMCTTGTTAAATCAGGATTTGGCTTCATACACTNGAATTTTCAGNTATT 

TTATCTCAAGTAGNATATAGACACCTAACCTTTGATAGTGGATACCGTTAGGAGGGGTTC 

CTATTCTTCCCATTGTACCTCGGGCTCGCTCTAGGAACTAGGTGTNATCCNCCCGGGGCT 

GCAGGGAATTCCGATATCAAGCCTTTATCGATACCCGCCGACCTCNGAGGGGGGGGGGCC 

CGGTACCCAAGCTTrTTGTTTCCCTTTTAGTGGAGGGGTTTAANTTGCGCGCTT^ 

AATCATGGNTCATAAGCTNTTTTCCCTGGTGGTG 
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Sequence 362 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTCTCCTTCAGGGCATC 
ACCCACCTTGCGGAAGTCTGCCAGCTGCAGCCAGCATGTCTTCAGGCTACATGAGCCGGA 
C ACCCCATGGC ACTTGCAGGCCACATN AGCCAGG TTGTAC AG AAATTTG N ATATATGATG 
GTTCTTAGAACTTGTTTTMTTTTTGTGGNCCTTCT 

ATGATTATCCA TATGTGTTCT TMTTTTTAACTGCTGGAAGTGTTAAAACACACACACAC 
ACAC ACACACAl fTTTTm I GAGAACTCCAAAGTCCTGAAAATTTTGGTGGACAATGAT 
TTTTAAAAAACTMCTTTGTATMCCTAATATTTGNATTCTCTCATCTAT 
Sequence 363 

ACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTGGCATATATTAGGCCT 

TCAAATGATGATGGATGGATAGACAGATAAAAGGCCCAATGAATGATTGTAAATGTGTCT 

TCTCCTATGTATCCATGATGATATTGCCTAGTTTTTAATACAAGCTTAAGAATCAATAAA 

ATGA TAGATCT GAAATCTGTAGATGCCAGTCTGGCTCAGTATTAATACTGACAGTAGTTT 

TTTTGI I f I 1 IAAGAAAGCAAGTATATTAAAATATCTCTGTTAGAAAAGATTACAATTTG 

AAAGGGATCATTCTACAATCTTAGAACTAAAAAGCAATTTAAGGCAAAGAGGGTACCTGC 

CCGGGCGGCCGCTCTAGAACTAGGTGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnCGTCNACCTCCAGGGGGGGCCCCGGNGCCAACTTTTTGGTTCCTT 

Sequence 364 

CCGCGGTGGCGGCCGCCCGGNCAGGTACAATGTTGTTATGGTAGAGAAACACACATGCCT 

TAAAATTTAAAAAGCAGGGCCCAAAGCTTATTAGTTTAAATTAGGGTATGTTTCAAGTTT 

GTATTAATTTGTMTAGCTCTGTTTAGAAAAAATCAMGACCATGATTTATGAAACTAAT 

GTGACATAATTTCCAGTGACTTGTTGATGTGAAATCAGACACGGCACCTTCAGTTTTGTA 

CCTNGGCCGCCCGGGCAGGTACAAAAAGGGATCAGAAGGGCAAGGGGCATGCTAACGTCA 

TCGGGGGGCCAGTCTCCAAGTCGCAGNCNCGTGCCCCCCGGAGGCCGGTGGCTATCTGAA 

CTCTATGCATTCGC 

Sequence 365 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTGCAGAATCTG 

GTAGACTGCTCTGGGCCTCAAGGCAATGAAGGCTGCAATGGTGGCCTAATGGATTATGCT 

TTCCAGTATGTTCAGGATAATGGAGGCCTGGACTCTGAGGAATCTTATCCATATGAGGCA 

ACAGAAGAATCCTGTAAGTACCTCGGCCGAGGTACTGGTCTGTGGCCTGGGGGTGGGGAC 

CCCTGTTTTAAGCAGTTTAAGMTTATTCTTTTTCAGAATATCTACCTCTACTGGGATAA 

GATCACGTTTCTTTACTGTGAGCACACAGGTAGGTGCTCAATAAATGGCTTGTTGCAAGA 

ATGAATGACCGGCCGCTCTAGAACTAGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnCGTCGACCTTCGAGGGGGGGGCCCCGGTACCCACTTTTGGTTC 

Sequence 366 

CCGCGGTGGCGGCCGCCCGGGCAGGTACACTGTCACCTGGAAGTGGTTCTTCTTCTGGCA 

CACAAAGGCGTCGTCGCCCACCGAAAAGTTGAAGCCCTTGTCCGCATCCACGCGGTAGGT 

GAGCATGC^CAGCT CCTT GTAGTTAGCATCGTACCTATAAAGTAAAGCTAAMTGATTTT 

ATCTGTGAATTCAGATTTTAAAAAGTCTTCACTCTCTGAAGATGATCATFTGCCCTTAAG 

GACAAAAATGAACTGAAGTTTCACATGAGCTATTTCCATTCCAGAATATCTGGGATTCTA 

CTTTAAGCACTACATAAACTGACTTTATCCTCAGACTAGCTGAATGATTTTGTGCTGTTT 

CAGGATGTTTGCACTGMGAAAMCAGAAAGCTTATCTGAAATTTATAAMCTTTTTGTT 

TTGCTACATAGAAAACAGAAGGTA 

Sequence 367 

CTATAGGGTTTATGGGANCTNCCCGCGGTGGCGGCCGAGGTTCATTATTTTCCNTGTCTN 

CCAGGANTTGTGGCGCAAGAAGAGGTAGCCTAATATAANCCAAAAGTTTACTTAGATCTT 

TCCGTCTCTGTTCCAAATCATGACGGACCCAAGTAAGAAGTGCATTCAATATTGTCTCCT. 

CATTAGGAATGTTCATGTCATCACTAGCCAAGAGCTTTGCAATTTCGCTGGCTGGTAATA 

ATACAAATTCCTGGTTTCTGATTACTTCCATGAAATGCTCCATAGTATAATTGTGAGCCA 

CTTTATGCAAATCTGTACCTGCCCGGGCGGCCGAGGTACTGTATTAGAACACTGGGTGTG 

TCATACCGTTATCTGTGCAGAATATATTTCCTTATTCAGAATTTCTAAAAATTTMGTTC 

TGTAAGGGCTAATATATTCTCTTCCTATGGNTTTAGATGTTTGATGTCTTCTTAGTAATG 

GCATAATGTCATGATTTACT 

Sequence 368 

TNCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGGAGTTTCCCTATTT 
TGGTGTTCAGCTTGAAAAAGGACTTGTCAGAATCAACTGTGTCATCAAAATTTAAGTAAT 
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GTGCATTGAAAATMGGTTGATCATGGGAATATGCAG 

C AAATAAAATTGTAAT^AG AATTTTTAATC TTAG GTTTCTTG ATTAATTTATAAG AG AT 

CAATTATTGTCA GTCT TTTTTGTATGTTTTTTAAAAACATAGTCCAGAGCATGGG 

TTGACACCTCTCTTTTAAGTGAAATTTGGATTGCTCACAAAGCACTAGGAAATGTCATGG 

GGTTCAMTATATATCCTACACAACTGGGCAATACATTTTTGTTTGA 1 1 I I 1 AGGTCTGT 

GTATACATTAACAGTTCATGTAATTAATACCTGATCATTTGGGATAATGAAAGGTGAAAG 

TTAGTTGTAGATGAAGTAAAGTTATAAAAGGAGATTAAAAATGC 

Sequence 369 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATTTTTGAAAAGCACAA 

TMCTTGTATTAATTGCAC TTAAC ACAATGMCCTTTAGTTTCCAACCAGTmCATTCT 

CTGCAGACCCGGGCTTTCTTTTTATAAAAACTGCTTTCAAAGGGCATAGAGACACCACAC 

ATGGT CCACAGC AAATTCAAATAGAGAGGTGCAATAGTTGCAGTGGTAAACACACAAAAA 

AATACATTTTTTTGGACTAAAAATCTGGTCACGGATAAAAGCATGTGCCTTTTCATTC^ 

CTCTGGGATGTTACAACAGCAACACGCTCTAAAACAATTAAGTTACATGCATAATGCCAA 

AAGAATGTGAG C AATC CTATAACC AGCTTTAAGCC ATCTGCTTG ATTTC M I I 11 I I 11 1 

TGTAAGGMCATATATAGGAAAAGAGMTTNCCCTTTTGTTCCATTACATATAGAAACCT 

TTTGAAGCTTTCAAANAGCTTTGGGTTT 

Sequence 370 

TAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACTGCGCCATCTGGG 

ACACCTTGTTTCCCANCTGGGAGGCTTCAGCCCGGGCAGCATTGGCCAGCGGCTCAATGA 

GATGGGAGATCTCTTGGACTGCAGTGAGCATCTGAGTGTGCAAGGCCTNTTGAGAGATTC 

CCTCACGGGGAGCAAGCTGNTGGCTTGACTGCAGCGAGNGAAGCCNTGNTNTAGTTNCCA 

TTNNAAAAAATNTTTAAAATCNTCCAAANGGATTTTTTAAAAM 

NNN NTTNAAA AAAAAAAAGGGGNTGTTTTTNATTTTTTTNAAAAAAAN 

CCCTTTTTTTTNTAAAAAAAAAN NCCCC NTTCTTN N N AAAAAAAAN AATTTTATTT N N NN 

TTTNTATAAAAAAAAAAATN mTTTTTTI I NNNGGGNTTTNNNCCCCCCCCCCCCCCNC 

TNCCCCCNNNCANAATAAAN rTTTTTTTTTTTTTTTTTl I ATNATAAAAAAAAAAAAAAN 

AAAAAAT 

Sequence 371 

GATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGNCAGGTACTCTGGCGTGCAACCTGGAAT 

TTTTCGGAGGTATTTGTAGGCAAAATACCAAGGGATTTATATGAGGATGAGTTGGTGCCC 

CTTTTTGAGAAGGCCGGACCCATTTGGGATCTACGTCTTATGATGGATCCACTGTCCGGT 

CAGMTANAAGGNNATGCATTANTCNCCTTTTNNGGAAAAGAAACTGCNNAAGAAAACCN 

NGAAAAAANGGGNGGANAAMNTNNTAAAANTTTCCCCCTTGGAAAAAANCCTTGGGGGG 

GGGGNCNTTTTTTTNGGGGAAAAAAAAAAAAAANNTNTTTTNNGGGGGNCCCCNCCCCCC 

CNAMNTTTNANNTCNCNTNAAAAACCCTTTTTTNNGNNGNGNAAATTTNNTATNTTTCC 

CCCCNNNGGGGGGNGGGGGGGGGGGGNNTNN I I I I I I I I N I I I I I I I I INCCCCTCCNN 

NNAAAAAAAAAAAAAAAAAAAA 

Sequence 372 

CCGCGGTGGCGGCCGAGGTACCTGAGGAACATAGATTCTCTGCATCTTTCTCAAGGGGAA 
CCCTCTCCAGCTTCCCTGGTGTGACCCTTCACATGCCAGATTGGGTAGGATCACTTTGAA 
CTGCCTGAAGTTCAGGAAGGTCATCAAGCTC 
Sequence 373 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGTACATTTCCATGGGCCCT 

GTTCCCATTGATGTATACTGCTTCCTTACTAACAGTGAGGGATGACTTTCATCAGTCTTT 

TATCACCTGAACAGTCTTCCGGCCATAATGATAGTAACTATAAGCTGATGCAGCTGTGGT 

GAMGCTGTAAMCACCTTTTATGGAAGAAAAGAAATAAAATGTAGTTGTCAAGTCTAAA 

AAATAGTACCAACGGGAATCATAATGAATACATGCNATGAATTTAAAATGNNAAAATGAA 

TCTAAAAAGTAAAAAGGGCTCTGNGGTGTAATTTTTCTTAACTACAANAGTCTAAATACA 

CTGCTTTTCTTTAAGAGTTCATTTTAATTAGNAACCGTCAAACAAAATTAT 

Sequence 374 

ACTACTCAGGGCGAATTNGAGCTCACCGCGGTGGCGGCCGCCCGGGCCGCGTGAAGAGGA 

AGAATGCCNNGAATTCCAGGGTGGGGCTGGGGCTGGAGACGACGAGGAGGAGGATTAAGT 

CCACCTGTCCCTCCTGGGCTGCTGGATTGTCTCGTTTTCCTGCCAAATAAACAGGATCAG 

CGCTTTACACCATGTTGTTACATGTAAACAAACTTCAATTTGAAGTGCAGCTATTATGTG 

GTATCCATGTGTATCGACCATGTGCCATATATCAATTATGGTCACTAGAAAGTCTCTTTA 
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TGATACTTTTTATTGTACCTCGGCCGCTCTAGAACTAGT 
Sequence 375 

CCGCGGTGGCGGCCGAGGACAAATGTGGTGTGTCTTCCAACTTTCATTGAAAATGCCATA 
TCTATACC ATATTT TATTCGAGTCACTGATGATGTAATGATATATTTTTTCATTATTATA 
GTAGAATA I I I I I ATGGCAAGATATTTGTGGTCTTGATCATACCTATTAAAATAATGCCA 
AACACCAAATATGAATTTTATGATGTACCTGCCCG 
Sequence 376 

ATACACTACTATAGGGCGAATTGGAGCTCACCGCGGTGGCGGCCGCCCGGGCAGGTACGA 

GTTCAGCCTGACCCGTGAGACAAAGAAGCACGTGCTCTTTGGCACTGTGGGGGTGCCTGA 

GCACACATACCGCTCCCGAACCAACTTCACCAGCAAATACAACATGAAGGTCCTCTACTT 

ATCCGCCTTCACTAGCAAGGACGAGGGCACCTACACGTGTGCACTCCACCACTCTGGCCA 

TTCCCCACCCATCTCCTCCCAGAACGTCACAGTGCTCAGAGACAAACTGGTCAAGTGTGA 

GGGCATCAGCCTGCTGGCTCAGAACACCTCGTGGCTGCTGCTGCTCCTGCTC"» CCCTCTC 

CCTCCTCCAGGCCACGGATTTCATGTCCCTGTGACTGGTGGGGCCCATGGGAGGGAGACA 

GGAAGCCTTCAAGTTTCCMGTGCAAGMGATCCTACTTNTTTTGAGTCAAGCTGACCCC 

CTCCCCCCAATCCCTCAAACCTTGAGGGAGAAGTGGG 

Sequence 377 

CNGGTTCGCTGTGTGCCTAATACATGCATGTTGAACGGGATGTAGCAATACATTCAGTAG 

CGAATGGGTGAGTAACACGTACCTAACCTACCTTTAAGACTGGGATAACTATTGGAAACA 

ATAGCTAATACCGGATATAGTTATTTATCGCATGATGAGTAATAGAAAGGAGCTTCACAG 

CTTCACTTAAAAATGGGGGTGCGGAACATTAGTTAGTTGGTAGGGTAATGGCCTACCAAG 

ACGATGATGTTTAGCCGGGCCGAGAGGCTGTACCCTCGGCCGCCCGGGCAGGTACAAGAG 

TGATGGCAATGTGACTGGAACAGAAATAGTTTCTACCAGGCACACAAAAGCTTCTGTAAG 

CCCCGTANTTCCGTCCTGCAAAGGGCCTTNAGTGGGAACCAGGTCTGCAGACCCCAGTGG 

GCANAAAGACCGGGTGGAAGCAGG 

Sequence 378 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAAAGGAGAAA 

AACTATGTCTAAGGAGGGMGCACATAATAGAATTCTATTATTACAATAGAATTCTATAA 

ATACAAAATTGGATAAACTTTATCAGAAGCAGGTGTTTGTCACCATTTTATATGTGAAAC 

TCAGGAAGTTCTTGATTTTTTAAGAGCTGTATTCCTTAATCTGGTTACAGGCTATAAAGG 

AGATAATCATTTACATGATCATATTCTCAAACAGATGGTCACCAAATGGAATCAAAGGAC 

TGATTTGATGTAGCCGGTAGTATGATAATTTTTGTAGGTTAAATGGAAAAATACNGGATA 

GGATCCNAGAAAAATAANGTAATrrTNCCAGGTAGGCCNGGTTTAATAAAATTACCAGGA 

CCTAGGAATAAGCCAATTTTA 

Sequence 379 

CCGCGGTGGCGGCCGAGGTCCCCC I I I I I I t I M t M I I 1 I II M M t I CTTGACAGCAG 

CATCTGTTTATTGACAATTCCAGGTCATTCCTAACACGCCGNANCAGGGCTNTGTACCTG 

CCCGGGCGGCCGCCCGGGCAGGTACTTCATGAAGCANACCATTGGGAATTCCTGNGGCAC 

AATCGGACTTATTCACGCAGNGGCCAATAATCAANACAAACTGGGATTTGAGGATGGATC 

AGT TCTG AAACAGTTTNTTTNTGAAACAAAAAAAAATGTCCCCTGAAAACAGAACCAAAA 

TGCTTTTGAAAAAAAANNGGGGCCTTACCAGGCAAGCCCAATNAATGCCCCGGGGCACAA 

AGNMGGCCCAATTGTTNGGGGTAAAAANAAAAAAGGGGGGAAMTTCCCCATTTTTTNT 

TTNTTNTT TNAAANANC CCGGGGGGGGGGNCCCCCNTTTTANAAAAANTTGGGGGGGGAA 

AAAATTNN1 ( 1 I II II \ GNAANAAAAAANCCNCGGGCCCCN ANAAAAAAAAAAAACCCCC 

CCTTTTGANGGANCCNCCCNNNNTAANANAAAAAAAAAA 

Sequence 380 

ATTGGAGCTCACCCGCGGGGNCGGCCGCCCGGGCAGGACCTCTTTGCCTTAAATTGCTTT 

TTMGTTCTAAGATTGTAGAATGATCCTTTCAAATTGTAATCTTTTCTAACAGAGATATT 

TTAATATACTTGCTTTCTTAAAAAACAAAAAAACTACTGTCAGTATTAATACTGAGCCAG 

ACTGGGCATCTACAGATTTCAGATCTATCATTTTATTGATTCTTAAGCTTGTATTAAAAA 

CTAGGCAATATCATCATGGATACATAGGAGAAGACACATTTACAATCATTCATTGGGCCT 

TTTATCTGTCTATCCATCCATCATCATTTGAAGGCCTAATATATGCCCAAGGACCTCGGC 

CGCTCTAGAACTAGGNGGGATCCCCCGGGCTGCAN 

Sequence 381 

TAGGGCGAATTGGAGCTCCCCGCGGNGGCGGCCGAGGNNCTAAGCCCCAGGACCNATTGG 
TAG ACG AC NTAN N AN CN N AGGCGCATN N C ACTG AAAC ANGTCAGNGTATATG NTGGC ACG 
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TATTAAANTTAAGATGAANGNNGAAGCAAAAAGATTTACAAGAATTAGCNGTAACAANAT 
TG ATGCTN AAG AG AC ATN ATTGTACCTGC NCN 
Sequence 382 

TNCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGCCGGCAT 
TGTCTTAGGGGTTAATAAGGTTAACCTATGGTTACAAAACAAAGGGTAACTTGAAGTATT 
GAGTAACTGCTTCTGTAATATGGGGGAGAATAAAI I I I r AAATTTGGTTTATTTAG AGG A 
AAMTTTTGACTTAAAT7TMCACTGATTTGGCACATM 

AATGGTTTAGAAGCAAACCAGCAAAGGAAAAAAGCAMCCCTAGCAAACCTTTCACAAAT 
GTTAAAGAGACACTGCTCCATTTTAAGCAGTGCGGGTCATTTTGAGTTTAATGAATCCCC 
ATCAMTGGGTGGTATAGTAGATAGCTATGGTGTGGTTTGTAAGGGTTTGTAGCTTGCAA 
GGACGTGCACTAATAATATTTTATATGATCTTTCCTGGTTGGCAGGACCCATGGATGAA 
Sequence 383 

CCGCGGTGGCGGCCGAGGTACACGGATATCACCTGAATTATTAATGAATGCCCAGGAAGT 

AATTTTCTTCTCATTCTTCTAAAACTACTGCCTTTCAAAGTGCACACACACGCGTCCACA 

TACACTG CATTCGTTGCTCCAGTATAAATTAC ATGC ATG AG C ACCTTTCTGGCTTTTAAG 

CCAATATAATGGGCTGCAAAATGAAGACACCAGAGTGTATGCATACAAATCTCACTGTAT 

TAMGATGCAGGTTtTCTAATGTACCTGCCCGGGGCGGCCGCCCGGGCAGGTACCCCACC 

TCCCTAGCCCTCTGTGCTTCAAGTCTTCCTTTGACCCCCTTCGCTCTCCCACAGCAGGGC 

TCTCAGCTGCCAGCCAACCAAGCAGGATGGTGCAACAACCAGACCTAAAGGAAGGGACCT 

GGGGAAGAAGAGAAA 

Sequence 384 

CCGCGGTGGCGGCCGAGGTACACATGCATGCATACACACCCATACAAACATCTGTGTGAG 

GGCAGTTCTGGAGATGAGCAGAGAGAGACCGGAATAAACTCAATCTT7TCTTTCCCAAGC 

TCCTAGCCAACACTATCCTTGGGAGAAAGAAAATTTGCAGAAACTGCTAAGACCAAGTGT 

GGAGATGTCAAGCTAGTTCACACTCTGAGGCTCAGAATATGTAGGACATGCACAATTGTG 

CAGTCCTTTGGGATTGGGAAGTGAAACAGTCTGTGATCCCCTACCTTCTAGGGAACTAGG 

ACCTAGGAAGAGGTAAAGATTATCAGGTATGCAAAGCGCCCCAATTCTTCTGCTGCCATG 

GGGGGGATTTTACCCCAACTCCAGGGGTTCGAGGCCAATCTGAGAATGGCTTAA 

Sequence 385 

AGGTACC AGACTCTTATTTCCTGCAG CTTTTGAGAATTCTAAAGATAC CTGAGTCATTTT 

AAATAAATTTCAGCTTTCTTTACCAAACTACCCTTMCCGGATTCTCCTTGAAATGACAT 

CACCTGACACCCATGGCATGCTGCATGCCCACAGCTAGATTACTTTTTAGTGCGCCCACG 

CACTTCTGCTTTTATTAGGTGAGGCAGAATCAAGATTCCCTTTTGTTGGATCTTGAACCT 

GTTCATGCCACTTTGATATTCTAAATTCATACATAAACCAATGAAATATATGTGTTAGAA 

AAATTGCTATTTCTAGGTGGACGCAGTGGCTCATGACTATAATCCTACCA 

Sequence 386 

CCGGGCAGGTACACTGCGTTCCAGGCTGGTGTATGGGGTTGATCCTTATCGGTCACAGTT 

AGATTAGCTACTATGATGTCTACCCTGTTCTCAGGAACTTCACCATAAAACGTCATGGCA 

GTAAACTCTGGAGGATTGTCATTGACATCTGTCACTGTGATGACGGCCGCCCGGGCAGGT 

ACATGGTTGATTTTGTGCTGTTGTTTGGACTGTAACATCCATGTTGTCAATACGTATACC 

TTGTMGTGGATAACTTTTCTTTTTCCCAGGCCAGAGAATTCAMTTGTTAAAACATTGG 

CATTTGAAGAGGAGAACAAAATGTAGCATGATGTATTTAAAGTAAGGCCTTTAGTAA 

Sequence 387 

CCGGGCAGGTACAGGAGTTTCCCTATTTTGGTGTTCAGCTTGAAAAAGGACTTGTCAGAA 

TCAACTGTGTCATCAAAATTTAAGTAATGTGCATTGAAAATAAGGTTGATCATGGGAATA 

TGCAGAATTTCCAATGTA I I i I I AAAT AC AAATAAAATTGT AATTT AG AA 1 1 \ \ IAATCT 

TAGGTTTCTTGATTAATTTATAAGAGATCAATTATTGTCAGTCI INI 1GTATGI Mill 

AAAAACATAGTCCAGAGCATGGGCAGAATTGACACCTCTCTTTTAAGTGAAATTTGGATT 

GCTCACAAAGCACTAGGAAATGTCATGGGGTTCAAATATATATCCTACA 

Sequence 388 

AGGTACCCAGTATTTCTAATTAAATTTCACATGCTAAATTAATGAAAGTAACAAGATTGT 

AATTTTTTAAAGTCAGTTGATTAAATGCAATAAATATTGGGTTGCTCAAATATGCCACAA 

ATAACTCGAAA \ I I \ \ C ATTTACTTTC AACAG C ATAAG ATTCTTTAATATTTAGG ATTG A 

CTGNTTCTTTCCAGTTAAGCACTGAAGGATTATGTCTTGTAGCTTCCCCAAGAGAAGGGG 

AAGGAAAAAAAAGCACTATGTTAAGGATAATACAGAACTCTTTTGAACTATTTTGGGTAG 

TACCCTGCCCGGGCGGCCGNCCGGNCAGGTACCAGTGGAGG 
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Sequence 389 

CCGGGCAGGTACTCTTGGATGTGTTTTCTCACCAAGATGAGCAAAGAAAGGTTTGCACAG 

AGGAGTGTGAATGTGTGTTTGTTGCTGGCTGAATGGCAATAGATGTCTAAGGTGGATTCA 

GTGTCTGGCACACTGAGACACCTCCAAGAAGGAGATTGATGCATCANGTTCAGTTTAACC 

TGGAATA TCTGACTA CCCCTGAATCCACCCAGAAAGGGGGCCCAACACCCTJGTCCATTT 

ATGGGTAi mum 1 CGAAGTTATTMGCATATTCCTTTTCCACGAACCTCTTCTGTACC 

T 

Sequence 390 

CCGCGGTGGCGGCCGAGGTGTTTGAGTTTGGTTTGGAGCAAAACTGAGGTAGTCCTAACA 

TTTCTGGGACTGAATCCAGGCAAGAGAAAGAAGAAAAAGAAGAAGAAAAAGAGGAGGAAA 

MGGTAGGGAGAAATAAAGGGAGGAGAGAAGCACAGTGAAAAAAAAAAAMGTCCCTTTT 

CGACATCACATTCCTTGTGNTTTCCCTCAGCCTGGAAAACATATTTAATCCCCAGTGCTT 

TTTACCGCCCCGGAAACAAAGAGACTAAGCCAGACTTATGGGGGAAAGGGNAGATAAGAA 

GGATCCTGGAACTTTNAAG 

Sequence 391 

TGCTTCCTTACTAACAGTGAGGGATGACTTTCATCAGTCTTTTATCACCTGAACAGTCTT 
CCGGCCATAATGATAGTAACTATAAGCTGATGCAGCTGTGGTGAAAGCTGTAAAACACCT 
TTTATGGAAGAAAAGAAATAAMTGTAGTTGTCAAGTCTAAAAAATAGTAGCAACGGGAA 
TC ATAATG AATAC ATG C AATG AATTTAAAATGTAAAAATG AATTTAAAAAG TAAAAAGG G 
CTCTGTGGTGTAATTTTTCTTAACTACAAGAGTCTAAATACACTGCTTTTCTT 
Sequence 392 

CCGGGCAGGTACCGCATCAGCAAAAGTGCCTGGCTGAAGGACACTGTTGACCCAAAACTG 

GTGACCCTCAACCACCGCATTGCTGCCCTCACAGGCCTTGATGTCCGGCCTCCCTATGCA 

GAGTATCTGCAGGTGGTGAACTATGGCATCGGAGGACACTATGAGCCTCACTTTGACCAT 

GCTACGTCACCAAGCAGCCCCCTCTACAGAATGAAGTCAGGAAACCGAGTTGCAACATTT 

ATGATCTATCTGAGCTCGGTGGAAGCTGGAGGAGCCACAGCCTTCATCTATGCCAACCTC 

AGCGTGCCTGTGGTTAGGAATGCAGCACTGTTTTGGTGGAACCTGCACAG 

Sequence 393 

AGGACAAATTCAGTCCCAATACTCAATACGTATTATAGATGACTATGAGTGCAAACCTTA 
GGATGTGATTTTGTGAATAATTGTTCTTTC 

AA GATC TAGTTTTAGTGTGAGCTCAGTAATGTTAATTGGTTAAGTTCATTGTGAATCTTG 
AGTTTTAGATAAGNAGTTATTTTmCAATATCACTTCTGTTm 

AGAAACAACGTATTCAAGAGCCATGGCTGACAGTGCCAGATATACTTAGGGATAAACATC 

AAAATGCAATTATAGTTGCTATAACGTTAGATACTCGGAATCAAAATTT 

Sequence 394 

ANACCCACTC ACTN N N AGGGCGAATTGGN AGCTCCACCGCGGTGGCCGGCCGAGGTACAG 

TGATAGGTATCTTTCTTTGGAG 1 1 I 1 1 1 1 I I I GTGC ATATGTGTATAGTTTTATGGGTTC 

TGAGTTGGTGACCAGTAAGTTGCATGTAGTGCTGGCACTTACTTAATAACTATTCATGAT 

ATTGTTAATAACTTGTTATAGGATTGTATTCCCAATTACAGTCTCTAAGATTGTAATTGA 

TATTATCTGAGAGGTAGTGTGACAACTTTCTTTTGTTGTTACATTAAGCCGAAAACATAA 

TACTAATAGACAACTAACAGTTTGCTTATCAGGCACATCAACTAAGGCACCTCCCCCCAT 

GCTAAGTTTCTCCTGGATATATGGAAG 

Sequence 395 

AGGTA CATAA TAAACAAGTTTCAACCAGCAAGAAATTACTAATATTGACTGTGGAGTTTT 

GGCTGTTTTAATAGTTCTAACTCATTATTCCGTAATTCAACACAGCACTACCAACACAGC 

TGGCAATGACAAGACTGGGAGTATCAAACTAGGATTATTAGGCACAATCCAGGTGGCCTC 

TGCAGCTGTGTCTCTCTTTCCTCTTCTGTTCCTATAAGGGCAGGGCCTCCTTCAGGAACA 

GCCACCAGTGAGCTTCCTCCTCCCTTCTGGTCAGTTGGATTTGTCACTGTTCAGCATCTT 

TTCGATGATTTACCTGCCCG 

Sequence 396 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATTTCCATGGGCCCTGT 

TCCCATTGATGTATACTGCTTCCTTACTAACAGTGAGGGATGACTTTCATCAGTCTTTTA 

TCACCTGAATAGTCTTCCGGCCATAATGATAGTAACTATAAGCTGATGCAGCTGTGGTGA 

AAGCTGTAAAACACCTTTTATGGAAGAAAAGAAATAAAATGTAGTTGTCAAGTCTAAAAA 

ATAGTAGCAACGGGAATCATAATGAATACATGCAATGAATTTAAAATGTAAAAATGAATT 

TAAAAAGTAAAAAGGGCTCTGTGGTGTAATTTTTCTTAACTACAAGAGTCTAAATACACT 
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GCTTTTCTTTAAGAGTTCATTTTAA 
Sequence 397 

TTGTCTGAGCTACTGGAATGAAGTTCACAGGTCTTGAAGACCAATATTATTTGTCAATAT 

GTGGGGATAACCTGTAGCTGCATTCATGAGGTAGCAAATAGCAGTTTTGGCCTGTGGGGT 

GAAC AGCTAAACAGTGTTCTGGCATCAGATAACACGGCACAAAGGGATTGTGTCTAGCTC 

TTTTGGAGAAAATTTATGTCCAGTCTCAGGATCTGTGGACAATAACGGAAMCTTTGAAT 

ATTCCTG ACAAGTTGCC ATATTAGGAGTCAG7TCTTTACTTCTCTGAGATTCCCAGGTCA 

TCCATGATTTTTTTTTTTAATTTGAGACAAAGCCTCACT 

Sequence 398 

NACACCCACCGCGGNGGCGGCCGCC.CGGGCAGGTATGGNTCCGNGGATGCAAAACCCCTG 

GGCCGCAAGAAGGGAGCCAAGCTGACTCCTGANGAAGAAGAGANNNGAAACAAAANCCGA 

NCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGGCNNGNACCCCGGCCGCCCGGGCAG 

GNACCNGACAGNAGTTTAAGANGAGAGAGGNNAGGGNCCACNCGGNNNAAGCAAGAGANA 

GGNCGGGGGGCNNNNGGNNNAAACNANGGCCAGGAGANNGGGCNAAAGAGAAGACGACGA 

GAGNAAGGAAANAAAG 

Sequence 399 

CTCACCGNGGTGGCGGCCGCCCGGGCAGGTACACGGACCGCACGGAGAAGCTGAGGCCTG 

AGATGGAGGGGCCCGGCAGCTTCACCATCTTCGCCCCTAGCAACGAGGCCTGGGCCTCCT 

TGCCAGCTGTGAGATGACCTCCGTCTGCCCGGGGGACTCTTATGGGGAACTGCCTTACTT 

CCCCGAGGGGTGGGCATGATGAATGGGAGCCTGCAGTCATTTCCTACTGTTTCAGGAAGC 

TTTCTCCTTAACCCCTTAGAAAAGGCTGTGGAACTTGAGCTAAAATATGTCTTACCAGGT 

TGCGTCTAATGCCCCCCGTTCCCTACTGGGCAGAAAGACTTGGGTGCTTCCTGAGGAGGG 

ATCCTTGGCA 

Sequence 400 

AGGTACTGCTTAATTCTTTTTTCTCCCTCCCTCTAATCC I Mill I GACTGTCACATTTG 

TCCTAATAGCAAGTTAGGACATGTCTGTTGGCTCTCGAGATTCATGGGACAGCAATGCAG 

CAAATCTAGCCATAGTATTTGCTCTCTCTAGCCCTGCCCTTTTTCCTGTCCAGTGAATAT 

CAAAACAGGTAGAAAACATGGCCTGAAGGATTGTCTCTGCCACCACCTCCATATGCATTT 

TACCAGTAGTCCTGTCATACAGGTTGAATTAGTTTTATGTAGAACAAGTCATGAACACTT 

TAGTGTGGAAAAATAGTATTATATAAAGCTTAATATTAA 

Sequence 401 

AGGTACATTATCAGANACAGTGGTTGACCTC I Mil I CCTTTACTCCCTTTTCATCTGAG 

AGAGCCTTTTAGAGATCCGGAATCATTTGCTGTCTGCAATTACTATAGGCTTTGGCTCAC 

AATTCTGGGGAAAATGCCAATTGAAGGMCCCTGCCTATACATTTTATTTTC I I I I ICTT 

CGAGACAGACAACCTCAAMTAAGGTCCAAATATTNGGTTCCTTNAAATGGTGTCAAAAA 

GAATAGTATTATATGAGGAGGATAGTTATCACAGAATAAGAACTAAAATCCCATTTTTTT 

TTTTTTAGGAAAAAAAGACCTTCNATGATGCAGGTGTNTGTGTATAAGGAACTA 

Sequence 402 

AGGTCTCAGCGTGGCTACAAGTAACTGTGGTGTGGAAGCAGAGTAGAGAGAAAACTTGTT 

CCTCATTAGAGAGAGAGCCACACTTCTCACTGCTCACAATGAGAGGCCAAAGATTACCCT 

TGGACATCCAGATTTTCTATTGTGCCAGACCTGACGAAGAGCCTTTTGTGAAGATCATCA 

CTGTTGAAGAGGCAAAGCGCAGGAAGAGCACATGCAGCTACTATGAAGACGAGGACGAAG 

AGGTGCTGCCTGTCCTACGGCCCCACAGCGCGCTCCTGGGGAATATGCACATCGAGCAGC 

TGGCCCGACGCCTTCCTGCAAGGGTGCAAGGG 

Sequence 403 

AGGTACATTTCCATGGGCCCTGTTCCCATTGATGTATACTGCTTCCTTACTAACAGTGAG 

GGATGACTTTCATCAGTCTTTTATCACCTGAACAGTCTTCCGGCCATAATGATAGTAACT ^ 

ATAAGCTGATGCAGCTGTGGTGAAAGCTGTAAAACACCTTTTATGGAAGAAAAGAAATAA 

AATGTAGTTGTCAAGTCTAAAAAATAGTAGCAACGGGAATCATAATGAATACATGCAATG 

AATTTAAAATGTAAAAATGAATTTAAAAAGTAAAAAGGGCTCT 

AACTACAAGAGTCTAAATACACTGCTTTTCTTTAAGAGTTCATTTT 

Sequence 404 

AGGTACTCTTGGATGTGTTTTCTCACCAAGATGAGCAAAGAAAGGTTTGCACAGAGGAGT 
GTGAATGTGTGTTTGTTGCTGGCTGAATGGCAATAGATGTCTAAGGTGGATTCAGTGTCT 
GGCACACTGAGACACCTCCAAGAAGGAGATTGATGCATCAGGTTCAGTTTAACCTGGAAT 
ATCTGACTACCCCTGAATCCACCCAGAAAGGGGGCCCAACACCCTTGTCCATTTATGGGT 
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AT I I II I r ICGAAGTTATTAAGCATATTCCTTTTCCACGAACCTCTTCTGTACCTGCCCG 
Sequence 405 

AGGTACCTCCTCATGACATGGGGTTTAMTAAACTACMGTTCTATTACCTTTTTTAm 

TCCAGGGAAAMGAAACTTGGGAAGGCTATTTTACACAATTMTTACACATTGACACCAC 

AGTT CTGT TTGGATAAATTGGATACTGTATCCTGGAGTTTAGACAGGGAAAAGTGATCCC 

AATGTTTTACTTTCAGGTGTAAGATTTAAAACAAGAATTTGGACCAAMTGTTCAT^ 

GCAGTTACTTTCTGCTTTCCATTTTTMGCATGTTATCTTCATGTAGTCCACCTGAACAT 

CAACAGTTCAGCCTCTACAGAGTTTCTGTAATA 

Sequence 406 

AGGTACGTTCAGATGTAGCCATGACTGGAGAAATTACACTGAGAGGTCTTGTTCTTCCAG 

TGGGTGGAATTAAAGACAAAGTGCTGGCGGCACACAGAGCGGGACTGAAGCAAGTCATTA 

TTCCTCGGAGAAATGAAAAAGACCTTGAGGGAATCCCAGGCAACGTACCTGCCCGGGCGG 

C CGAGGTACTACTAATTCTA C AATG C CTTTCTC7TTAGTC AGTATTAAAAATCTTTTTTA 

AAGTATTGGTATGAAAACAMTTTTTGTTGCCCTGATAATGGGAATTTTAAAACTACCCA 

CAGTTTAAGAGAAAACATAACTTGGTAAAAAAGGTAGCCAATAAAACCACA 

Sequence 407 

AGGTACTCAGGGGAAACTGGAAGCCGCTGGCTCTTTCAATTCTGATGATGATGCAGAGAG 

CTGCCCAATCTGTCTCAACGCATTCAGAGACCAGGCCGTGGGGACGCCGGAGAACTGTGC 

CCATTACTTCTGCCTGGACTGCATTGTCGAATGGTCCAAGAATGCCAATTCCTGTCCAGT 

TGATCGAACTCTATTTAAGTGCATTTGTATTCGAGCTCAAtTTGGTGGTAAAATCTTAAG 

AAAGATCCCAGTGGAGAACACCAAAGCGAGCGAGGAGGAGGAGGACCCGACCTTCTGTGA 

GGTGTGCGGCAGGAGCGACCGTGAGGACAGGCTTTTGCTCTGC 

Sequence 408 

AGGTACCAGTTCTAAATGGGCCAAGATCCTGGGGACGCCAAAGTTATTGGCTGGATGGCT 

GATGAACTGAGGCCAACTAGAGACTGGTAGCACAACTGGGGACCACTTAGGGATGGAGAG 

TGAACACATCCTGGGCCTGGATAGGACGGGGAGATGGTGAGTGATGGTCAGACGTGGCCC 

TTGGCTCTCACTGTTGTTATCCCTCTCACGTTCACATAGGGGCACACGCAGGGCCAGCTT 

CATGGGTGTGCAACCTGTGCAGCCACTCAGAAGGGCCCCATGCTGGGTTTAATGCTTGGC 

TGTCACCATCATGAAATTAATAATGGTTGAACAGGGGCCCTGCGTT 

Sequence 409 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGAAATAAAAATTAG 

CAAACAATTA7TCTAGGGATATTTTCAGAT7TTACTTCAT7TCTTGAAATGCGTGTGCCA 

TATGCAATTGCATTTCTTGTGCCAAGAAACTAATAGAACTTATTTCACTTTACCTTT^ 

TAAAATGTGAATTTAAGTTATTATAGTTCAATlTTATGGCCTTACAGATGGCTmA^ 

TGTTTGCAGCTGACACTGCAGTTCCTTTCATGCAAAATACCATAAACTGTTTGATGAAAA 

TCATGCCCCTAATGGAAACTCTCTAGTTTTTCCATATAACTATCCTACTGTACCTGCCCG 

GGCGGCCGCTTCGACCA ACAT GTGGTGAGCATTCCACGGGCGCATGAAGTCTGGGTGCTG 

TGCTCGAGTCTCTGAATATTTTGATAGGAAAGCGACAAGAAAATTCAA 

Sequence 410 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGAAATAAAAATTAGCAAA 

CAATTATTCTAGGGATATTTTCAGATrrTACTTCATTTCTTGAAATGCGTGTGCCATAT^ 

CAATTGCATTTCTTGTGCCAAGAAACTAATAGAACTTATTTCACTTTAC C I HIM I AAA 

ATGTGAATTTAGTTATTATAGTTCAATTTTATGGCCTTACAGATGGCTTTTATTT^ 

GCAGCTGACACTGCAGTTCCTTTCATGCAAAATACCATAAACTGTTTGATGAAAATCATG 

CCCCTAATGGAAACTCTCTAGTTTTTCCATATAACTATCCTACTGTACCTGCCCGGGCGG 

CCGCTTCGACCAA CATGT GGTGAGCATTCCACGGGCGCATGAAAGTCTGGGTGCTGTGCT 

CGAGTCTCTGAATATTTTGATAGGAAGCGACAAGAAAATTCAAACTGCTCTTTGCTGACT 

ACTGGNAAGTGAAAAAGATGCTTCAAGGTTTANCCATTCAAAGGAAACCATTAGGCCTTT 

TCAAAAAAC 

Sequence 411 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAGGAGTTTCCCTATTTTGGTGTTCAGCTTGAA 

AAAGGACTTGTCAGAATCAACTGTGTCATCAAAATTTAAGTAATGTGCATTGAAAATAAG 

GTTGATCATGGGAATATGCAGAATTTCCAATGTATTTTTAAATACAAATAAAATTGTAAT 

TTAGAATTTTTAATCTTAGGTTTCTTGATTAATTTATAAGAGATCAATTATT 

TTTTTGTATGTTTTTTAAAAACATAGTCCAGAGCATGGGCAGAATTGACACCTCTCTTTT 
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AAGTGAAATTTGGATTGCTCACAAAGCACTAGGAAATGTCATGGGGTTCAAATATATATC 
CTAC ACAACTGGGCAATACATnTTGTTTG ATTTTTAGGTC TG TGTATAC ATTAAC AGTT 
CATGTAATTAATACCTGATCATTTGGGATAAT 
Sequence 412 

CCGCGGTGGCGGCCGCCCGGGCAGGTTTGGGTCTGAAAGTCGATGAAGGACGCAGATTAC 

CTGCGATAAGCTTCGTGGAGTTGGAAATAAACTATGATACGGAGATTTCCGAATGGGGTA 

ACCTMCTGAGCAMCCTCAGTTGCATTTTGATGAATCCATANTCAAATTAGCGAGACAC 

GTTGCGAATTGAMCATCTTANTAGCMCANAAAAAAAAAAAAAAAAAAAAAAAAAM 

TTGNAACCTN 

Sequence 413 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTTACACTCCAAGGTAACAGAATAATAATACAA 

TCTTCTATTTATTGCTCCCTATCAATGTTCTAAAAACCTTACATTTATTTAATCCT^ 

AAAATCCTGTGAGGTAGGTAGTATTATCATCCCTAATTAAAAGATGAGGAAATTGAGACA 

CAGAGAGATTAAMTTGCTCAAGGTTACAAAAACAGTAGGTGTTAGAGCMGMTTTAAA 

CCCAAGCAAGTCTGACTCTAGAGCCCAGGATTCATCACAATATTCATTTCCTATGAGCAA 

ATCTCAAGGTTTTTGGATATCTATAGTAMCCATTTATACTTCTAGTCAAGTAACACATA 

TGAAA ATTC ATGTCCACACCATGGGTGAAAGAGGTTGTTCAAAGAATAAAATGAATGCCT 

TGAAATTTTGGCAGATGGATACTACCATGGATAATA 

Sequence 414 

GCGGNGGCGGCCGAGGTACTATGAAAAGTTGATTGTAGCACAACTGTTCAATNAGTAAAA 

GGTCTTCGGCAMTTCCCTTTAGAGTATACTTTCTATAAACTACATGTTCCACAAAAAGG 

TCAATTATATATACAATTGATTTGTTTTACTTAATCT^^ 

CAATGAGAATATTMGCCTGAGGCTAGTTCTCAGTGTATAGGTTTAACAAATTAAGGCTC 
ATTTTCCCAAATCAAAATAG! I I I I AGTTTTCCTTTTAAATTATGAATTACATTCATAGT 
ACMGAAGAAATGCTTAAGGAAG AATTTCAAAAGAAATCTG AGC N GTTAAATAAAGAG AT 
TAATCAACTGAAA 
Sequence 415 

CCGCGGTGGCGGCCGAGGTACTTGGCATATATTAGGCCTTCAAATGATGATGGATGGATA 

GACAGATAAAAGGCCCAATGAATGATTGTAAATGTGTCTTCTCCTATGTATCCATGATGA 

TATTGCCTAGTTTTTAATACAAGCTTAAGAATCAATAAAATGATAGATCTGAAATCTGTA 

GATGCCAGTCTGGCTCAGTATTAATACTGACAGTAG I I 1 I I I IGM I I I I AAGAAAGCAA 

GTATATTAAAATATCTCTGTTAGAAAAGATTACAATTTGAAAGGATCATTCTACAATCTT 

AGAACTAAAAAGCAATTTAAGGCAAAGAGGTACCTGCCCG 

Sequence 416 

TNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATAGAGCTTAAATAA 
TATCAAAATGCAAATATAGATTGGGTGCACTGTTAAGCTGAATTGCAAATTATGGCAACA 
CACACTGGACTGGGGTATACCGTTGCTTTGATATCACCATTTGTTTGTTTATGTCATGCA 
GACCACAATAGTCAATCTTTTGTTTTTC I I I if IGTACCTGCCCG 
Sequence 417 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGCCAGGTACTGCCATGGCCA 
GTAAAAATGAAAGTGGATAGATACGGAAATACACACAAAGTGAATTTTGCTAAAACAAAT 
CAATGGCACTTAAGACATTTCTATTCTAAAAGGCTTATTCTTMGGCTTTTCTATTCTAA 
AMTTTGCAATGTTTTMTTGAATATTA^ 

ATGAAATATCAGGAAATAACTCACTGCTCAGTGGAAACTCTCATGGCTGNTGTTTTTTGT 

TGTGTTGTTTCAGGATTGGAGGTTTTAAAGCAGGAAACAGCAGTTGTTGAAAACGTCCCC 

ATTTTGGGACTTTATCAGATTCCAGCTGAGGGTGGAGGCCGGATTGTACCTNGGCCGCTC 

TAGAACTA 

Sequence 418 

ACTATAGGGCGAATTGNAGCTCCCCGCGGTGGCGGCCGAGGTACTTTATTATATTGGGTG 

TCTTGTC I MM I GGGCTT7TAGTTAGCTAATGAATGAATACATTA6ACACTTTTGGGTT 

TTAGTTGGGATTTTACATAGCTTGCATTTTAATTC 

CCCACAGCATTATTTTAATAAA 

Sequence 419 

CCGGGCAGGTTTGTGGTATTCATTATTAAACACTAAAACTTTGGCGGNTCTTGCATAACA 
TTGTCAG A I \\\\ \ I AGTGTATTTCTGTGAAG AC A I I I I I 1 I I CTTGTCATTCCTTTTGT 
AGTANTTGCTGCTTNGGATAAAAAGTTTGATGTGTGA I Mill I'ATTAAACAANATAGTA 
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AACCCTTCAATTATAAGTTAGTNCTTTGGTGGAAGTAAGGATGGTTTGTTAGGACNTTTA 
TTAGGTTCTTTAAATTTCATTTGGCCACCAMCCGTGGACTGGTTTGTAAGTCTTAAACC 
ACCCC AAAATTAAGTGGTCGTTN GGCC AAATTAACTTTTTTTT AAAMTGGGCTTG N AAN 
AACCACCCTTAATAAAATTNGTTNCATTCAGNAAAATTATTCTTTGTTCANCNTGNCTTC 
CTGGTTTGCNCAAAATAC C NTCTAN N N AAATTAGGGAAACTTTTAAN NACCGTTTATG NT 

TCCTTGAAGGTTCCCCCNTGGGAGGAANCCACCATTGGCCTTAANAANGTTCCCNAATTG 
GAAAAAAAAG 

Sequence 420 

CCGCGGTGGCGGCCGAGGTACTTTTGGTTATTTTTCTGTCATCAAACAAAAACAGGTATC 

AGTGCATTATTAMTGMTATTTAAATTAGACATTACCAGTAATTTCATGTCTACTTT^ 

AAAATCAGCAATGAAACAATAATTTGAMTTTCTAAATTCATAGGGTAGAATCACCTGTA 

AAAGCTTGTTTGATTTCTTAAAGTTATTAAACTTGTACCTGCCCGGGCGGCCGCCCGGGC 

AGGTACACAACTGGAAAGACTGCTGTAATAACACAGCCTTGTTATTTTTMGTCCTATTT 

TGATATTAATTTCTGATTAGTTAGTAAATAACACCTGGATTCTATGGAGGACCTCGGTCT 

TCATCCAAGTGGCCTGAGTATTTCACTGGCAGGTTGTGAATTTTTCTTTTCCTCTTTGGG 

GATCCAAATGATGATGTGCAATTTCATGTTTTAACTTGGGAAACTGAAAGTGTTNCCATA 
TAGCTTCAAA 

Sequence 421 

CCGCGGTGGCGGCCGAGG I I 1 1 I I I I II I I I U I I I I H \ G CAAAG ATTTTTTTTATTTC 

TACTGGGGAGGGAGGAGGATAMTAGAACTGTTTTCCAATTTGCTCTCCACTGTATACAC 

ACATACCCACACACATACATAAACATACATNCACCAAAAATACCCCAAACAAAAAACAAA 

AAAACCAGGAATCAAAMCCAAMCACCCTCAAACTGCACCMTACTTCATATTTTGACC 

AAAAAAATATCCTGGGAGGAAGTGCAAAATGCAAAATCAAATGACCCGAAAAATGCTGAA 

CAAACAGCATTATTAATATACAAMTATTTATATTACTGAACTAGTAACAGGTGATGTTC 

TCCATGTCTGTAAAACTTTGGAACCACATAGCTGATTTGTTAAATCTAGTCCATGCCAGC 
TTNCAAAAACCAAACT 

Sequence 422 

TATTGGGCGAATTGGAGCTCNCCNNGGTGGCGGCCGCCCGGGCAGGTACNATATACGAAG 

ACTCT GAGC TGTTTGCCTCCGATGGTTTCCAGTATTTGCCCGTTGTAAAGCTCATTAAGG 

CCAACTTTTACTTTCAATATGTGATTCTGCAGAATTAATTTAAGGAGGCGCTGATCCATG 

CTGAGAGTATCATCAGAAAATGCATTATTCACAGGTGCCAGCAAAGGTGTNTTCTCCATT 

TG GCCTTAMN CCAAATNCCCAAANCTrNNTTGGGCCCAAAAANNCCCGGAAAGNGGGGT 

TTG I \ \ \ I ITT CCCGCCCCNTTNAANNAI MINI 1 GGGAAAAANNCGGGAATTTGGCCC 

ntni nrm iaaaaggggggccccccci 1 1 inn inaaaaaaaanantttttntntncc 

CCN NNTTTN NGNNCCNTTNTTTGNNNNGGGGGGGNNCNCCCCNCNAAAAAAAANNAAAAN 
NTNNm i \ I NCCCCCCNNNNANGAGGGGGGGNNCCCCCCTNTTTNANAAAAAAAAAAAAN 
NNNNNNNN 1 ITTTTTTTTTTTTTTTNC 
Sequence 423 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGATTTCAGTAA 

GTCTCATAGGTTAAAAAAAAAGTCACCAAATAGTGTGAAATATATTACTTAACTGTCCGT 

AAGCAGTATATTAGTATTATCTTGTTCAGGAAAAGGTTGAATAATATATGCCTTGTATAA 

TATTGAAAATTGAAAAGTACCTCGGCCGCCCGGGCAGGTACCATCTTGGCGGATGACTTC 

AGCATTAGATGGTCAGGTGTATTTTTTTATTAATAGGCGCATTCAGTTTGGAAAAGGCAG 

CTTTTCATTAATGTCCCAGGTAAAGGAATATGGCCTATACAGTACCTCGGCCGCTCTAGA 
ACTAG 

Sequence 424 

CTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACAGTAACATCCAAGAGCC 
CATTCTACAGTGGGTGGTTTTGGTCTTrTTATAACTTTTTCTCAMGTCACTGATGm 
TTCCTGTTAAATGTATATGCA 
Sequence 425 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGTGGATCTAAAAACAAAC 

AACAACTCAAAAAAAAAAAAAAAAAGAAAGAAAAAAGAAAAAGAAAACCAATGCAGGTGA 

AGGTGTTACCAGGAAGTTAAGCATGCCAAAAGGTGTTCCGTGCGAATGAAAACCTAAAGC 

CAAGGTACTCTCCACTCTTCCAGTCAGAGTGGCACATCTTGAGGTCACGGCAGGTGCGGG 

CGGGGTTCTTGCGGCTGCCCTCTGGGCTCCGAATGTTCTCG 

Sequence 426 
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TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTTAAAAATG TAAT CTA 

GTTGGCAAAGGTGTGCGCTAAMCACGGAACCGAACATGCATTGATTTGGATAACTTTTG 

AGGG I 1 < I I GNCAAATAGCATGTGAAGAGTTACN1 I I 1 1 C TTAAAAG ATTGGTGGTCCCA 

ATGTCAGAGTTCTTGGAAC AGATAACTGAATGATAGA T 1 rTTTTTl I AAAGATAAAACTT 

TACAACCTGCACATTTGTTATGCATACTAAATGGNGTGTTAAAATTAGGGTTTCTTTGCC 

TNTCTACACTACACTAATCTGCCTAAAGGNGGTTGTTTCATATTTATAATGCTM TTAT C 

ATACCTACCTACTTTAAATTTTAGGTAGAAAATTATCTGATTTAAATACAAACATNTTT^ 

TCTCACATTGAGTAATATGCATAATGTAAGTTCAAATGTATTTCAT 

Sequence 427 

TCACTACTTAGGGCAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTGGCA 
TATATTAGGCCTTCAAATGATGATGGATGGATAGACAGATAAAAGGCCCAATGAATGATT 
GTAAATGTGTCTTCTCCTATGTATCCATGATGATATTGCCTAGTTTTTAATACMGCTTA 
AGAATCAATAAAATGATAGATCTGAAATCTGTAGATGCCAGTCTGGCTCAGACCT 
Sequence 428 

CTTAGGGCAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACGCATACAGGGCAGATGT 
CAGAGATTATGATCACAGGGTGCTGCTCAGATTTCCCCAAAGAGTGAAAAACCAAGGGAC 
ATCAGATTTCTTACCCAGCCGACCAANATATTCCTGGGAATGGCACAGTTGTCNTCAACN 
TTCCCACAGNGTGGNTGNTGTTTCNCCCNTTTATCCCTNTTTGCCCCNCACCCCAAAAGA 
ANANTNGGTTNTNTGGCCCCNAAAAANNAGN I 1 1 II II II i U I AAAAAAANAAAACCCT 
GGGAACTNTTNGGCTTCCCNCAANGGCAATTTTNTNNTTNTTNCCCTTCCCCGGGGGGNG 
GNCCCNTTTTTNAAAAACANAGGGGNNNCCCCCCCCCNGNGNCGGGGANNGAATTTTNNG 
ATTTAAAAAN I Nil! 1 1 NNNNCCCCCCCCTNNAACCNCCNTNGGGGGGGGGGGGCCC 
Sequence 429 

TAACACTAC7TAGGGCAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGCG 

TGTTCAACCGCAACGAGGATGCCTGCCGCTATGGCAGTGCCATCGGGGTGCTGGCCTTCC 

TGGCCTCGGCTTTCTTCTTGGTGGTCGACGCGTA TTTCC CCCAGATCAGCAACGCCACTG 

ACCGCAAGTACTTCCTTTGGCAGACGGCTATTATGTTTTACTTTAGTGCCGTATTGTGGA 

TGCCAGAGAACTTGCATTTGCCAGGGAATAATCTCCTGAGATGCTATATTATGAAGTAGA 

ATGGTGTAGTCTAATTCAATAAAGTCATCATTTCTT TGCA CTTGCTTGGACAGTTTGAAT 

TTTTACCATTTTATACCATGTGTCTTTCAGTAGGAATTTTGCCATATTATATAACCACAG 

GCAACCCMGCTAAAATGTAGAACGGAGCCGTCATTTCTGGGAGATACATTTCAAA 

Sequence 430 

ATCACTACTTAGGGCNATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCTTCACAGCTG 

ATCCAGMGTCAATATTCTCCTCACTATATTCAGACTTCAAGAAAGCTTTGAAAGCTGCC 

AGCCCACATTCATGACTMTCAGGTTTTCCAGTGATTCAGCCCATTTCTTGACTTCCTCT 

TGGCTCACTCTCTGGCAAATAACCACTTTGTCCTTNTGGTT GGGG GAAAGAATTGGGGTC 

ACAGGAATCAAAATTTTGCAGCAGGGAAACCTAGCCCGTGGTTTTCATATTNTTTGGACT 

CCTAAAGAAMNAMCCCGGAAAAACCCTTGGNAGGCCNTTTNGCAAATTNTTNTNTAAA 

ACNGGGTTGGANATTCCCCAGGGGGGANNAAAAAANATTNGCGGGAAGNGGGNGGNNCTT 

TTTTTTTTTNTNTTAAAANAANACCCGGGGGGGGGGGNANAAAANNNAAANAANTNCCNN 

CCCCCCGGGGGGGGGGGNNNTmNMTTTTTTTCCCCCCCCCCCCCCNGGGGGGGGGGG 

GGGGGGCCCCCCCCl l rTTTTTTT I I II 1 1 I M 

Sequence 431 

AATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGAGACCTCCTTACTTACCCCCCTTCT 

CCTTCGGCTGGAGCTCGGCGAGCGAGAGGCGGCGCTGGCGTTGGAGAGCGTGCTTGAGAA 

TGAAATGTTTCCCATCATAGTGGATTCTTAAGCACGTTCTCCACGTATGCGGCGTGCTAG 

CTGGATGTCTTTNGGCATAATTGTTACACGTTTGGCATGGATAGCACACAGGNTGGGTGT 

NTTAAAAAN GGCN AACCAAATGGGCCNTTTTN G N CTCCN A AAAAAAC CC AAN AG \ III iC 

CTTTGNNANNNCCCAAAI Hill II NAAAGGCCCGNGAAA ATTTTTTNC CNCAAACCCCT 

TGGANGGGANATTTNGGGAAAANAAAAAATTTAAGGGGGC I W WW I NAAACCCTTTAN 

NTTNTNGGGGGGGCCCCAAAATTTCNTTGNCGGGGGGGGGCNTTTTAAAAAAAAAANNGG 

GGCCCCCCCGGNGGGGGGGGGGNANTTNNANNAAAAATTTTTCCCCCCCCCCCCCCCCGG 

GGGGGGGGGC 

Sequence 432 

GGGGAGCTCCCCGCGGTGGCGGCCGAGGTACATCGATGTTGATCTTCANGTTTATCTCCC 
CGCGACTTGTCCACGTAGAGCTCAGGATGCACCTCCGTGGTGAGGTAATACTGCACCTGC 
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CCGGGCGGCCGAGGTACATGCCTAGACCTGGGCTCCGGCCAGCGCCCAACAGCGTGGATG 

TCGATGACTTCATCAATACGAGAATACAGGAGGCAGACAATGACCCCACG 

Sequence 433 

ATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATGTATTGCCATTTCTCTGCTATATAGT 
AATATTTCTTGACACCMTGGTAATGTGTTTGTTTCTTTCTCAGCACCAGATGTGTCCAT 
ATGTATTTAAACAGTAGGAGCATTGTTTCTAAATTCATCTATGCTATGACATATATAAAC 
CCATTATTATTATGTTTCACTAMGTTCCAGCACTT^ 

TTTTGGCTCATTCATTTAGATMTACTGACCATTTTGCACTACAATTTCAAAAGGAACAT 
GAGAAATTTGGATTTCTTTGAAAGAGTCAAATATGTAATTACAGMTTGAAACACTGNGT 
TAATTCCCAAAGGGGGTGGTAATAATTAACATTAA 
Sequence 434 

CGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAGGAGTTTCCCTATTTTG 

GTGTTCAGCTTGAAAAAGGACTTGTCAGAATCAACTGTGTCATCAAAATTTAAGT/vATGT 

GCATTGAAAATAAGGTTGA TCATG GGAATATGCAGAATTTCCMTGTATTTTTAAATACA 

AATAAMTTGTAA TTTAGA ATTTTTMTCTTAGGTTTC 

ATTATTGTCAGTCTTTTTTGTATGTTrm 

GACACCTCTCTTTTAAGTGAAATTTGGATTGCTCACAAAGCACTAGGAAATGTCATGGGG 
TTCAAATATATATCCTACACAACTGGGCAATACATT^ 

ATACATTAACAGTTCATGGAATTMTACCCGGATCATTTGGGATAATGr4AAGTGNAGTTA 
GTTGTAGATGMGTAAAAGNTATAAAAGGAGATTAAAAATGCGGTMCCTTTTA 
Sequence 435 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCTTATTGATA 

GTGGAATTATATATTTTTACTCTATGTTTCTCTACATGTTTTTTTCTTTCCG 

AAATATTTGAAACTTGTGGTCTCTGAAGCTCGGTGGCACCTGGMTTTACTGTATTCATT 

GTCGGGCACTGTCCACTGTGGCCTTTCTTAGCATTTTTACCTGCAGAAAAACTTTGTATG 

GTACCTCGGCCGAGGTACACMCTGGAAAGACTGCTGTAATAACACAGCCTTGTTATTTT 

TAAGTCCTATTTTGATATTAATTTCTGATTAGTTAGTAAATAACACCTGGATTCTATGGA 

GGACCTCGGTCTTCATCCAAGTGGCCTGAGTATTTCACTGGCAGGTTGTGAATTTTTCTT 

TTCCTCTTTGGGGATCCAAATGATGATGTGCAATTTCATGTTTTMCTTGGGAAACTGAA 

AGGGTTCCCATATAGCTTCAAAAACAAAAACAAAATGTGTTATCCGACGGGATACT 

Sequence 436 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAGGAGTTTCCCTATTTTGGTGTTCAGCTTGAA 

AAAGGACTTGTCAGAATCAACTGTGTCATCAAAATTTAAGTAATGTGCATTGAAAATAAG 

GTTGATCATGGGAATATGCAGMTTTCCAATGTATTTT^ 

TTAGAATTTTTAATCTTAGGTTTCTTGATTAATTTATAAGAGATCAATTATTGN 

TTTTTGTATGTTTTTTAAAAACATAGTCCAG 

AAGTGAAATTTGGATTGCTCACAAAGCACTAGGAAATGTCATTGGGGTTCAAATATATAT 
CCTACACAACTGGGCAATAA I I II 11 I GTTTGATTTTTAGGGCCTGNGTATACATTTAAC 
AGTTTCATGTAATTAATACCTTGATCAT 
Sequence 437 

CCGCGGTGGCGGCCGAGGTACTTAAGCACTGCAGATGCTCCAGTAATATGCCCATAAGTT 

CCTTTCCMTTTCAATTACTGGGAAAATATACATATGGACAATAGATGGATGCCACAATA 

AAAGGCTGGCAGCCTMCCCTCACATGAATTTTTCCCTACCTCTATTTAGGGTGACAGTG 

GAGGGCCTCTGGTTTGCTTTGAGAAGGACAAATACATTTTACAAGGAGTCACTTCTTGGG 

GTCTTGGCTGTGCACGCCCCAATAAGCCTGGTGTCTATGTTCGTGTTTCAAGGTTTGTTA 

CTTGGATTGAGGGAGTTGATGAGAAATAATTAATTGGACGGGAGACAGAGTGACGCACTG 

ACTCACCTAGAGGGCTGGAACCGTGGGTAGGGGATTTAGCATGCTGGAAATAACTNGCAG 

TAATCAAACGAAGACACTTGTCCCCAGCTCCAGCTACGCCCAACTCGG 

Sequence 438 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAGAAACATTTTCT 

TTTATTGTTACTTGCTTnTAAACTTTGTTTAGCCACTTAAAATC 

TTGCCTCAAAATCCATTCCAAGTTGTATATTTGTTTTCCAATAAAAAAATTACAATTTM 

AAAAAAAAAAG AAAC AAG AAAAAAAAAAAG AAAAAAAAAAN N AAAAN AN AAAAAN AAAN C 

ANAAAAAAAAAAAAAAAAAANTANANAAAAAAAAAAGGAAAAAATTCTGCCTANC^ 

AAAAAA ANAAG AAGAAGAAATGGTTTTNGCNNNGGNCCCNNGGNGGNCTTTCTTTTNAAA 

NANATNTTTTTNAAANGGNTGNNTAAAAANANANNAACAATGTNNNNNAAAAAATCTTTT 
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TTNTAAACCNNGGGGCNGTTNNTTTTTNNAAGCAACC^ 

AAAAAAATTATNGGGG rTTTTTTTTTI N AAAAACCCNCGGGGGCCCNGGG 11 111 1 1 i M 
AAAAAAGGGGGGGGGGGCA 

Sequence 439 

CCGCGGTGGCGGCCGAGGTACCACGACTCTTATTTCCTGCAGCTTTTGAGAATTCTAAAG 

ATACCTGAGTCATTTTAAATAAATTTCAGCTTTCTTTACCAAACTACCCTTAACCGGATT 

CTCCTTGAAATGACATCACCTGACACCCATGGCATGCTGCATGCCCACAGCTAGATTACT 

TTTTAGTGCGCCCACGCACTTCTGCTTrTATTAGGTGAGGCAGAATCAAGATTCCCTTTT 

GTTGGATCTTGAACCTGTTCATGCCACTTTGATATTCTAAATTCATACATAAACCAATGA 

AATATATGTGTTAGAAAAATTGCTATTTCTAGCTGGACGCAGTGGCTCATGACTATAATC 

CTACCACTTTGGCAGGCCGAGGCAGGAGGATTGGTTGAGGCCCAAGAGTTTGAGACCAGC 

CTGGGCAACATAAGTGAGACCCTGGTTTCTACAA 

Sequence 440 

CCGCGGTGGCGGCCGAGGTACTTGGCATATATTANGGCCTTCAAATGATGATGGATGGAT 

AGACAGATAAAAGGCCCAATGAATGATTGTAAATGTGTCTTCTCCTATGTATCCATGATG 

ATATTGCCTAGTTTTTAATACAAGCTTAAGAATCAATAA AATGAT A GATCTG AAATCTGT 

AGATGCCAGTCTGGCTCAGTATTAATACTGACAGTAGTTTTTTTGTTTTTrMGA 

AGTATATTAAAATATCTCTGTTAGAAAAGATTACAATTTGAAAGGATCATTCTACAATCT 

TAGAACTAAAAAGCAATTTAAGGCAAAGAGGTACCTGCCCGGGCGGCCGAGGTCCCCCTT 

TTTTTNTNTTTNATTTTNCCTNTTTNTCTTTTTCTTNTTTACTGCC 

GTTGGANCCCTTTTTTTANTTTTTTNCAACACCCCCCGNCTTTATMN 

NTTTTTTTTT1 IAAAAAA 

Sequence 441 

AGGTACCAGGTTTTA \ \ I < I CACCTTATTCTCTACTTTAAACAAATCATAACTTTCTCTT 

TAAGCCTCTGCTATAAATTCTCCTGGCTCTCCTGGGCTTNCATATTTTGGGGGCTTGGGG 

TGTCAAAAGTGAGATGAAGTTCTTTAGCTCCAGGTTTTGGGGTAAACCANAGGTAGGTAA 

CATTGTTGGGCATTTATTTTGCAANTTAANCAANTACTTTCCTTGNNACTNGGNTGCCGG 

GTGGCTCACGTNTTGTAATNCCCAGCACCTNTTGGGGGAGGCNTTGAAGACCAGGTNGGG 

ATNCAACNNTNNAGGGGTCGGGGTAGGTTTCTNAGGAACCATGGCGCTGNGCCNCAAACN 

ATTNAGGCCAAAANACCCCCTTGTNCNTTTTNACTTAAANAANANTNACCAAAAAAAATT 

TAANTATTGGGGGNTGGN I MM I GGGGCAC 

Sequence 442 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTCTTTGCCTTAAAT 

TGC I I I l I A GTTCTAAGATTGTAGAATGATCCTTTCAAATTGTMTCTTTTCTAACAGAG 

ATATTTTAATATACTTGCTTTCTTAAAAMCAAAAAAACTACTGTCAGTATTM 

GCCAGACTGGCATCTACAGATTTCAGATCTATCATTTTATTGATTCTTAAGCTTGTATTA 

AAAACTAGGCAATATCATCATGGATACATAGGAGAAGACACATTTACAATCATTCATTGG 

GCCTTTTATCTGTCTATCCATCCATCATCATTTGAAGGCCTAATATATGCCAAGGTACCT 

GCCCGGGCGGCCGCTCTAGAACTAGGTG 

Sequence 443 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGNACTGACAGGAGA 

TTGGCTTTTACACTAAAAAAGACAGAAAATGGACTAAATAGGAACATAAmCAGAGTn 

ATGAGTCAGTTTACAATATGCCATATTAGCGATCTCATGATGATTCAGACCACAGTCTGG 

GTTAAAGATGTCTTGACCTCTGAGGCTAGCATAACATATATGAGAATACAACTTGCCTAT 

TTCCAGAAAATCTGTATATTTTTATAGAATAACTTTATTTACAATTTCTATCATCCAATT 

ACTCACTAGGCATTATTTGT 

Sequence 444 

ACACTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAGCAGA 

AGT7TGCTGTCTCTAGGATTCATATAGCACCCACAGAGCTCCAAGTAACCAAATTCCCCC 

AAAGACAGGAGGTGTGGCTGAGGAGGAGTGACCATATTGAGTGTAGCTTTGAACGCCTCC 

GTTACTCTTTAGGAGGCGACCGCCCCAGTCAAACTACCCACCACGCACTGTCTCCTTCCC 

AGATAAGGGGAACGGGTTAGAAAATCAATTTAGCAAGGGTGGTATTTCAAGGTTGACTCC 

ACTAGAACTAAGCGTCCCAGCTTCAAAGTCTTCCACCTATCCTACACATGCTAAACCAAT 

Sequence 445 

NCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCCTTTTTTAAAAACAC 
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TGTAAAAGT AACC ACAAATATGTGAGGACTTACTATTTTAAATGGAATGGAATGAGCTCC 

ATAGATTAGTTTTGAATATAAAGTATATAAAAGTGCATCAGTGGTTTATATAGGCTTTAA 

AAACATGTTATCTTACAGTCCTTTAAAGCAGCCATAGAGTTTGTATCATTTTTCAAGCCA 

ATTTCAGTCAGGGATTTGAATTGTTTGATTATGGATGATAMTGTGTCATATCTTATTAA 

TATGTCTCATGTCTCGTTCCTTCTTAATATGATTTAGCTGGMTTCATTTTCTTTTCGTT 

TCATGTTTAATTTCATAAMCGTTTAACAATTGGCATATATACTTGGCATTCCTGTCCAC 

CAAG GGATTGTMTCCMGCCTGGGAAAATCTTAAATTTCTTTTTACTTAAATCTGGNAA 

TTTTGTC NTCATTCTGCCACC 1 I 1 I 1 < 1 I 1 [ 1 1 1 

Sequence 446 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAGCTCCT 

AAAAGAAGATTACTGCTGCCAACTTAAGTCATCTCCGTTMCGAAATTGCATTCTTGTGG 

CAGAGTTAA AACAA CA AGAG AAATTCAGTGTTTGCTGGTTCTGAATGTCATTTTTCCTCC 

CTGGTGTGGTTTTACATTTTCAGCTTCTTTCCCTTTTCTTCTCTCCCACCCTCAAAAT^ 

TGCCTTAGCATTTGTGTGCTTAATTAAATCCACTCTGTGCTTTATTGTTGGAGAATGTGG 

ACAATACAAAGATTTGGGGTGGGGTCATACAGTGTATACAAAACACACACACTATGTGTT 

TGGACAAAT TCGC CAAGCGTGAGAATCATCAAGTAGTGAGTTTAAMGTTTGAAAATCAA 

GACCCAACATTTTGGGGTGTTTAAAATATCTCCCGCCTTGAAATGGNTCCTGTTTAAGTG 

TTAGATNGGAGANCCCTTGA7TCAAAAACAAAAACCGA 

Sequence 447 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAGAAGAGAGAAATTCAAACAAAATATTGCTGT 

TCTTCAGTTTTGTTTGTGGAATTTGAAATTACTCAAATTTAAAATAAAT^ 

GGAMTAACATAGAATTGAAGTTTTAATTAAATACCACTCAAACGAAAAGAACAGTAGTT 

TTTGTAGTTTTATATTGGATACTGAGGCATTAGGGAGGCATGAAAGGAAGAGGAATGAGG 

ATTGAGACATGTGAAGACATTGTGCATTATATCAATGTGCATTCCTGTAGTTCATTAACA 

AGGTACCT 

Sequence 448 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAC 

CAAAATTTTTTATTTGACTAAATTCTATTCATGCATGAGCATGACAGTCTCCCGTTTATT 

TGACTTTGGCAATAAAAAAACAAGCAGCCTTGTACTTCATTGTGAGGTGGCTACAACTCT 

ATAATATGCACAGTGATTTTAAAATAGGCTTTTTGCATGCCTTGCATGAAAGGTGCTACA 

TACAAACCTGTTTTGTGAACTCTTTGGTAACCACCAATTTAAAAATTTGGATGAAAGCAT 

TTCCACATGGACAGATCTGAAGCACATTATTGGAGCTCTGAGCCAAAGCTATTACCCTGT 

ATATTGATTCTTCAGTTTCCTTGAGGGGTTAGGTGTTGATTTAGAATACAGCCAGATAAT 

TTAAAGCATGTCAGGCCCCGGTTAGGAAAATGAAAATG 

Sequence 449 

CCGCGGTGGCGGCCGCCCGGGCAG GTAC AAAAATAGGAATGGG1 Mil 1 ACCTGTTTAAA 

GTCACTTTGTGTTTATAACAAAATTACTTTTAGCTGAGGAACAMGGTGACAAAGATTTC 

TGTTGGTGGCTGAGAGTCAAAGCAGGCCAATCCACACCATTACCTGAMTATTTTTCAGG 

CTAATTTATAACTTTATGGATTTCCTCGATACAAGTTTATTAGTTTATTCT^ 

AGTTTATTCTCCCAGAATAGCAGCAAATAAAACTTGAATTGGATGTACCTCGGCCGAGGT 

ACTACGTGCCAGCTCTAGTTTTCAGCCTTGGGAGGTTTTATTC 

Sequence 450 

CACTACTTAGGGCNA ATTGG AGCTCCCCGCGGTGGCGGCCGAGGTACACTGCATATAAAA 

ATGGTCTAAGATGCAATTTTCCTCCATTCC 1 1 I ! I I GCTTTTAAAATACTGAGACAGCAT 

TTTAATTCAATATT CTAGGT TCAAACTGATACATTAAAAAAAAATCATACCAACCTTTAA 

TCATTCTACATCCATTTTTTAAAGTTAGCTAACAAGATGATGTTTCACTAAAATAAAATA 

TC CAAT CATCAGATTAAAGTGTAAAGTTTGTGTGAACAGGGAAATTAGATCATTTCTCTA 

AGTTTTAATTCCTATGTTTCTGAATGTTTCTTGAATTAAAAATTCATTT 

TTTCAAAACACGGCATCTCTCTTTCACCATTCCACAGAGAGAGAAAAGACTAGAAAATAC 

TTTAAAAAAAATAAATATTTTAAA 

Sequence 451 

CTACTTAGGGCAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTACGTGCCA 

GCTCTAGTTTTCAGCCTTGGGAGGTTTTATTCTGACTTCCTCTGATTTTGGCATGTGGAG 

ACACTCCTATAAGGAGAGTTCAAGCCTGTGGGAGTAGAAAAATCTCATTCCCAGAGTCAG 

AGGAGAAGAGACATGTACCCCACCTCTGAAGATGCTGGGGGAGGCAGCTGGGATGGGAGC 

CAGCCCCATGCCTGTCTGTGACCCCACAGTGGGTGAGAGCCCGTCACAGTCCTGGGGTGT 



Page 62 of 635 



WO 01/51628 



PCT7US01/00798 



Table 1 



GGCTGCTCTGGAAGAATTAGGAGGCAGCCATAATAAGAGTCTTCAGAGAGATGATGGGAG 
GGGCCAGTGAGGACAGGAACAGGAGAGTNNGATGTCCTATAATAAAGGGGGC 
Sequence 452 

CTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGGAGATGCTGCC 
TTGAGACAGCTGAGGTCACCGCGGCGCGCACAGGCCCCGAACGCTCAGGAGTGTGGTTGT 
TAGGAGAG CACACAGG TGTTCATACAGTGGCATTTGGGACACAATCGTTGGAACCTGAAG 
AATCTGAAGI I"! 111! I ACCACCATCTTTTTCTACTCTGTAAGGAAGTAGATCTTTATGG 
GGAAAAGAGAATTTGGGGTGTTCTGCAAGCCAGTCAAAGTGGCACAGCAAATCATATAAA 
TCGMTTAAATGGACAACACCGTTAGATGTGTATGTAAAAATTTTCTGTTTCATA1 111! 
CCTTTCACTTTCGGTTTAAAACATGCTATATGTACCTCGGCCGCTCTAGAACTAG 
Sequence 453 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTGGCATATATTA 

GGCCTTCAAATGATGATGGATGGATAGACAGATAAAAGGCCCAATGAATGATTGTAAATG 

TGTCTTCTCCTATGTATCCATGATGATATTGCCTAGI I I I 1 AATACAAGCTTAAGAATCA 

ATAAAATGATAGATCTGAAATCTGTAGATGCCAGTCTGGCTCAGTATTAATACTGACAGT 

AGTTTrTTTGTTTTTTMGAAAGCAAGTATATTAAAATATCTCTGT^ 

ATTTGAAAGGATCATTCTACAATCTTAGAACTAAAAAGCAATTTAAGGCAAAGAGGTACC 

TGCCCGGGCGGCCGCTCTAGAACTAG 

Sequence 454 . 

TTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCTGACTTTGTGACTTAGGCGGCTG 

TGTTGCCTATGTAGAGAACACGCTTCACCCCCACTCCCCGTACCTGCCCGGGCGGCCGCC 

CGGGCAGGTACAAGCTATTATATATTTTCATAATATAGAACTTTAAAGTATAGTAATCAA 

CAGTGGCAAAATATTTCTGTCTCTGACATTATTATTTGTGAGATCCAACCTGTAATTGAG 

ATCAGAGAAAACAGTATGGGAAAACAAATCCATGAGAACAGGATAAAAATATCATGAACA 

GCAAAATATGTCTGAGTATTTAAAACTGTATAGAAAAAATAAATTAGATGAACAATTGTG 

TAAATATAAATTAAAAMGATGACATTCTTAATAAATAAAGTTTACTATTTACAI UNA 

TATATTGCCAAAAGATTTAACTTAGATNCATAAAAACCACATAAACAGAA 

Sequence 455 

ACACTNCT ATAGGGCG AATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTC I I I I I 

rTTTTTTTTTTTTrnTI I GGGCGAATAC AG ATAAATTTATTAGTTAAATACTGATTTTC 

CAGCCATTTCACCTTMGACAATGTTMCAGGTTTGNGGGTTAGGGAGGGTATACGAGGG 

GGCCTTTGGAAGAAAACAATGTAMTGATGATTAAAACAGAATCTTGGTTCAAAGGTATT 

CTNTGCTATAGCCAGTAGGATTTTGGAGTGAGGGGTCTGGGCCGTGTGGGGAGGCGTAAT 

AATGCCACAGTCAGCTNCAGCTTTTGNTGAGAAAGAGGAAAGGAGTCTCTTGAGCTCAGC 

ATCAAGGGGCAGAACAGCAATGTGCCANAGGAAGAACGCGGGCAGATCCCGGAGCACTTA 

AAAGTCNGAGGCTTNCTTCAGGCCCCCCTGNCGGGTTTAANCAAGTCGTTTTTNTTTCAN 

GAATTCTGGTGGGGTGGGGT 

Sequence 456 

CGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTTTGATGAAGGAACTGACAGGAAA 

TTAACAGCCTCGTGCAGGCAAGGACAAAGACAAGAAAAGGCCCATTTGGGCCTCATATCT 

CTCTGGATTGGAACCCGCAGGAAGCAGAAGGGGCTGTTTAATCCTGGCTCCAGAGCAACC 

GTGAAGGCTGGCTTCCTCCCTTGMGAAGAACAAGGTGAAGGAAAGAAATCACTGTGTTT 

TCAGCTCAGGGGCCCTGTGACATTCCTTCGTGTTGTCATTTGTTGAGTGACCAATCAGAT 

GGGTGGAGTGTGTTACAGAAATTGGCAGCAAGTATCCAATGGGTGAAAAAGAAGCTAACT 

GGGGG 

Sequence 457 

GAGGTNCTGCATGTTACATATTCTTCTTTAAAATTTTGTAATAMCATTGACAGTGT^ 

GTGGGCACAGGGAAACAGGATAACGTAGAGTCATTACAGAAGAAAAAAACTTATTGCTAA 

CATTGCAGTATTCCTTTTATCAGAATTAGGTGAGTATTGATTGTAAAAGCTCTATCAACT 

CTTGCTCTTATTTG ATG ACTTTG AG AC 1 1111 I ACTCTTGCTATAAAAAGAAGGCTACTT 

TCTTTCCCTAATATATTTCTACCAATGCGAATAATTCAGGAAACAATGAGAGAAAAGTAA 

TTCACACTTAATGTGTTGTTACTTAAGAGATTTGACGGATAAAATAAAATCAAATTAAAT 

CATTGAAAAGGCAGGCTTAGACCCCNTTAAAAACACCGTTAGTTGCCTACCATAATTTGA 

GCACTTTTTCATGGAAAAAAAAAAAAC 

Sequence 458 

CTCNCCGCGGTGGCGGGCCGAGGTNCATTTCCATGGGCCCTGTTC.CCATTGATGTATACT 
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GCTTCCTTACTAACAGTGAGGGATGACTTTCATCAGTCTTTTATCACCTGAACAGTCTTC 

CGGCCATAATGATAGTAACTATAAGCTGATGCAGCTGTGNNGAAAGCTGTAAAACACCTT 

TTATGGAAGAAAAGANATAAAATGTANTTGTCAAGTCTAAAAAATAGTAGCAACGGGAAT 

CATAATGAATACATGCMTGMTTTAAAATGNAAAAATGAATTTAAAAAGTAAAAAGGGC 

TCTGTGGTGTAATTTTTCTTAACTACAA 

Sequence 459 

CGCGGTGGCGGCCGAGGTACAAGCTCCTAAAAGAAGATTACTGCTGCCAACTTAAGTCAT 

CTCCATTAACGAAATTGCATTCTTGTGGCAGAGTTAAAACAACMGAGAAATTCAGTGTT 

T GCTG GTTCTGMTGTCATTTTTCCTCCCTGGTGTC 

GTTTTCTTCTCTCCCACCCTCAAAATTCTGCCTTAGCATTTGTGTGCTTAATTAAATCCA 

CTCTGTGC7TTATTGTTGGAGAATGTGGACAATACAAAGATTTGGGGTGGGGTCATACAG 

TGTATACAAAACACACACACTNTGTGTTTGGACAAATTCGCCTAGCGTGAGAATCATCAG 

TAGTGAGTTTAAAAGTTTGAAAATCAGACCCAACATTTTGGGGTGNTTAAAATATCTCCC 

GCCTTGAA 

Sequence 460 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTTGACGAAGAATG 

AAAAGGTTGTATTAAAGAACTATCAGGACATGGTTGTGGAGGGTTGTGGGTGTCGCTAGT 

ACAGCAAAAAAAAAAAACGTCAAGCCAAACACAAACAGCGGAAACGCCTTAAGTCCAGCT 

GTAAGAGACACCCTTTGTACAAGC I M HI Mi M ill II II i III I i I 1 NGAAGTTTTT 

AGTTTATTACCTGCCCG 

Sequence 461 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAATGTTATGTCGGGAACA 

CGTGCTGCTAACTCACTGGTGAGTTCAATGGCAACGCTTCATTCGGGAGGCTGTTCTGCT 

TTACGCATCTGAGAACTACATAGGAGAGCAAGTGTCTGCACCTCCTAACTGCAGAAGCTA 

CCGTCTTCTCAAAGACGAAGGTCTTTGCAAAGTTCAGTGCTCGGTGTTCTCGGCACAACA 

AT GC N N TGTAGTTC ANAAGGT ATTTTGGCAACTCTTAATCTGAAC AAGAATGGGGGGGGC 

GCTTTTGAAAAATAAGGCTTTAAGAAGGCTTGTCATTTTAGGGCTAAATTT^ 

GTGAG TCTGA ACTCTTACATTTAGMCAAACAAAACCTTAAAATTNCTGATTGGTTCAAA 

AAATGGTTTTATGGAAAAATTAATCTGTAACAAAAAGTTGGCATTTGAGTGCGAAGGCTC 

CCCCGTTGNT TTTTTTTTTGNTTTTT 

Sequence 462 

CTACTTAGGGCGA ATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAGC UllilllHI 

rTTTTTTTTTTn ICCTGTTGCTACTAAGATGTTTCAATTCGCANCGTGTCTCGCTAATT 

TGACTATGGATTCATCAAAATGCAACTGAGGTTTGCTCAGTTAGGTTACCCCATTCGGAA 

ATCNCCGTATCATAGTTTATTTCCAACTCCNCGAAGCTTATCGCAGGTAATCGNGTCCTT 

NATCGACTTTNAACCTGCCC GGGC GGCCGCCCGGGCAGGTCCAAAAACCAACNTGACACA 

CAGGAAAAAATAAAAGTGCMTTTTAATATAGGGAATGNGATNCATGTATAATTCCTCAT 

AACAAAA TGGTC AAAACCTTTAAAAGATCCNCAATAGATTTCTGAAAATCTTAGCAATGC 

TGTATTTNTTTTGNGGACTAAATGGTGAATTTATATTCAAATTGGTCAAATTTATTT^ 

AAACTTTCAGAGCACANATCATTGGTAATTTT 

Sequence 463 

CCGGGCAGGTACACGTGTTCAGTGTGTTGCACATCACAGCCCAGGTTGAACTCAGCCAGC 

AGGATCTTGGCAGCTTCTTCCTGCTTTGCAGGAAGGACAGAGGTTGAGGTAGGAACTTCA 

TAAGCAATTTGGAGAGAGGCTCCTCCCATATCCAGTATCCCTACTGTCCTTCTCCGTCCT 

GCTGCCAATTCCTGGGTAGCCTCAGCATCTGATTCATCCTCGTGGTCGAATCTTCCCAAA 

ACAAAGTTGATTCCAATCCATGCATAAACCCCTTCCTGCTTCCCAGAGATCACTTCTGCT 

TGAGACTGTGAAAAGAGGAAGTCAAA 

Sequence 464 

AGGTACATTTCTTGTTTAGGAGGGTTTTCCTATCTACCTTTCTACTGAAGTAGTTTCTGG 

AACT TTCC TGGTGGATCAGAGTTACGTAATGCAGTCTGAGCCTTCAGACTGCTAGTTAGA 

ATTGTTTTAGGTGTTCAGAAAGGGCAAAATAGGCTGATGTGGCCTGTCAGAGTGATGTGT 

TCTCAAAAAAGTTCACTTGCACATCTGTGGGCCGCTTTTGTCCTCAGACCCTTAGTGGAC 

AGACTCCACAAACCCTCTGATGAGACGATTGATGTGGCCAGGGTCCAGTTAGCATCAGTA 

GAAGGATGTCACTAGGAAAGGCCCAGGTATCTGGTAAGT 

Sequence 465 

CCGCGGTGGCGGCCGAGGTACCTCAAACTCAGAGTTTCTTCCCTTCTTTGATTTTCTGGA 



Page 64 of 635 



WO 01/51628 



PCT/US01/00798 



Table 1 



GGACCTGCAGCTGGCCTTCCTGAGACAGGCTCCATTCCTGTTCCATTTGCCTTCCCGGCA 

GCCTTCCCTTTAGTGGGTATAGGTTTTGACGTTCTGAGTTACTTTGTATCAAAGAGCTAA 

TTAAAAATGGTCCTTCAAAMCATAAAGAAAAACAGCTTGAAAAATGTACCTGCCCGGGC 

GGCCGCCCGGGCAGGTACCAAAAACCAACATGACACACAGGAAAAAATAAAAGTGCAATT 

TTAATATAGTGAATGTGATACATGTATAATTCCTCATAACAAAATGGGCAAAACCTT 

Sequence 466 

CCGGGCAGGTACCTCAGCTATCAACATTTCTGAGCTACCATTCAATGTTCCTCTGTGTCA 
TGGAGTGAMTTCTTGTTTTGTGGGTATTAGGAGTGTGGGAATGTGATAACCTAAACAAC 
CTTTGCTCTGAAATTCCATTTTTCCCTCTTTCCCTGAGTTGTATTGACCTACAGAGTTAA 
TTTCCTTTGTATTTTTTTAAG 

CCGGGCAGGTACTCCAAGATGAGCTTGACGCGGCTGTGCAGCATCTTGATGGCGCTGTGC 

TGTGCTATCAGGTGT 

Sequence 467 

CCGGGCAGGTACTTGGCATATATTAGGCCTTCAAATGATGATGGATGGATAGACAGATAA 

AAGGCCCAATGAATGATTGTAAATGTGTCTTCTCCTATGTATCCATGATGATATTGCCTA 

GTTTTTAATACAAGCTTAAGAATCAATAAAATGATAGATCTGAAATCTGTAGATGCCAGT 

CTGGCTCAGTATTAATACTGACAGTAG I Hill 1 GTTTTTTAAG AAAGC AAGTATATTAA 

AATATCTCTGTTAGAAAAGATTACAATTTGAAAGGATCATTCTACAATCTTAGAACTAAA 

AAGCAATTTAAGGCAAAGAGG 

Sequence 468 

CCGGG'CAGGTACTAGGTTAGAACATTGCTTAATCCTTTTAAAAAANATGCATTNACTGTA 
AACACAGAATACTGAAATTGGNGGATTTTTTAACTATW 

ATTACATTACACATTCATTTANCCCAGATTTCAAATAGGGGGGGAAGAAAGAAACTGTAT 

TTCAGAGTAAAATCTCCTAAAGGAAATANAAACACAGAGTTGTAAATNCACATGCTTGCA 

AAAACATTAGTCGTGAAATCCCTAGCAACAAGTCACTGGATTTTTCTCTGTCAGCACGCG 

TGTCAGCTGCCAAA 

Sequence 469 

AGGTACAAATGTGGTGTGTCTTCCAACTTTCATTGAAAATGCCATATCTATACCATATTT 
TATTCG AGTCACTGATGATGTAATGATATA I ill! I CATTATTATAGTAGAATATTTTTA 
TGGCAAGATATTTGTGGTCTTGATCATACTTATTAAAATAATGCCAAACACCAAATATGA 
ATTTTATGATGTACCAAGAAAGTCGGGATCGTCGGTAAATACGGGACCCGCTATGGGGCC 
TCCCTCCGGAAAATGGTGAAGAAAATTGAAATCAGCCAGCACGCCAAGTACCTGCCCG 
Sequence 470 

AGGTTTGAGCTCCATAGAGACAGCGCCGGGGCAAGTGAGAGCCGGACGGGCACTGGGCGA 

CTCTGTGCCTCGCTGAGGAAAAATAACTAAACATGGGCAAAGGAGATCCTAAGAAGCCGA 

GAGGCAAAATGTCATCATATGCATTTTTTGTG CAAAC TTGTCGGGAGGAGCATAAGAAGA 

AGCACCCAGATGCTTCAGTCAACTTCTCAGAGTTTTCTAAGAAGTGCTCAGAGAGGTGGA 

AGACCATGTCTGCTAAAGAGAAAGGAAAATTTGAAGATATGGCAAAAGCGGACAAGGCCC 

GTTATGAAAGAGAAATGAAAACCTATATCC 

Sequence 471 

ACCGCNGTGGCGGCCGANGTACAGGCTGTGATNCGTGTGGCGATCGATCTTCTTAGATTC 

ACGGTATCTTCTGAGCAGCCGGCGCAAAATCCTCATTCTCCTNATCCATGTGACCTTNTC 

TGGCATTCGGGCATTGGCTGTNCGAATCAAANCACTTACATGAGGGGGCAAAGTCAGAGA 

CAGNTGAGGAGCTGAAGAAGGTGGCTCANGAGCTGGAGGAGAAGCTANNCATTCTCAACA 

ATAANTATAAGATTCTGCNGGCGGNCCAAGAACTGTGACCACAGGGCANGGCATCCACCA 

CCANAGATATGCCTGGCNGGGGCCAGGACAAAATGCAAACTTTNTTTTTTTCTGAGACAG 

AGTCTTGCTCTGTCGCCA 

Sequence 472 

AGGTACAAGCTTACCTTTTAGGGTAGAAAAAGAAAGATCATTTGAAAAATGTATCTAAAA 

TAATCCAGAGAACATAATGTTTGTCTTGGTCTGATAATGATAAGAAGTCAA GGAT TGGCA 

GAGAAAATACTAAACGCCAAGAGTTGAGCCTGTGGGTCTCTCCATAAGAGTTTTAAAACT 

CTTGCCAGTTACCACTTTATCCAATTTGCTATCAT TTTCG TATTATCA GCTA TCGCCCTG 

TAAAATATTCAAAACTAGCTATTTNTAAAGTAAACATTTTATCTGTTACTTT^ 

TAGGTGTCTTTGTCATCCTTCTACTATAAATTGTTCTTTGCCAACCTGTACCTGCCCG 

Sequence 473 

CCGGNCAGGGCTGGTTTGGGGCACAAGGAAGCCTTAGGGTATGGGGAAAGGCTGTTATTA 
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CCTAGAGTTTACTCCCAGGCCAGGGGGCTGCCATCTTCTTCACAGACATCCCTGAAAGGA 
AGCC CCTTTGGGG CAGGG AGGTGAG GACTTCATCTCAACATCGGCTGGTGGTTGGTAGGG 
GAGCI I ( I ICTTTTCTTTCCI I I II II IGTTTTTG1 I I 1 1 GTTTTTGTTTTTGGTAACAT 

GTTAGGAGTCAATGTTGCAMGAGTAGTTTACATCTTCACTTTCTGAAGACACTTGAATT 

TAGGACCGATGTATCTGTGACAAGCATGCCAGAAGTGGCAGGGGCCATCAGGGCTAACCA 

CTTCACACCTACCATCGTCCCATGGGGATCCAAGACCTGAGATAAAGCAACAGCCTGCCC 

AGATCCCTCTGTTCATCCTATCCCTTCCAAGGTTGGTCCATGCCAACATAACCTCTGGGC 

ATCAGACATCAGCAGGTCTGTGTGCCTCAGCCCTGTTAAGGGGCAGGTTTCTCTTTAGCC 

CTCTTCCTGCACTTGGGAGCAAANGCACTACCAGTNGAAAAAGGGCCATTCAGCCGTGCC 

CCCAACCTGGGACCCCTGGGGCTCAAATAGAGGTGCTGAGCCCCTGTGTNAAAAGTTGGT 
AAATGGTT 

Sequence 474 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATTATTTTCAA 

ATATACAAGGATCATTTACCAAGATAAACCATAATTTTGGGCCATAAMCATTTAGGATT 

GAAAATACAGAGCAAATTCTCTAACCACAATAAATTTAAAAAATCGATAGCCAAAAGATA 

ACTGAAAACTCCCCCAAATGTTTGAAAATTAACACATTTCTAGAAAAGTCATAGATCAAA 

ATTAAATTACAAGGGAAATCTGACACTATTTTGAACTACGCGTTCATGAAAACACATCAA 

AACTTGTGAAATGCAGCTAAAACAGTGCTTGGAGAGAGAAAAGTTCTAAAATTACTAATA 

TAAACTTCTACCTTAAAAAAGCTAGAAGAAAAGCAAACTCAAAGTAGAAGGAAAGAAATA 

AAGATAACTGCAGAAATCAATGCGATAGAAAATAAGCAAATAGAGAAAATTAGTAAAACC 

AAAAGTTG CTCCTTTAAATGATCAATAAA 

Sequence 475 

CCGCGGTGGCGGCCGCCCGGNCAGGGCTGGTTTGGGGCACAAGGAAGCCTTAGGGTATGG 

GGAAAGGCTGTTATTACCTAGAGTTTACTCCCAGGCCAGGGGGCTGCCATCTTCTTCACA 

GACATCCCTGAAAGGAAGCCCCTTTGGGGCAGGGAGGTGAGGACTTCATCTCAACATCGG 

CT GGTGG TTGGTAGGGGAGCTTTTTCTTTTCTTTCC I II I 1 If I GTTTTTGTTTTTGTTT 

TTGTTTTTGGTAACATGTTAGGAGTCAATGTTGCAAAGAGTAGTTTACATCTTCACTT^ 

TGAAGACACTTGAATTTAGGACCGATGTATCTGTGACAAGCATGCCATAAGTGGNAGGGG 

CCATCAGGGCTAACCACTTCACACCTACCATCGTCCCATGGGGATCCAAGACCTGAGATA 

AAGCAACAGCCTGCCCAGATCCCTCTGTTCATCCTATCCCTTCAAGGTTGGGTCCATG 

Sequence 476 

AGGTACTGTTCTATACTATTCAGGTATCTTTTTATTTCTGATAGTTTTATATTATAATAG 

AAAGCCAGCCACTGCTTAGCTATCATAGTCACCATTTTCTCACTGTTAACATTAGGAAAA 

TCAAGGCTACTATGCTTCAGGATTGTCTGGTTAAATAGTATGGGAAAAAAACTGAAGAGT 

TTCACATAATTACACAGGTGAAATAATTAMGCTTAMCTGAATTTGTATTTCATTTTA^ 

TG TCAG ATGGTGGTGTTCACCAGCCTGTATCTTGTCTGAGACTGCATTCGTATCTGAGCA 

GGTTTTCTATGCCTACTGATGTCAGTATGTTTATACTMCCTTCATGCTTTTTTCCCAGA 

ATCCCTCATCTGCCAGAAAACTTGAAAAGTTTATTGC7TGTAGAGTTGTACCTGCCCG 

Sequence 477 

AGGGCGAATTGGAGCTCGCCGCGGTGGCGGCCGAGGTACTCAGCAACAACTTAAGTGTTT 

TATCCTTACGGTCAGCTTTGGTTAAAACAGTAACATAACAAAATTAATAGAATCATGCAT 

CGCAGGTGATAAATGATTCAAAATGACAGGAAATCAATTTAATTAAGAAATAGCAAGACT 

TAAAAAATGCATAGGGTTGAGCACCCGTATTATTGAGGGTGGATAATTTAAATGTATAGA 

AGAAA TCAGTTCAGAGTTTMTGATTGAATAACATTCTCTtTAGCAACTATGAGTGTGAA 

ATTTTGGTTAATGATACCGAAAATATTAGGTTATAGGAAGTTACAGTATTACATAAGGTG 

AGCAAAATAATCTAATCCAATTAATATAACGCCATCTGGGGGCT 

Sequence 478 

CCGGGCAGGTACATTCCAGGCAAAAACAGGGATCTCAAGGNGGTCAAGAAGAACATTCTG 

GATAGCAAGCCCACTGCAAACAAGAAGTGCGACCTGATCAGCATCCCCAAGAAAACCACA 

GACACGGCCAGTGTGCAAAATGAAGCCAAGTTGGATGAGATTTTAAAAGAGATCAAATCT 

ATAAAAGACACAATCTGCAATCAAGATGAGCGTATTTCCAAGTTAGAACAGCAGATGGCA 

AAGATAGCAGCCTGAAGGTCCCACCCCCACCCCTACAGAAAAAATGGGAGCAAGAACTTG 

TGCTTGGGAGCTGGTTATTGGTGTGGTCCTAGGGAGGGCGGAAAGGGAGGCACTGCCATT 

TGGAGACATTCCATTTCAGATTTGTCAACCAGCGATAGGCCACATTCCAGTAAGAACTCA 

ATT TGTCTC CCA AATTT GCAGAAACAAAACGTGATTTAAMGCTGAGCTTTTTATCAGAA 

AGCl nil iGATGTTTTAAGTGTTATGTGACTTGTTGAACTTTTTAAAAAGTGCTNCTTT 
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TAAAATCCCAGATACTCTGAATTTTAGAAAACAAACTAATTCTGATTGNGTCGTGCCCAA 

GTACCTNGGCCGTCTAGAACTAGTGGATCCCC 

Sequence 479 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAGAAACCCACCTCACCC 

CGGCTCACATCTAMGGGGCGGGGCCGTGGTCTGGTTCTGACTTTGTGTTTTTGTGCCCT 

CCTGGGGACCAGAATCTCCTTTCGGAATGAATGTTCATGGAAGAGGCTNCTCTGAGGGCA 

AGAAACCTGTTrTAGTGCTGCATTCGACATGGAAAAGTCCTTTTAACCTGTGCTTGCATC 

CTCCTTTNCTTCTNCTCTACAATCCATCTCTTTTAAGTTGACAGTGACTATGTCAAGTCT 

AATCTCTTGTTTGCCAGGGTTCCTAAATTAATTCACTTAACCATGATGCAAA 
Sequence 480 

AGGTACAGGTTGMCATTCCGTTCCAACAATCCAAAACCCCAAATGCTCCAAAATCTTAA 

ACTTTGAGTTCCAGCATTCCGTCAGAAGTGGAAAA1TTTACACCTGACCTCATTTGACAG 

GTGGCAGTTACAGTGCAATCAAAACTTTGTTCCATGCCCAAGATTATTAAAAATATTGGA 

TAAAATTACCTTCAAGCTATGTTGTATAAGGTGTATATAAAACAAATGAATTTTGTGTTT 

AGACTTGGGTCTACTCCCAAGATACCTCATTATATATATACAAAAATCCCCAAATCCAAA 

MCATTTGAAACAC7TCTGGT CCCA CACATTTTGGATAAGGGATATTTGACCTATATCTA 

ATTATATAATAAGAAAAATGTATTTTC I rTTTTTTTTTTTI I GAGATGGAGTTTCACTCT 

TGTTGTCGCCCAGGCTGCAATGGCACAATCTTGGCTTACTACAACCTCCGCCTCCCTAGT 

TCAAGGGATTCTCCTGCCTCAGCCTCCCAGGTAGCTGGGATTACAGGCATGTGCCACCAT 
GCCTGGCTAAT 

Sequence 481 

CCGGGCAGGTGGCAGCTCTGGCCCTGCTCTTTGCCATCCACAGCTACAAGCCAGCCCCCT 

TCTTCGTCCTGGATGAGATTGATGCTGCCTTGGATAACACCAACATTGGCAAGGTGGCAA 

ATTACATCA.^GGAGCAGTCGACTTGCAACTTCCAGGCCATCGTCATCTCTCTCAAGGAGG 

AGTTCTACACCAAGGCCGAGAGCCTCATTGGAGTCTATCCTGAGCAAGGGGACTGTGTGA 

TCAGCAAAGTCCTGACCTTCGACCTCACCAAATTTCCTGTTAATCGAAATTTAACCCGAA 

AGGCTGGATACCTTAATGCTAGGAATAAAACAGGCTTGGTGTCATCTACCTGGGACAGAC 

AGTTTTACTTCACGCAGGGTGGAAATTTAATGAGTCAGGCCCGTGGGGATGTAGCAGGAG 

GCCTGGCCATGGACATAGACAACTGTTCAGTGATGGCTGTGGACTGTGAAGACAGACGAT 

ATTGTTTTCAGATCACCTCTTTCGATGGAAAAAAATCTTCAATTTTGCAAGCAGAGAGTA 

AAAAAGATCATGAAGAGTGGATCTGTACCT 

Sequence 482 

CCGGGCAGGTACAGAAGAGAGATGACGCCTCCGGCTGGGGAGGGAGGGGCTGGCGCCTCT 

GAGAGAGCCCATTTATTATCAGGGTCACGGCCAAGACCGTGGGCTTGGTTTCTAGGTAGG 

TGGACTTTGAGGGGCAAAGACAACCCACTTTTCCTGCCCCCTACCCCCGGCTCTTAGAGA 

CCATTTAAATCCATATCCAAAGTATATCCAGAGCCACCTGGGACCATCAGGCCTCTGGCT 

AGTCTTTCTGGGCCAAACCTGTCCTCCACCCTGAGGACCCTCAGGGAACCAGGAAGGTGA 

GGGGCTGGCTCCGCCCCACCTGTAGCACAAGAGCTGGCCCAGGAGGAGGCTGAAGATGTG 

TTTCTAGTTCCTGCATGCTGCTTTGTTGTGTGACCTCGGACATGTCACCTCCAGCCTTCT 

TCACCCACCTCACCTATCAACTAGGGCTAGTAATTGCCTGTGGTGGGTGGGCTGGGGGGC 

TCTGATGCCAAAATGCAGGTAGAAGACACACTGTGAGAGGCAAACGTCTTCACTCATCCT 

GCAGGGAAAGGCCAAGTGAGCATCTGGGACCCATTCGAGCTGAATGACCCCCAC 

Sequence 483 

ccgggcaggtacgcaagcctcttctctcccttgatgtggtagctacaggcagttacattc 

ctttgctgcttgtgagaagcttacattttggcattttcttccaaaaattaccacgt^ 

camgtaaacattacaagaatatgaacttgYtattgggggaaaggggaagtgaagcaatc 

tgtag aaaat atttaactg aaattac aaac ataagg ac aag cctttaaagc aaattatac 

aggcatttttgtcctctctctggtccctgcagttaattaacagggccacagataatctga 

catctttagataamcmctacacacctatattacttaagttaaagccagtaagttcaaa 

aamgcaattacmccatttgacatgtttgtatttttcatctctaactcctgatttccta 

aattacattcagtcatacaaacatggctgaacaacagcaaatgggatctgactctgggat 

gtttacttcttgggattaamttcccaaacactttggtgtctctttcttgtgaatctata 

caacatttccaataaatgtttaataatgattttttaaaagatattgccaaa/^ 

gatttaagatgaaataaaaac 

Sequence 484 

CCGGGCAGGTGGCCTGTTCACTTTCTCTTACTCACTGTCTATTCACTTGTCCTGTTCACT 
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CGTCTGGAAGATCTCAGCCAGCACCATGACTGACAATGAGCTGTCTGCCTTGGTAGTGGA 

TAATGGGTCAGGGATGTGCAAGGCAGGCTTTGGTGGTGACGATGCCCCCCGGGCTGTGTT 

CCCCTCCATGATAGGGCGTCCTCGACACCAGGGCGTTATGGTAGGCATGGGCCAGAAGGA 

CTGCTACGTGGGAGATGAGGCTCAGAGCAAGAGAGGCGTCCTGACCCTGAAGTATCCTAT 

CGAGCATGGAGTGGTCACCAACTGGGACGATATGGAAAAGATCTGGTACCT 

Sequence 485 

NCGAGGTGGACAANAAACACGNATGGCTAGGANAAACTATCAATGCTGGCAGCCAGNTTG 

AATANAATGTGGAAGGAGTGACTTNCAAGGAAANGGCTACCCAACTNGCCTNCATGCGCC 

TGCTGGCCAACTATGCCTCTCANAACATCACCTACCACTTGCAAGAACAGCATTGCATAC 

ATGGATGAGGAGACTGGCAACCTGAAAAAGGCTGTCATTCTACAGGGCTCTAATGATGTT 

GMCTTGTTTGCTGAGGGGCMCAGCAGGTTNCACCTTACACTGTTTCTTTGTAAGATGG 

CCTGCTCTAAAAAAGACAAAATGAATNGGGGAAAAGACAACTCATTTGAATACAAANACA 

AAAATMGCCCATCACCGCNC TGCC CTTNCTTGANATTGGCACCCTTTTGGACATCGGGG 

GGTGCCTGACNAAGGAMTTCTTTTGCNGGACATNNGGCCCANTNCCTNGTTNCAAAANA 

AAAN TGAAC CTC AAATC TAAATTAAAAAAAAGG AAAAG AAAAATTTTG N AAAAAAACTTT 

TCCTCTTTTNGCCNAGNGNCCTTNGTTTCNTCCCTTTT 

Sequence 486 

AGGTNNTGACACACMNTACANACATAGTCCCACTTAGGGTGNTGATATTNCTTTAAAAA 
AAAAAACAANGGGCTCACAAACAATAAGACATCACAAATAAAACATCNAGAACCACANTT 
GCCAANAAATGCCAAGACCAATATGCTACTGAAACAACTGGTAGAAATGTCACANACAAN 
NTATTC AN N ATGTATTG NAATAATTACTTGNAGTTAANG AN NGCG AAAGGTAAAACC AAA 
CNGGGANA AGAA NTAACTCCTTGTAAACTGGGCTCTTGGGNTNNAAAGAGGATCTTTTNA 
CAAN NTTANTTXTNTAAACCATTACCCTTTGTTTATAAAAAAAAANGTAN CCAAANCTAC 
AATGGTNNACCCCGGANGGGGA 
Sequence 487 

TACGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGAGACAGGCAAGCTCGGATTCA 

ACTACCTTAGATAATATTTCTGAAAACCTACCAGCCNGAGGGTAGGGCACAAAGATGGAT 

GTAATGCACTTTGGGAGGCCAAGGCGGGAGGATTGCTTGAGCCCAGGAGTTCAAAGACCA 

GCCTGGGCAACATACCAAGACCCCCGTCTCTTTAAAAATATATATATTTTAAATATACTT 

AAATATATATTTCTAATATCTTTAAATATATATATATATTTTAAAGACCAATTTATGGGA 

GAATTGCACACAGATGTGAAATGAATGTAATCTAATAGAAGCCTAATCAGCCCACCATGT 

TCTNCACTGGAAAAATCCTCTTTCTTTGGGGGTTTTTCTTTCTTTCTTT^ 

ACTGGGACGGTGGACGTCAGCCCATGTACCTGCCCCGGGCCGGNCGCTCTAGACCTAGGG 

nnnnnnnnnnnnnnnnnnnnnnnnnGATATCAAGCTTATCGATACCCGTCGACCTCGANG 

GGGGGGGC 

Sequence 488 

GANTTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAANGCAAGAAAGCTGGGG 

GGCTCCATGTTCACTGCCAACCCATGGATCTGTATATCAGGAGAATTGGGTGAGACACAG 

ATCATGCAGATTCCCAGGAATGTGCTAGAGATGACCTTNGAGTGCCAGAACTTGGGGAAG 

CTTACTACTGTCCAGATTGGCCATGATAACTCTGGGCTGTATGCCAAATGGCTGGTGGAG 

TATGTGATGGTCAGGAATGAGATCACAGGACATACCTACAAGTTCCCGTGTGGCCGGTGG 

TTAGGGAAGGGCATGGATGATGGAAGCCTGGA 

Sequence 489 

CCGGGCAGGTACAAGGTAGTGGAATTATNTCTTATATTGCTTTTTCCAAAAAGTAAAAAC 

CAAAAATGTGGGCNCGGGTGTGGTGGCTCATGCCTGTAATCCCAAGCACTTTGGGAGGCT 

GANGCAGGCGAATCACANNGCCAGGAAGTTCAAAGACCAGCCTGCCCAACACAGTGAAAC 

CCCGCNCTNTACTAAAAATACAAAATTAGCCAGATGTGGTGGCGCACACCTGTAATCCCA 

GCTACTCAGGGAAGCCTGAGGCAGGAAAATCGCTTGAACCCTGGGNAGGTGGGAAGTTGG 

CAAGTGGAAGCCCGAGAATCGCACCACTTGCACTTCCAAGCCTNGGGTTGGACAAAGAAG 

CCGAAAACTTCCANCTTNAAAAAAAAAAAAAAAAAACCCAAAAGTGGAAAACACTTAAAA 

AAATTC C C C TAATAG ATAATATTTC C C AAG G AAAATT ATTTTT AAATT 

Sequence 490 

CCGGGCAGGTACCAACATTTATTAATCCTCACAACACCCCTGTGAGGTAGGTCAGTATGG 
NCTTTAGAGTCGNGAACTGAGGCAGAGGTCAAGCAAACCTGCCCTGGGCCACAGAGCAGC 
ACGATGAAGGGCCTANACCTGGATCCAGAAGCTAGGGGCTCTTCGGNCCAGCATTCATCC 
ACTGGTGGACATNACATGGGGCTTATTTTTACCAAGCCGAAGGTTACCGTGAAGGGACAA 
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AACCGCACTCANCCAGCAACCGGGAAACTCAAACAGTTCAAACAGCACCTGGGGGGAACA 

TGTTCAGTGTAAAAGGAGACGAAAACCGCTGACCAGCTCATGAATGGAGGGCAAGGACAA 

CCATGGCGGCTTGAGGGAAAAGTGTGGGGAATCAATCACCGACCTGGGGGGATTTAGGAA 

CCCAAAGAAACCGGGGGCGGCTTCAGGGGAGGGGGTTTCAAGGGAAAAATGGTTAAAACA 

AAACCTNAGGGAACCCCCATCCCAAGGGGGGGGGACCAGGGCCCCAAAATGGCCTTACCG 

GNTCTTNCAAAATGGGTAGAAAAAAATTTTGAAAAAAGGAATAGGNGGGNAAAA 

Sequence 491 

TCAGATCCAAAGCCAGCACGGCACCGGGTTTCTTGCCTAAGGTCCATGGTGACCACTCCT 

TGGCTATTGCCAATGTTCACTCAAGGCCCAAGCACTCTTTATTCAGCTGGTGGCAAATCC 

AGCCAGGCTTGTGTTTTTCCCTTCAGTATGACGAGCTCCCCCTAAGCCCAAGACAAGTCC 

TAAAATGTCATNTGGGAGC C AGGACCTGGAATC AGGATC CTTAGGAATTTAC TTGGTGCT 

GTATTCTACTATGGCTGAGCTGGCAATCAAAGTGGCAAAGATTAAAGCCCTTTATTTTCT 

TTG NTCTCCTTTCCTC 

Sequence 492 

AACGCGGNGNGGGACTTGNGGGCCAAGCAGGAGGGAAAGCCTGTCTGGAAGTTACNTGTG 

GACATGGNGNCCAGGACGCTGGTATCCTGCATAGATTTCAGGTACCTCGGCCGCTCTAGA 

ACTAAGTGGATCnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnNGATACCCGTCGA 

CCTCGAGGGGGGGGCCCGGNACCnnnnnnnnnnnnnnnnnnnnnnnAGGGGTTAATTGCC 

GCGCTTGGCGTMTTCATGGGNCAATMGCTTGTTNTCCTGTNGTGAAAAATTGTTATCG 

CGCTTCACAATTTCCACACAAACAATACGAAGCNCGGGNAAGCATAAAAGGTGGTANAAA 

GCCTGGGG 

Sequence 493 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTTTTGTGGGGTGCAG 

TGAATGAAACTAAAGAGTTGTTGCACCTGAAAAAGACACCAGTGATTGGGTATTGGGTAG 

TCGATTGGAGACTTTGA I II II I ! II I I I I I CACATGGGAAAGTCTTAGATNCCAAGTAG 

ATGGAATTNGCAATCCTTTGCAGTTTGTTCTCATTACAGATAATCAACATTTNTTTTGTA 

TAATTCAATACATGATCAAGTTAATTGGTAAA \ \ I I I ATTG AAATTT AAAACTTG C ATTT 

TTATATATGTGTATGTGTACCT 

Sequence 494 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAACTTGAATGATATTTNGA 

CCACATCCATGCAGGGTGCTNACACTGNGACATNNGGAACTGATATCAGGTCAGCTCTCT 

GCCTGNCTCANTATGCATCACTATATCTCATCAATATGAATCATATTGATANCNGAAGGN 

GNTGAAACAAATGAAGCNGGCNTTTATATATTAACTGTGTGTNNAGTGCGTATAGAACAT 

GGGNTCACATTCACTAATATTATGTGTATTCATACACATGCACAAAACCTT 

Sequence 495 

GGTACTTTACCCTTCAACGGGACANGAAACTGCNCCAGAAGTCAACTTCACATTTGAAGG 

AGAAACGGGAAAGGNTCCANGATTGAAGAAGACAACACATTTTATCAAAGACTTAAGTCC 

ATGAAAGGAACNCGNTAGAAGCACANAATATTCCAGACAATTTTGGAAATGTATCTCCAG 

AAATGACGCTCGN NCTACATTTAGCCTGGGN CTGCCTGNGGCTTATATAAN ATTGGCAAG 

AAATTCCTACTTGAAGACACAATGGTATAAAAATGGGNNAAAAAATTNAANACTGTCAAG 

CCAAGGTGGCAAMGCAMTGANNGACCTTTTATTGGAAT7TAAGANTACACCATTCTAC 

TTTCATTAAAT 

Sequence 496 

AGGTACTGTTCTACCAACAGCTCTGATTTGGTGAGTGCATTTTCTACGCCACTGCTCACG 

AGCTGCATNATCCGACTCCCCAAGACTGTGTTAATGCTGCCACTGACCACAGACTTGGTC 

TTCTCCACACTGCCAGTCACTGCCCCTTTGGTCTTGTCCATCACCCCTGTGATCGTGCTG 

GCCACAGAATCCTTGGCCCCAGTCACAGTANGTCCGTCACAGCATCTTTTTGCCCCAGTC 

ACAGCGCCTTTGGCATTTGGGCAACAAATCTTGAGTTTGATGGGGCTGATTCAAGAAATA 

AGGCAAGTTCCTCTCCTCAATCCCTGGTCTAAGCCCCCTTACAGGGCATAAGGGTATTGG 

GGCCAACTGGCAAATTTTGCGGGCTTCCTAGCTTTCTGGGAATGAATGGGGCCAAGAAGC 

ACTGGGTCATGGGCCCACCGGGAGGGTGAATGGTCTTTCAACAACCGTTTCCTCNTGCCC 

ATCTTCACAACAACAGGAACTTTCAAGGGTAGGGGAATACCTGGGGTCCCTTTGGTAACC 

TTGCCCCCGGGGCGGGCCCGCTTCTTAGAAACTAAGGNGGGAATCCC 

Sequence 497 

GAGCTCCACCGCNGTGGCGGCCGAGGTACTTTTCTACCTTAAAAAAATCAGTGAGGATAT 
TTATTTAATGAAC ATC AATTCCTTTTAAATTTTCTTAGAG AAATN G NCTCTGTGGCTC AG 
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TTTTACCACCCATAAAGCGGAGACAGTAATTTATGGTTATTCCTTTCTGACCCACAAAGT 

ATGAAAAGTTC TTGTA ANCTGTAAACTCAGTTCTGNAATCTGCATTATTGAGATGATTAA 

TATAMGTTGTATTTTCACTGAAATG^ 

GCCGGGCTTA I Mill IGAAAG 

Sequence 498 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAGTTAATCATTTT 

GACTGTGTCAGTAATATAATCAGAATACTGTAACGGTAAGAGAAAAATAC I I I I I i 1 1 I I 

TTTGAGACTGGGTCTCGCTCTGTCGCCCAAGCTGGAATGCAGTGGCGCAATCACACCTCA 

CTGCAGCCTCAACCTCCTGGGCTCAAGGATCCTCTAGCCTCAGCCTCCTGAGTAGCTGGA 

ACTACATGCATGCACCACCACAGTCMGCTNATTTTTTTTAAATTTATTTTTAGTAGAGA 

CAGGGTCTCACTATGTTGCCTAGGCTAGTCTTGAACTCCTGGGCTCAAGTGATCCTCCTG 

CCTNAG CCTN CCAAAGTGCTGCGATTACAAGCGTGAGCCACCGCCTGGCCCGAAAAATAC 

CTTGGGTTTTAATCAATTCCCATTCANCAATAGTGAAAAATGCTNTATGGTCCTTCCGGC 

CCNTTTAAGAAACTAGGTGGGATTCCCCCC 

Sequence 499 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTACCTGGGGGGGTTTGCT 

TTCCTGCCTTTTCTCTGGTTGGTCAACATCTTCTGGTTCTTCCGAGAGGCCTTCCTTGTC 

CCAGCCTACACAGAACAGAGCCAAATCAAAGGCTAGCTGGAGATGGCCAAAGCCCGGAAC 

CAACTGGATGCTGTCTTGCAGTGTCTGCTGGAGAAGAGTCACATGGACAGGGAGCGTCTG 

GATGAGGAAGCTGGGAAAACACCCTCAGACACCCACAATAAGGACTGCTCCATCGCAGCC 

ACTGGCAAAAGGCCATCTGCCCGCTTCCCCCACCAGCGGAGGAAGAAGAGGAGGGAGATG 

GATGATGGGCTGGCTGAGGGAGGGCCGCAGCGATCCAACACATATGTGATCAAGCTGTTC 

GACCGGAGCGTGGACTTGGCCCAGTTCAGCGAGAACACGCCACTGTACCTGCCCG 

Sequence 500 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACANGAGCTCAGACCCGGGGAGAC 

CCTCAACGTCAACTTCCTCCTGCGAATGGACCGCGCCCACGNGGCCNNGATCCGCTACTA 

CACCTACCTGATCATGAACAAGGGCAGGCTGTTGAAGGCNGGACGCCAGGTGCGAGAGCC 

CGGCCAGGACCTGGTGGTGCTGNCCCTGTCCATCACCACCGACTTNATCCCTTCCTTCCG 

CCTGGTGGCGTACCTGCCCGGGCGGCCGAGGTACTTNCTACCACTGCTGCATGAGTATAA 

TGCTCCGGAATTATCAGAAAGCATAATGCAGAAATACGAATTANTGGAACTTAATCATGT 

GCCANATAAGCTTACCTAACAAACAGTTATATCCCTATTCCTCAACTGAATGTCTTGTAA 

TAAATAAG AATTN ATC ATTTATTTTCTG C AAC I I I II I I ATGTCTCATTCTTTAAACATT 

MNMCCCTTAATATTTTMCAAAAATTTCTTGATTAGAGGCACAGTGAAAAAAGAAGTC 

NAAA 

Sequence 501 

AGGTACNGTGTGGCTTGGTCACTTCGTGGCTAAGGTAAGAACGTGCTTGTGGAAGACAAG 

TCTGTGGNTTGGTGAGTCTGTGTGGCCAGCAGCCTCTGATCTGTGCAGGGTATTAACGTG 

TCAGGGCTGMGTGTTCTTGGGATTTCTTCTAAGAGGGCTGGCAAAGMCCANGTTGTTT 

TTGTCTTGCGGGTCTGTCAAGGGTTGGAAAGTCCAAGCCGTAGGACCCAAGTTTCCTTTC 

TTAAGCTGATGTCTTTTGGCCAAGAACACCCGTGGGGCCTGTTACTTTGI I I I I IGAGTT 

GGGAAGCCCGGGTTTGCAATTTTANCGCCCCTGTAAAAMTGGTATTTCAATTTCTTAAA 

TTTTTAATGGTNAAANGGN \ lltllllll IGGTAACCCTTNGCCCCGGGGGCGGGCCCGC 

TTCTTANAAACCTTAAGTGGGGAATCCCCCCCCGGGGCTTGCAAGGGAAAAATTTCGATA 

ATCAAAAGCCTTTTAATTCGAAATAACCCCCTNCNAACCNTTCCAAAGGGGGGGG 

Sequence 502 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATGAACATACACACACAC 

ACACACACACACACACACACAGAGGCCAATTTCCAGTGAGGAACAGGCTCCACAGAAGAA 

GCTGATAGGCAGGGGGGATCAAACGCTGCTCCCCTGCTCACCGATCTACTTCTCCAAATA 

CAATATCTTGGGTACTTACAACTATAAAAAGATACAATAACTC T ACCAATTATAAATAAT 

GTAGCATTTCATATTAAAGACATTATCGTACCTGCCCG 

Sequence 503 

CTGTTCCTACGGCAGTCAAGAGGCCCAGGCTCTGTGGGCTCCAGCTCTGCATTTCCTGGT 

TCTGGGGNTGGGGCTGGGATGACTTCCTGTTGGACTTGCTGCTGGGACTGGAACTGGAAC 

TGTTCCTCGGAGGGCCGAGGAGTCACCTCTGCAGCCAGGGGAGGATAAGGGGGTCTGCTC 

CCTCTACCCCTCCCAGGGGGTCTCCCCACCCCAGCTGCCGGCCCGTCCTATCCTACCCCT 

AGCCCGTTACCTTGATAATCATAGTAGTCTGGGTTGTCGATCTGGTCGCTATAGTGGGTG 
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TACCTGCCCG 
Sequence 504 

AGGTNCATTCAGAATCCCACGAGCAATTCAGATAGACCAGTCACCAGGCCTCTAGCTAAA 

AGAGCACTGAAATACACAGATGAAAAAGAGACGGAGGGTTCTAAGCCAACAAAAACTCCT 

ACCACTACACCACCTGAAACTCAGCAGTCACCTCATCTTAGCCTGAAGGATATCACCAAT 

GTCTCC7TGTATCCTGTTGTGAAAATCAGAAGAC7TTCTCTTTCNTCCAAAAAAGAATAA 

AGCCAAGCCCAGCAGNGGGCTCTGCCTAAACCGTAGGGGGGCACAANGCCAGCCGTNGAA 

CTATTAAG GGAG GCC CACN CCTTGCTTTTGNAAAACTGAAGAAAAGAGGGGGGACCCCTT 

TTACAGAATTTTGNTGTnTTTTGGAAATTCTCCCTATTTTTCAAGCANAA 

TTN AGGACCGTTTCTAAAAAAAMGG NATTGAAACCAAAATACC 

Sequence 505 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAG GTAC ATGTNTTTGTCACTTAAA 
GGTNCTTTCTGTAANCTGCTTCAGATNCTTTNACTATTCAATTTTTAATNCTTAATNTCT 
GTAMG A ACGTT MTATTCCTCTTTATAATCAATCTTTCCCAGTTN GCCTN AAAAATGTA 
TTCCCTACTTTTGCTTCAGGAGATCAT 
Sequence 506 

GCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAGAGAAGGCACTGAATA 

AATTCACAAAGGCCGATTGGTTCACCCATTCTTTTAGNGACAACAGACACGCAATTCTGA 

CGAGGACTCCTGTTACTAAAAGACACAGCCTCTGATACAAGAGAGATATCCCTTTGACTA 

AAGCATTACCAGGGTCCCCAGGGCCCCCTCCCACTGGGGCGGNAACACTACGGGTCTCCC 

CACCATATATTCCATGTCAAAGTATCTACACAAATACAGAGGAAATTAAGCAAGTAAATA 

CGGTATGTMTTGTTATCATTTGTATTC 

AACMTATGAGTGAGTGCCTTTCATTAGCTATGATCTTTCATACTGATATAT 
Sequence 507 

GGNGAATNGGAGCTNCACCGGGGNGGCGGCCGCCCGGGCAGGTACATAACATTTCAAATA 

TAAGTGGAAGGATCATCAGTAGTGTTATCAAAATGCATAGGNACAGANNCTTTTTAAGAA 

AGGATAAAAAATTACACTCAGGACCCATAACTCTTCCTCATTATAAGCATATGTAGTGAT 

TCATTCATGCAGGTTTTTATATGTAGATAAGGA I I I I 1 I 1 I I NCTTTTCAAGAATTCCAT 

TGTANGCCATGAAGATGAAAAATTGTATTATGGGNAATNGGTATAGCTTTTCTTCTATTT 

TGCTTTCAGTGTTMGGGATTTGCCTAAAAGGCCTTATTTTAAAAATTCCCCAACCTGAA 

CAATAAATNG N NGTTTTTCC AAATAAAN GGGAGGGAACGCTTN NCCTAACCTCCCGGGG A 

AGACGGGGGGGCCCNGGNTTTCCCCTTGGANGCCAAACCCTTTNGGNTCCGNTTTCTAAG 

AAACATAAGTGGGGANTNCCCCCCCGGGGGCTTNGCAAAGGGAAATTNTCGNNNTATTCC 

AAAGGCTTTTAATCCGGAANANCCCGNGGCGGANCCCCCNTNANGGGGGGGG 

Sequence 508 

ATNGGAGCTCCCCGCGGTGGCGGCCGCCGCCTCCAGCCGTGTGCCGCTATGGGAGTCCCG 

GCGTTCTTCCGCTGGCTCAGCCGCAAGTACCCGGCCATCATAGTCAACTGCGTGGAAGAG 

AAGCCAAAAGAATGCAATGGTGTAAAGATTCCAGTTGATGCCAGTAAACCTAATCCAAAT 

GATGTGGAGTTTGATAATCTGTATTTGGATATGAATGGAATCATCCATCCCTGTACTCAT 

CCTGAAGAC AAACC AGCACCAAAAAATGAAGATGAAATGATGGTTGCAATTTTTGAGTAC 

ATTGACAGACTTTTCAGTATTGTAAGACCAAGAAGACTTCTCTACATGGCAATAGATGGA 

GTGGCACCCACGTGCTAAAATGAACCAGCAGCGTTCAAGGGAGGTTCAGGGCATCAAAAG 

AAGGAATGGAAGCCAGCAGTCGAGAAACAGCGAAGTCAGGGAAAGAAATATTGGCAAAAA 

GGTGGCTTTTCTTTCCT 

Sequence 509 

TATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTCGCTTGGAACCTT 

TTGGGGATGTGACAGAGAGGAAGCAGCTCCGGGACAAGCTCCAGTGTAAAGACTTCAAGT 

GGTTCTTGGAGACTGTGTATCCAGAACTGCATGTGCCTGAGGACAGGCCTGGTTTCTTCG 

GGATGCTCCAGAACAAAGGACTAACAGACTACTGCTTTGACTATAACCCTCCCGATGAAA 

ACCAGATTGTGGGACACCAGGTCATTCTGTACCTCGGCCGCCCGGGCAGGTCTTTTTTTT 

rTTTTTTTrn rm iiiiinimiiimi i ccatggnaaatttngcttttntt 

Sequence 510 

acttagggcgaattggagctccccgcggtggcggccgcccgggcaggtacctaacctacc 
tttaagactgggataactattggaaacaatagctaataccggatatagttatttatcgca 
tgatgagtaatagaaaggagcttcacagcttcacttaaaaatgggggtgccggaacatta 
gttagttggtagggtaatggcctaccaagacgatgatgtttagccgggccgagaggctgt 



Page 71 of 635 



WO 01/51628 PCT/US01/00798 



Table 1 



ACCT 

Sequence 51 1 

TTAGGGCGAATTGG AGCTCCCCGCGGTGGCGGCCG NN G N NC NG NTAGCCATN AATCATAN 

ATATAGGANACAGCANCTATACANCTGACATNACCANTTAAtNTTATATNATGAAAGCAN 

NNCATNTGCNTGTGCATCAAGGCCAGTCCTATTCAACCNANCTNTCGAATGCTGATANCT 

GGATAGNATGTCATNNTGAAGNTGNCACATAACTTNTCTAAAAAAAaGCANTCTTTGTTG 

NNTGCTTCTTCCCTACNGATGACTTCTAAAAATATATGACAGGGTATAAAAAAATTAGCT 

ATACATGATCATATCAACACATGTAACTGCTGAAATGGCATTCTA 

Sequence 512 

CCGGGCAGGTACAGGTTCAAATGAAATTCAGGTTGTTGCAGGAGACCATGTACATNTTGC 
AGTATGGGCCGGCGAGTTATGTTAATATGCAAGGTTAAGCAGAAAAAAGCGGANCCGTAG 
GGAAACCG AGTNTGAATAN GG GCG ACTTTAGTATATTGGCATATACCCCCC 
Sequence 513 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGCTAGCACCATCTGCTAGACTT 

GAAAAATCTACT ATGAT ATCAATTATTTTAGACTTCTTGTTTGCAGTTCATTATTATT^ 

AAATATGGAATTATTTTATATGAAAAATAGTTGTCCCACAGG 

GTTATATATGAGAGGAAGGGTCAACAAAACAATATAGATAACACTGGAGTCTCGCTGTAG 

TATACTCATTTGGTTCTGTTAAAAAGAGTTTTGTATTATAGAATGTACCTGCCCG 
Sequence 514 

AGGTACTCCTTTNTTAAAAACACTGNAAAAGTAACCACACATATGTGAGGACTTACTATN 

TTAAATGGAATGGANTGAGCTCCATAGATTAGTTNTGAATATAAAGTATATAAAAGTGCA 

TCAGNGGTTT ATATAG GCTTTAAAAACATGTTATCTTACAGTCCTTTAAAGCAGCCATAG 

AGTTTGTATCATTTTTCAAGCCAATTTCAGTCAGGGATTTGAATTGNTTGANTATTGGAT 

GATAAANNGNGTCCATA TCTTATNAA NATGNGNTCNNGTCNCGTTCmTNGTTANATATG 

ATTNAGGGGGGMTTCATTTTTTTTTN CCCTCC ATGTTAAATNTCANAAAACN NTAAACA 

GTTGGCCNTTTNTA NTNGGNNT TCCTTGCNCCCCAAGGNTTNGNNNCNCCCCNNGGGGGG 

NAAAAATTTTAAAAN I II \ I I I I NCNNAAAAANGAGGNGGAGGNNNCCCCCCCCCCCCCC 

CCCTTTTTTT 

Sequence 515 

GCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCCTCANGCAAAAATTGCACGNGGTTG 

ATGCCTTACATGAAGTACCAGTGAANAAAGGNGAAGGTGCCGAGCTATAAACCTCCAGAA 
TATTATTA 

Sequence 516 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGCCAGGTACCAGTGGAGGAAG 

GCCTTCCGGCGGAACATGGCAGTGAACTGCTCCGAGATGCGCTTGAAGAGCTCCTGGATG 

GCTGTGCTATTGCCAATGAAGGTGACTGCCATCTTGAGGCCACGAGGNGGGATGTCACAG 

ACTCGGTCCTTGTCGCCTGGAAGGCCCCGCGCCGGGAGCGGTCGACCTCGGCCGAGGTAC 

CCACCACCACTGTATGATGCTCATGAGCTCTGGCNTGCCATGAAGGGAGTAGGCACTGAT 

GAGAATTGCCTCATTGAAATACTAAGCTTCAAGAACAAATGGAGA 

Sequence 517 

GGCNAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAGCACTGGGCT7TATA 

AAGACTGCACTCAGAACCACAC TGCA CAGTCCAGTTTTTTAAAAAGCTGCTACATGACAG 

ACAGGTAATCCCACTGAGTGAGTTTTGAGAAACAAATCAAACGAAGTAAACAAGAAACAT 

AAAAACCAAATAGCAAATGMTAAAAGCCTGTTCTTGTAACTTAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAA 

Sequence 518 

CC GGGC AAGGTACCATCTGAATACCTCTTTGAAAGAAGGAAGACTTTACGTAGTGTAGAT 
TTGTTTTGTGTTGTTTGAAMTATTATC 

TGGAATATCTGCTATATGTCAACTTTATGCAGCTTCCTTTTGAGGGACAMTTTAAAACA 

AACAACCCCCCATCACAAACTTAAAGGATTGCAAGGGCCAGATCTGTTAAGTGGTTTCAT 

AGGAGACACATCCAGCAATTGTGTGGTCAGTGGCTCTTTTACCCAATAAGATACATCACA 

GTCACATGCTTGATGGTTTATGTTGACCTAAGATTTATTTTGTTAAAATCTCTCTCTGTT 

GTGGTTCGGTCTTGGTCTGGTTTGG 

Sequence 519 

GCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACTGTGAAATTAACTGGCATCCTGG 
TGGGCCCAAGGGTTTTCAGGACTGGGGGCCAATGACTCACCCCCTCCTTCCTCCTCCTGA 
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TCCCTATCTCTAGCTCTTATCACAGATTTTGAACAATTGTCTGTGAGGTTAATGATGGTT 

TCAGAGGGAAGCCCTTTTCCTCCCTGAGACTGTGTGGGGTTCAGTCAGCCTGCTGAAATT 

GCTTCCACTTATTACCCATCCTTCCTCTTAAAAAAAAAAAAAGCCCACCAAGTTAGTATT 

CTCTGTAGCTCTCAGACAGCTACAAGTGTTCCTGGCATATTTACCAAAGTACCTGCCCGG 

GCGGCCGCTCTAGAACTAG 

Sequence 520 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGGTTTTAGTG 

TGAATTTACATAGAATAAATTTACTTCACTTTCATGTCATCGACATGAATGACACAAAAG 

CTACTTCATAATACTACTTTACAATAGTTTTCAACATTTCCATATGGTGCGACCCCTTTG 

CTCTCATCAATTTTGGGTGTCATGAGAACAATAGGTATCCCGTTGGACATGATGTATTGC 

GAAGAGCATATAAAGCAGAGGGAAMTGAAAAAGCMGAGAAACTCATTTCAATGCTTTT 

TCTAAAAGGTAACAAATATAATTTTAATCAACTTCCTTGGAAAATATTTTTAAM 

ATCAATAGAAAAAATTACAAAACATCATATGMGCTATAAATAATTTTGAAAAACACCTC 

GGCCGCTCTAGAACTAG 

Sequence 521 

AGGTTTAACCCCCCAGGGNTAAACCCCACCCAAAGGNNAAAGANGGGGGNACACAAAGCA 
CCGAAC AGCAGAGCAGN N N AAN NCNNCG NN N ACN AAACNCGGGGCN AGACACN AACN NGA 
AAG AAACCACCCCCCCGGAG NN AAAAGG NAN AGGANCAN AACANGGGC ACCGAGCCAAAG 
AAAACGGGCAAGG ACNAAGCCCCCCGGANCCACCCCCG NGGAACACCC NC N AG AGG NGGA 
GAGAAAAGAAGGAAAGGCCNGAAGAGAAAGCAAANCCNCACANNCAANCCGGAAAGGCCC 
CAAAGGCAAAGGGG 
Sequence 522 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGTNCTCGGTAACGGGCC 

GACTTAATGGGNAGATCAACGCGAAATCCTTGGAAATATATANCAACAGCAAACACAAAA 

ACTGNAGCAATGAGGTTCATGAGATTGGGTAAGTTCTGCCGATAAAAAGCCTCCCGTAAA 

GCTCGGACTTTGNCCGTCCTGGTGGCCAACAAATGGAACAGAGCTATGACTGCACCCTCA 

AACTCAGTACCTGCCCG 

Sequence 523 

GAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTACAGGGTTGAGTATTCCTAATCTGG 
TAATCTGAAATTTCACATGCTCCGAAAATCACAMCTTTTTGAGTGCCAACATGATGCTC 
AAAGAAAATGCTCATTGCAGCATTTAAGATTTCAGATTTTTGGGTTAGGMTGT^ 
GGTATAATGCAAATACTCCAAAATCTGAAGAAATCCAAAGTCTTAAATACTTCTGGTCCC 
AAGCACTTCAGATAAAGGATACTCAATTTGTATTCCTATTTTGGAGATAAGAGATGGAGA 
CTTACAGGCTGGGTTGGTTGGATTGGTTGCCTG I I I I I I GTTGATGTTGTAGTGGAAGCT 
AGGAATGG1 1 I \ ICACAI I I 1 1 I AAAGTGTACCTGCCCG 
Sequence 524 

AGGTACCGGCCATTGGTGCCTTGTGTCAGGTCCCAGAGGGCTGCCTCAATGCCAGCAGCA 

ATGCCCTCCACCACCTCCTCACTCAGTACTGGGTCTGGGAGCTCCCGAAGGCTGTGGAGG 

CCAGGGCACAGAGTAAAGCTCCTGGGCTAAGCCAGTGGCCAAGCCAGAAAACAATGCTGG 

GCAGAAAAGGGTTACAAGGCAGGTAGTCTGGACACTGGATTCAACAGCTTGCCATCAGCC 

AGGGAAAGTGAAGCTGAGGATGCATATGGAGCAGGGCCAGGAAGATCAGGAGAGCTGAGG 

GAGAACAGGAAAATTGGGAGGAAGGAAGAGGCAGCTTANCAGCAAACTCCAGGAGAAATA 

ACCAGCAGCCCTCACAGATTTGTGATTGTCTCTTAAAAAGATNTATTTTCTCTTCTTGCC 

TGAATGTACCTGCCCG 

Sequence 525 

CGAATTGGAGCTCCACCGGCGGCCGAGGTACCACCGATAATGCTATTAGCCCAAACCGTG 

GGTGTTTTNTAAATATTAATAGGGGGGCTTGATTCAACAAAGCCACAGACTTAACGTTGA 

AATTTTCTTNAGGAATTTTCTAGTAACCCAGNTTCTAAAGTAGCTACAGCAAAGGGGGAA 

ATATTATGTGTGANCATTTTTCTTCTTATGCTATATCCCCAAG1 iTTTI ICAGGACTCAT 

TTTAAGTNAAAGGCTAGAGTTGAGTAAAGGAAATAGAGCCCAAATGAGGGTAAGGTTGTC 

TGAGCCATTGAAAGTNTTAAATACTGAAAAGAATGGTCCACTTTTTATTTCANNGAAAAT 

TA 

Sequence 526 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTTTCCACGACAAATTCT 
TCAGGCTCTGCCTCTTCAAC I I 1 1 I I ACTCTTTCCATTCTG I Kill I CCCATTTTTTGC 
AATGTAGTTTTGTTGGAGGCC A 'i 111! IA TTGCAGACTTGAAGAGCTATTATTCACCGCC 
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TCCGAGCTGCTCCGACCTGCCCGGGCGGCCGAGGTACTTGTGCCTAGTTTTTCAAGGTAT 
TGGCTGTTCTATAGATGCAGTGATTGTCCCAGCTAGCTCTGTTACCAGCCTTTTGGTGTG 
TCTTTATGTTCATTTGGAGAGTCAGGGCC 
Sequence 527 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATAAAGACAAACTNGTAANGCA 

TGACCTGGGGAAATGGTCAGACCTTGTATTGTGTTTTTGGCCTTGAAAGNAGCAAGTGAC 

CAGAATCTGCCATGGCAACAGGCTTTAAAAAAGACCCTTAAAAAGACACTGTCTCAACTG 

TGGATGTTAGCACCGGCCAGCTCTNTGTACTCCCTGGCACCATCTGGTTCATGCCAGGTT 

TAGAGTTCATCGTGTTTGGCTTCCTGGTCTTTGAGTTTAAATTTCTCGGCTGTGACCTTC 

CCATCTCCATTCCGGTCCTGGTTGGTGAACATATTCTTCACANNCAGGTCAGNATNNAAA 

GCCAGGAGCGAGTTTNCCTTTGCCAGATGCCACCTGGGCGTGAATGTACCTNGCCnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTCCNNTTATAAAANCTTATTN 

GGATTCCCGTCNNACCCNNGAGGGGGGGGGCC 

Sequence 528 

AGCTCCCCGCGGTGGCGGCCGAGGTACAAACTCTGGCCAGTGCCCTGTTAATTGTTAATG 
TTGGTAGAMGCAGATTTGAATGCTTCTTTTAA 

CAAAACAGTATGTCTGTAGTTAACCTACTACTCTAGTAACCAACACATGGAAGCATCATA 
TGCACATAAAAGTTAAACATTAAAACAAAACTTAAAACTCTCAAAACAGATAATCATGAA 
TTTGTGTTTATAAAATTATATAGAATAAAGGCTTTGTGTGTGGGGACTTTCAGACCTGCC 
CG 

Sequence 529 

TGGAGCTCCTCGCGGTGGCGGCCGCCCGGGCAGGGTACAATTCGCCCCGTGAGTCTAGGA 
AGATTTGNTGAGCGTCTGCCAACTGGCAGATGAGGAACCGAAGTTCAGAGAGTTGAAAGC 
ACTTGCCCCAGGTCA 
Sequence 530 

AATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTGCGACCGCCACGGAGCAGTGGG 

TT CTGG TGGAGATGGTACAAGACCACACCGGATGTCATCTTTGTATTTGGATTCAGAACT 

CATTTTGGTTGGTGGCAAGACAACTGGCTTTGGCATGATTTATGATTCCCTGGATTATGC 

AAAGAAAAATGAACCCAAACATAAGACTTGCAAGACATGGCCTGTATGAGAAGAAAAAGA 

CCTCAAGAAAGCAACGAAAGGAACNCAAGAACAGAATGAANNAAAGTNANGGGGGACTNG 

CNAAAGNCCAATGTTGGNGNTNGCCAAAAAGTGAGCTTGGNNNATTGGATNCNCNAGCCC 

G AAGGG ANTTAAAAGGGGGCNNNNCANNTGNTNTTNGNNTTNGGGGCCCCCTTGGTGGNT 

NTTTrTCNCCAGGGACCNTTTAmANCTTTAANAACNTTCTAAAAAAAAA 

Sequence 531 

TGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTGACAGTATGTCTAAGATGAGAGAGTTTA 

GGGACTACTCTGTTTTAGCAAGAGATATrTTGGGGGTCI I I I I GTTTTAACTATTGTCAG 

GAGATTGGGCTAAAGAGAAGACGACGAGAGTAAGGAAATAAAGGGAATTGCCTCTGGCTA 

GAGAGTAGTTAGGTGTTMTACCTGGTAGAGATGTAAGGGATATGACCTCCCTTTCTTTA 

TGTGCTCACTGAGGATCTGAGGGGACCCTGTTTA 

Sequence 532 

TTAATTGGAGCTCCCCGCGGTGGCGGCCGCCNNGGCAGGTACAATNNTTTATTANGGCAT 

TTTTATTGCTCCGAATGAAMGAGTGGAGCTTTAAGTAMTATGCCAAAAAAATATTCCT 

AGCACAAAAGCCTGCACCACTGCTTGAGTCTCCTTTTAAAGATAGAGATCATTTCCAGCT 

TGGCACCTTATCTCATACTCTCAACATTAAAGCTACTGGGTACCT 

Sequence 533 

GCCGAGGGCGGGGGGGGGGGTTn'GGGGGGGGGNAAAGCCCNTTTGGGGGGGGGTTTAAA 

CNCNCCNANACACAGGNANGGGNNNNNACNCCNANNAACAAAGGGCAAAAAAAAACNNCN 

NCTNNNAGGGNAAAGGANATAAGAGGGCCAAAAAAAGCAGCNCNAANNAr4ACAGGGNAAN 

AAAAAGNAAAGCNNGCGGCCCAGGCCCGGNNGGGGACCCAAGGNAAAACGGCCCAGGGNG 

GGGNAGGGCCACAAACCAACGGGGGAATTCCCNGGACCGGAAAGAAGGGGAGNNANAANA 

NCCCAANNGNNAAAAAANAAACACNCANCCGAAACACANNNANAAGCCAGGAANCNGCCA 

TCTGCCNTAAAAGCCAGGNAACGGGNANCNCACACCCCNGGGANNNACACAACCCCNNNC 

TCTNNGGCNGNAAGNGGAANCTGGAGGGGAGGGNCGCTCTTCTCCCGGACCCCGNAAACN 

GGGCCGNNCAANAACAAAGGGGACCCCCCGGGCGGCAGGAAAGNAAAANCAAGCNAAACN 

AAACCGGNGACCCNNGNGGGGGGCCCGGNACCCANGNNNGGCCCCCTAAGGGAGGGGAAA 

ANNGNCNCCGGGGGGAAAAACAAGGGGNAAAGCGGGGGCCCGGGGGAAAAAGGGGAAACC 
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CCNCANAAANCCCACACAAAAAAACCANGCCCGGGAACAAAAAAGGGGAAAACCCCGGGG 
GGG 

Sequence 534 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCANGAGGCCTGAATTCATGTC 

CACAATGACCTGTGCTTAACTATTCCAAAGGTCGCTAAAGATACTGTTACTACTATTGAG 

ATATTATTGGCTACTTCACGTTTACATAGTAAATGTTTGCAGCATATAACATTCAATCAT 

AAACCCATAATTAACTTATAAGTGTTAATGGACAACTGTGCTTTGATTTTTGCCTTTAGT 

GATMGAAAACAAAGTAGTGAAATGGGTCACTCCTCAAAGCATGGMCATTTTMCTTTG 

CCTAGTAAGGAAAMCAAAACAAAATATAGCAATTACATGTGGAACCCGTAACCTGCAAA 

AAGTMCACAAATATTGTCTNAAAAGGTACCTGCCCGGGCCGGNCGCTCTAAAACTA 

Sequence 535 

CGGGCAGGTACATGAAATGGTTTTGAAACAATAGGAACAGATAAGTCCCAGATAGGAGGC 

TCACTGATACTTAATTGGCCATGTCACCAATGTTTGTTTTTAAGGGAGTTTGGTGGTTGC 

CATGTTATC A I i I I II I TATCTTTCTTTC I I I I I CTTTCTTTCTTTCTTTCTTTCTTTCT 

TTTCTTTCTTCCTCTTCTTCTTCTTTTTTGCAMTAATTTACAGGCAT^ 

GTAAGAATACTGGGTCATAGTTTCATCCAGTGAAACTCTGTGACAATCCTTCACTAGAAG 

GAGAGTACCT 

Sequence 536 

CCGCGGTGGCCGGCCGAGGTACCTTTCTCCCTCTCTACCAGTTTTTCCTCTAACTCTCTC 

ACTTTCTCTTTATTTCTTTCACTTTCTTCTAATGCATTCTGCMGATCCTGCTTTA 

ATTTATTTCCTCTTCC GTG ACCCTC AGCTCGTGG AAATGTG N ATAATTAATCC ATGCTTT 

CAGGTGAGACAAAGACCGAGTTTCATCTGGCTTCTGCACACTAAGGACTTCTTTCATAAG 

CCTCCTCGGTATTTGCTCTGMGmCTTTAAAGTTTCTGGGCTGAGGGTCAAGAAGCTG 

TTCTCTTGAAATCTCAAGTGGT TGTT TAAAGCATACCAAGTTCTTGCCCTTCGGGGATTG 

GAAGTTCCGACCTGTAGCTGGTTTTCTCTCTTGCTTCAAGAAGCTTCTCTAAATCTCNTT 

CTGGNAGGATCTTNGCAAAAATCTCTTNGCAAGTTGGCTGGAAATTCTGACCANTCAATT 

GGTCAGGTAGGTGGGAATTTTACCTAAAAGAATGGTAATTAAAAGACAAGGATGGGAAGA 

TNGAATTTGGGAAGTCTTGGGGGGAAGAAGGGTTTCNACCMGGTTTTCCCCAAGGGGAT 

NGGGGCCCCCAANTTAAAGTTTGGGGNGGGAATNGGGTNTGAAGNTNAAAACTTTAGGTN 

TTGCTTCCCCCCTCCTTGGGGGGNAANTTGGGGCCCCNGNAAANTAAATCTGGGNAAACC 

CCCTAATTTGGGNAAAGGGGA 

Sequence 537 

AGGTACCCTCTCATGAGGAGGAATCCATGCTGGAGGATTCTCATAGGCAGGCAAGAAAAC 

CACCGGGTAGTCATCATAAGGAATCCGGCTGTCCATCTCGGGCAAGGCCTTAGCTAAACA 

TCTTCTGGATGCCAAGCAACCTGTCAGGTACGCGGGAAAGTTGAAGGCTACAAGAAGACC 

AAGGAAGCTGTTTTGCTCCTTAAGAAACTTAAAGCCTGNAATGATATCAAAAAGGTCTAT 

GCCTCTCAGCGAATGANAGCTGGGCAAAGGCAAAAATGGAGAAAACCGTCGCCCGTATCN 

AGCGCAGGGGGCCCGTGCCATCATCTATAATGAGGGATAAATGGGTTATCAATCAAAGGC 

CCTTCAGAAACATCCCTGGAATTACTCTGCTTAATGTAAGCAAGCTGMCATTTTGAAGC 

Sequence 538 

GGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGCCACGGTACGCGGGGGTGGCGGCGT 
TGGGTTGAGCGGGCTTTTTGGAAGNTNGNGGCGGGAGTTCTGTGATATGAGCAACANTGG 
ACCAGANGATTTTATCTCTAGCAGCNGAAAAAACAGCNGACAAACTGCAAGAATTTCTTG 
. GGCAGGGCCTGGGGAATGC I Nil IATCTCATATTAGTGCCTGTGATGGCATCTTTCATC 
TAACACGTGCTTTTGAAGATGATGATATCACGCACGTTGAAGGAAGTGTAGATCCTA7TC 
GAGATATAGAAATAATACATGAAGAGCTTNAGCTTAAAGATGAGGAAATGATTGGGCCCA 
TTATAGATAAACTAGAAAAGGTGGGCTGTGAGAGGAGGAGATAAA 
Sequence 539 

GGTACACTGATCAGGGACTGGAATCTTCTTTCCAATTTCCATGGCATATGCTTTCACTTT 

gctgagg rmrm I AAGTGCAAGTAGAGCTTATCTTGGTATTCTATAGGACTTGCAGT 

TGTCTCTGGAGTTTCTTCCTGGGAGTTTTCTTTAACAGTTTCTGACAMTTCTCTGAGTC 

TATATGTATAATATTTGGGTATGAAACTGAAGACTCCCCCAGCTGTGATCTCTCAGNGTC 

TGTCTTAAACGGCAGCTTGTCATTTCCTTGGCCAAGTGGGTCTATTTCTGAAAGAGAGAT 

GCTGGCTGGTATGTCTCCCCTGTTTTCATCTTCGCCGTCATCTGAACCCACACCCTTTGT 

GAGCATAGTGTGGAAAGCCAAACTTTTTGACC 
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Sequence 540 

CCGGGCAGGTACAGGCTGTTCACCAGCTTGCTCAGAATCACGCCATTCTTTCAGCCAGAC 

CTGGAAGCCCAAGCCGCCCANCGGGTCTGGGNCGGGCCANATTCAAGGGNCCNANCTTGA 

NCTTATTGNATCCNACTTCCAACNAGCCCGGCTTCCCTTCCNAGCTTCCCTTCGGTCANT 

ACTTTCTTTCTTCGNNTTTTTGGGANCTTGGCMCTTTCGGCGGGCTTCAATGGCCCNTA 

GGGAAAGGGACCCCTTGGTTGGGCCCATGGTCNTGGGGGAAAGAAGGCTTCCTCAGGGGC 

TCGCAAGGAAGGAGTGAAGACTTGTGACCTCGGCCGCTCTAGAACTAGTG 

Sequence 541 

GCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGATCTGGGTGGAAGGTGACTCCTG 

TGGGCGC GATGA TGGGGTGGAGGGTGTCTGATGGAATCCCGTTGGGTCCCGGCTGCCCTA 

CGTCACCCTTTTCACCCTTAACTCCGTAGAAACCAAGTCCTCTGTTGCCTTGCTGTCCTT 

TTGGTCCAGGGGGTCCAGGTGGTCCTGGTCTCCCTGGAGCTCCAACTGGACCCATCTGTC 

CGACATGCCCGGGCGGCCGCCCGGGCAGGTACTCTCAGAGAACAGGAGATATGTGTGCAT 

GCCTTAGAAAAAGCCCTTGAGTAGGTAAGGAAGGAAGAAACCTATGGCAGACAGGATTGC 

TGCAGCCAAGGGGGCTTCAAGCAGAAATGAAAGAGATGGCTTGTGAAGCTGCCCGTGTCG 

TGT 

Sequence 542 

H II ill l l I II I II I'l nCGNTTGGGGGACCCAAAAACCCCCAANAACNAGGGGACCCC 

CCGGGCNGCAGGAANTCGAAAACAAGCTAAACGAAACnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnTNCTGGGGCNGGGGGGAAAAGGGNANCCGCNCACAANNCCANANAACAAACGAGCCGG 

GAGCAAAAAGGGNAAAGCCGG 

Sequence 543 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAATTCTCTTCTTAA 

CCATTTGGTCCTCTGATATAAAACTTGCTTAACTGGCAAGATGGCTCACTTGGTCCTCAA 

GCCAACATCAAGAAATATTTAGCTGTCTTAACACAGGTGGGAGAC I I I I II HA ATGAAA 

TACATAGGAATCCATTAAGTAACATTAGCATGTCATGTGTTAAATAGCTTCATTTAAAAA 

TGTTACAGATCAACAGAAGGGAAATCAAAAACTTATTTTTCTCAGTAACTACTTTACATA 

CTAGTGTAATTAAAGGTTTAACTGGMGAAAAATTCAGCATAAAAGTATAATTAAAGTAA 

AAAGCTGGTATTCCACATTCCTGTTCCTATGAACACCTGAGCTGGCACAAAAGCTGAGTA 

GTTTAATGCATTGGCATATAT 

Sequence 544 

gaan tgaagctccaccgcggcggcggccgcccgggcaggtacaattagaaaacctgcatc 
ttttaatacagtgagatttngtatgcatacact^ 

tattggcttaaaagccanaaaggtgctcatgcatgtaatttatacttggagcaacgaatg 

ccantgtatgtggncgcgtntgtgtgcacntttgaaaggnaggagtttttagaagnaatg 

agmggaaaatttnctttcttgggcattcattaaataattcagngtgaatatccttgtac 

ncttggccgntctanaaaccagntggatcccccggggctgcaaggaaatttcgaantntt 

caanggcttatttganncccggcgacctccnanggggggggcnccgggtaccccaanctt 

TTT 

Sequence 545 

CCGGGCAGGTACATTATTGTTTATNTGAAATTTTAATTGAACTAACAATCCTAGTTTGAT 

ACTCCCAGTCTTGTCATTGCCAGCTGTGTTGGTAGTGCTGNGTTGAATTACGGAATAATG 

AGTTAAGAACTATTAAAACAAGCCAAAACTCCACAGTCAATATTAGTAATTTCTTGCTTG 

GTTGAAACTTGTTTATTATGTACCCTTCGGGCGCGCTCTAGAACTANTGGGATCCCCCGG 

GGCTGCAAGGAAATTTCGATATCAAGGCTTTATCGGATACCCGTTCGACCCTCGAGGTGG 

GNGGCCCCGGGTACCCAAGCTTTTTTGTTCCCTTTAAGTGGAGGGGGTTTAAATCTGGCG 

CCGCCTTGGGCCGTTAATTCATGGGTTCATAAGCCTGGCTTTCCTGGTGTGGAMTTTGG 

TTANTTNCGGCTTCAACAAATTTNCCAACAACCAAACATAACCGAAGCCCC 

Sequence 546 

CNGCCGGCCGAGGNACCAAAATTTGTGGGAGGGNANTNGAAACCCGGCAGANTTTAANAC 

CNNATGCCNATAAANGGGGGGNCAGGGGANGAAGACGGGGGGCCCGGGNGAACAAAAACC 

ACACGGNCTCTANGGAAAANGNGGAGAGAACTGAGAGCGAGGTGNGGCAAGAAGCAGGCT 

CGGAGCCGGAGGAGGGNGGCTANCGGCNNTAATCTCAGGGAGAGATGGCGCNGTGCGGNC 

AATGATGCAAGCNGGNAAGGGACCNGGCGGGGAGGGAAAGGGACGAGGGAAG 

Sequence 547 
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CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATAGACTGGG 

TCAGAAACAGATATAATACATTATTATTTTAATATATTGCTTGTTAATAGCTTTAAAAAT 

ACTACA7TCTATGGTAGATTATTGGTTATATAAAAGGTAGAAATGTCATAAGTTTTNTAA 

TTNCCAGTCTGTAAAACCTGTCAGTTGAGTGCTTTGAAGGTACCTnnnnnnnnnnnnnnn 

nnnnnnrinnnnnnnNGNTTGCTGGGAAATTCCGNTATCAAAGCTTATTCGATACCCGTCG 

ACCTTCGAGGGGGGGGGCCCGGTTCCCCANNCNTTTTGCTTCCCTTT 

Sequence 548 

GCAGCAGGAGCCTGGGAAAGAGGCCNTCACCCTGGCGATGGCCTCTTTCCCACCTCCCCT 

TCAGTTCANANCCAATATGGNTCTAGAATCTGGCACTTTACTCATTTCCTTTGATAAATT 

AGTACCTCNNCCGCTCTAGAACTAGTGNATCCCCCNGNCNnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnNCAGTMTCATTNTCATANCTTTATTTCCTTTGTGAAATT 

GTTATNCCNCTCACAAT 

Sequence 549 

AGGTCACAGTTATGGCAMGTAGACAMGCATTTGTTCATTTGGAGCTTAGAGTCCAGGA 

GGAATACATTAGATAATGACACAATCAAATATAAATTGCAAGATGTCACAGGTGTGATGA 

AGGGAGAGTAGGAGAGACCATGAGTATGTGTAACAGGAGGACACAGCATTATTCTAGTGC 

TGTACCTGCCCGGGCGGCCGAGGTACCTTACATCAATGGCAAGTTTAAGAAGGATAATTA 

ATTACACAAACCCTTCACAGACTGCTCTGGTGCCTGGTGGTGCTAGCTCCTCCCACCTCA 

GCACCTGCTGATTTCTGGGAGCAGC 

Sequence 550 

CGCCCGGCAGGNANTA I I II Hi II II I 11 1 11 I I 11 II II M I I II I 1 1 I M 11 1 1111 

I I 1 1 1 1 1 I I I I I 1 1 I 1 1 I I 1 1 1 T 1 1 1 1 1 1 1 I 1 1 I 1 1 1 I 1 1 I 1 1 1 I I I 1 AAAANNTNNAAA 

ANTTTTNNNNAAAAAAAAAAAAAAAMNGNCCNAAAAAAAAAAAAAAN ITTTTTTTTTTI 

AAAAANNCCCNTNNCNTTNNAAAANNAAAAANCCNCNNCCCCCCAANANNAAAAAATTTA 

AATNAAANGTTANCCNCNNCCCNGAGCCGGGGGGGGGNNCCCAANCCNNNTTTTTNTGGC 

NTAAAAAAAANGTNAAANNCCCCCNNANAAAAAAAAANNCNNTTTCCCAATTCCCAANNN 

NAAAACANTTNNCCCTTCCAAAAAAAAANAAAANCNCAACCNGAAAAANAAAAANNGNCA 

ACAACCNNNGGNCNAAANAANACNCNCCNNNNNNGTATNGGGGGNCCNCCNCNTTCGGGA 

AAAAAAAAAANAAAAACCCCCCCCAAAAANAAAAANAAACNGCCCAAANNGGGGGNAAAN 

NNNNTTNGGNTAANNCCCCCTTNNCCCATTTNCTNAAAAAAGNNNGGGGNNGNTTNGGNN 

NNNGGGNGNGNNNNNNNAAATCTTCTCCCCAAAGGNNNTATTCCCCTTTTAAAAAAANTG 

GGGGAAAANCTANGAAAAAAAATTNGGGCAAAACCCNGCCCCNGCCCCGGGGCCCGTNNA 

GGGGCN N NTTTTNTANNTN N AAACAACCCCCCCCCCCC N AAAAAAAAA 

Sequence 551 

CCGCGGTGGCGGCCCGCCCGGCAGGTACTTGTTATCAACACGTTTGTATCAGAGTTGCTT 

TTCTAATCTTGTTAAATTGCTTATTCTAGGTCTGTAATTTATTAACTGGCTACTGGGAAA 

TTACTTATTTTCTGGATCTATCTGTATTTTCATTTAACTCAATATCATACTACCGGCTAC 

ATCAAATCAGTCCTTTGATTCCATTTGGTGACCATCTGTTTGAGAATATGATCATGTAAA 

TGATTATCTCCTTTATAGCCTGTAACCAGATTAAGGAATACAGCTCTTAAAAAATCAAGA 

ACTTCCTGAGTTTCACATATAAAATGGTGACAAACACCTGCTTCTGATAAAGTTTATCCA 

ATTTTGTATTTATAGTATTCTATTGTAATAATAGAATTCTATTATGTGCTTCCCTCCTTA 

GACATAGTTTTTCTCCTTTTGTACCT 

Sequence 552 

CCGCGGTGGCGGCCGAGGTACAAATTCTCTTCTTAACCATTTGGTCCTCTGATATAAAAC 

TTGCTTAACTGGCAAGATGGCTCACTTGGTCCTCAAGCCAACATCAAGAAATATTTAGCT 

GTCTTAACACAGGTGGGAGAC I I III I I I AATGAAATACATAGGAATCCATTAAGTAACA 

TTAGCATGTCATGTGTTAAATAGCTTCATTTAAAAATGTTACAGATCAACAGAAGGGAAA 

TCAAAAACTTATTTTTCTCAGTAACTACTTTACATACTAGTGTAATTAAAGGTTTAACTG 

GMGAAAAATTCAGCATAAAAGTATAATTAAAGTAAAAAGCTGGTATTCCACATTCCTGT 

TCCTATGAACACCTGAGCTGGCACAAAAGCTGAGTAGTTTAATGCATTTGGCATATATAT 

GCCTTAAGGACTGTACCTGCCCG 

Sequence 553 

CCGGGCAGG I I Ml I i III I I GTC ATTTTAAATGTTTATTTTATTTTG C AC AATTTTAAC 

AGTAGAGTAATTTTGCTGTGGAAAATAAAATTTCATCTTGCAAGAAAGAGAAAACTCAAG 

ACCACCAACTGGCGTCTTTATTTACATATTAGAGCGGGTCAGCCAATGAAAATCCTGGGC 
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CTTCCGTGTGGCTTGTGGCTCAATCATGATGTTGATANGAGAAGGTTTAGTTGTGTCTGC 
TAGGCTCTGCCTCAGGGA 1 1 I I I GGAGTTCTTCTGGTGTTTGTACCT 
Sequence 554 

CCGCGGTGGCGGCCCGAGGTACCCCGTTCTGCCTGAGCAI MM I CCTAAAGGG AAG AAT 

C AATA GTTTCTGACTGTTTTAACAGCTGAAAGCTCCAACTGGAGGCAGAAGATGGGATGG 

CTTTTCACACACGTGCGTGCMGTTTAGCCACCTCAAAGGCCTTGTTCTTAAAGCAACAG 

TGCTGTTTGCATTATGAAATGTCTCTGGAGTTCCCCTTTGGAAAGGCTGCTGGTGGGCCA 

CATGGTCACGATACTTTCAAGTCACACCCTACTTTGTGACCTTATCCTCAGAGTAAAGGC 

TTTAGAGGAAAA GGGA CCCCACAGTCTCACCCATTACCTGGCTGTCAGCATCTCCATATG 

CTCCTGGCTGAGTTTTATTGAGCATCAGCTGGGGATGTGAGCAGAAACCTGAATCCTTGA 

GACAGGTGGTTTTCAAAAAGGAAGCCATAACAATGAGTGGCTTAGTACCTGCCCG 

Sequence 555 

CGAGGI IMIIIIIlllllllllNillllllll I 1 NGAAGAGAAAAGATATTTAACCATT 

TCTCAGGCTAATCTNTTAAAGCGCTCAAAGTGTTTTCACAAGTGCAAGGGGAGGGATTGT 

TCTCAGCCCAACATTTACCCAGTTNTTTTTCTTANACCACNCAAAAAACACTCTCNAATT 

TACGGTGCCTTTANACCTTTACTTT 

Sequence 556 

GGCAGGTACACAGCGGCAGTCGCGCCCACACGTCCATGACTGGTCGTCCTAGATTTTAGG 

TGTCGATGAATACGGCCCACTGGGAACTGCAAGCCGGCTCTCTGCGAGCGGGAAACCGCC 

TTTGTCTTGGCCTTTCCGGAGTCCTTTCCAGCCTTACCGCCAGCCATTTCGAATTCCGCT 

GAAGCTTAAAGCAAGCAAGGCAGAGAAAANGGNTAATCGGACCCACGGNGGGNATCCCAC 

CAACNTAN TTNl M M I I GAAACCGGGAANAAA AAAAG GGGGNTTTTTNCCCCCNTTNCC 

CGGNNNCCTTTTTAAAAANCCCCCCNGGCCCNTTTTAAAAAANAGGGGGGCCCCCCCCGC 

NGGGGGGGGGGNNNTNTTTTNAAAANN 1 1 1 M M I I CCCCCCCCCCCCCCCNGGGGGGGG 

GGGGGGGCCCCCCNNNCNA I M M i M I I M M M M M I NNAANAAAANAAANCCCCCC 

CCCCCCCNAAAAAAAAAAAA 

Sequence 557 

CCGCGGTGGCGGCCGCCCGGGCAGGTACATGGGCTGGAAGTGGGGATGTCCATGGCCCAA 

GAGACCCCTCCAGCTTCCAGCCCCTGGCAACAGCGGGAGAGCAAACCCAGCACCAGGGAC 

TTGGCGTGAGCCAGGGCCGGGAGGCCCCCAATCAGGCTCTGGGGTATCAGANCAACCAAN 

TTGACGTAGTGTGAAAAAATAGTATTCCTTTGATAAAAAATACTGTCCCTTGGTCTTCTC 

TAAGTTTGAAACACCCTGGGGAGCTTTATTTTTTAGCAAAGCCAATTTCCCATACCCACC 

CAAAANTTTTTNCATTCCAAGTTTTAGGN^ 

TAAACTTTAAATTTTTGGAAAGGNCCTTTTGNANTTTAAAAMCCC 

TTAAAAAAATNANGGGGNNCTNCCCCCGGGGCTCTGGGNGGNGAATNTTTTTTNTTTAAA 

NAGTTTTTNTTANTAACCCCCCNCCCCCTTTTNGGGGGGGGGGGCCCC 

Sequence 558 

CTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGI M M M I I GAGAC 

AGAGTCTCACCCTGTCACCCAGGCTGGATGGAGTGCAGTGGTGTGATCTCGGCTCACTGC 

AAGCTCCGCCTCCTGGGTTCACACTTTTCTCCTGCCTCAGCCTCCTGAGTAGCTGGGACT 

ACAGGCGCCCGCCACCACGCCCAGCTAATTrrrTTGTAGTTTTAGTAGAGTCGGGGTTTC 

ACCGTGTTAACCAGGATGGTCTCGATCTCCTGCCCTTGTGATCCGCCCGCCTCGGCCTCC 

CAAAGTGCTGGGATTACAGGCGTGAGCTACCACGCCCGGCCGAGGTACTGTTCCTGTTGG 

CCGAGTGGAGACTGGTGTTCTCAAACCCGGTATGGTGGTCACCTTTGCTCCAGTCAACGT 

TACAACGGAAGTAAAATCTGTCGAAATGCACCATGA 

Sequence 559 

CTATAGGGCGAATTGGANCTCCCCGCGGNGGCGGCCGAGGTACAAGTTGGGGTCATAATT 

ATCGAGTCTCTTGATATNATCACAATTACTGTGCCCCCTGCACTTCCTGCTGCAATGACT 

GCTGGTATTGN GTATGCTCAGAGAGGACTGAAAAAAAACGGT MTTTNTGTATCAGTCCT 

NNAAGAATAAATATTTGNGGNCAGNTCAATCTTGNNTGNTTNGACAAGACTGGAACTCTA 

ACTGAAGATGGTATTACATCTTTGGGGGANNCAACGAGNGGAAAANGCNCCGATTTCTTT 

NACCANAAGAAAATGTGTNCCATGCTGNNNNTGGTNAAACCCCAGCCNGTTGCTAGCANG 

GCTACTCCCCATTCACTTACAAAA 

Sequence 560 

NCTACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACCTGAAA 
CTGCCGCCACATGCACTCCTCCACCGCTGAGAGTTGAATAGCTTTTCTTCTGCAATGGGA 
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GTTGGGAGT6ATGCGTTTGATTCTGCCCACAGGGCCTGTGCCAAGGCAATCAGATCTTTA 

TGAGAGCAGTATTTTCTGTGTTTTCTTTTTMTTTACCTTCAGTCMC 

TCCTGGATTTCCAAGATCCGCGTCTGAAAGTGCAGTACCT 

Sequence 56 1 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGAACTTGCTACTTTTT 

rTTTTTTTTI IGAGACAGAGTTTTGCTCTCATTGCCCAGGCTGGAGTGCGGTGGTGCTAT 

TTCAGCTCACCACAACCTCTGCCTCCTGGGTTCAAGTGATTCTCCTG CCTTAG CCTCCCG 

AATAGCTGGAATTACAGGCACGCACCACCATGCCTGACTMTTTTGTATT7TTAGTAGAC 

ATGGGGTTTCTCCATGTTGGTCAGGCTGGTCTCAAACTCCCACCTTCAGGTGATCCGCCC 

ACCTCGGCCTCCTGAGGTGCTGAGATTACAGGCGTGAGCCACTGTGCCAGCTTGCTAATT 

TTCACAGAAGTTGATGGCAATTCTTCACATGTAAACAGTGCCAGTGCACAGAACCTTTAT 

ATA I II J I JGAAGCCAGTACCTGCCCG 

Sequence 562 

CCGCGGTGGCGGCCGCCCGGGCAGGTACACAAGTCAGTCCAACAGTTAGTGTTAATTACT 

AATAATATATGAAAACCCTGCCAACACAATTGCTGCTACATCACCAATATAATTATTAAC 

CACTGTCGGAAAAACACACATAAATTCAGGTAAGACTAAAAGCTGTCTCACAAAAAGAAA 

AAAGAAATCCAATGGATCCACTAATGCTATCAAAAGGGACATGCAGGAATGTAACATGAC 

ATTTTTAGAAATGTGTGTrfCTAAAAAGAAAAAAAAATACACTAAAATGCCAGTGGACTA 

TAATTCATTCAAMCATCTTTAGTGTTCCTTCCCAAAGATCTTGATCTGCTCAGTAATTG 

CTTCACAAGATCTATCACAGCCATCTTTTGGAGCGTATGGTTAGGCTGGTCCTCCTGTGG 

TGGTAGGGGCAGTCTTTTTGMGCTTTAAGTATCTGGTGGTACCTCGGCCGCTCTAGAAC 

TAGGTGGATCCC 

Sequence 563 

TTNCAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATCGGAAATTATCTACA 

ATGAAGAAAATTTNTTGGNAAACTTATCCAACAGCCTGGGCAAGCTACCTCTCGCATGGG 

AAATTGATAAATCTGAATTTGATGGGGTGACCACAAATTCGAAACACAAATCAGGCAATG 

CAAAGAAACAAGTTTCCAAGAGAAAAACTTCAGATAAAAAGGGAAGATATCAGAAGGAAT 

GTCCTCAGCATTCTCCTCTTGAAGATATTAAACAGCGGAAAGTATTAGACCTCAGACGAT 

GGTACCTGCCCG 

Sequence 564 

CAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGI II I I I I I I I I 1 1 I I I I I I I I I I 
I M I 1 I I I 1 1 I GGNCCCAGTAATTTATNGATTAAATCATNGTAATCNCCAATANAGATTN 
CAATAGAGANCNCCAACATGATTNCATGCNTTNAGNGGANAAATNTTNCCNIGGTTAAG 
Sequence 565 

AGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACTGGGGATACAGGAG 
AG AANCN ANCG N NTTTGTCTTTGATCN N AANGAATCGGC ATNGTANAAAGTGGAANATGN 
N ATGNG ATGN ACACATTAATTATN ATCATNCCCTTN N NCTACCTAAAATACTCGC AGTGG 
. CTCNACCTTACGCATATAAAAACTNCGCATTCCAGCCCACCGCCTTCA 
Sequence 566 

NCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGC AGGTA CAATCTCATTT 

TCATAAAAACAAAAGAAAATCATTATGGACATGTGTGTGTAAAAATGCT7TTTGGAAGGA 

CTAGAAACTAGATCAACCAGCTAAACAGAAGTTCCTTCTGTAGCATGGCAGCGGGTTTCC 

ATTTTTTATTAGTTTCCTAGTTTTTrAMTATAAAAAGAAT 

TTGTTTATTATGCCTACAAAAATTAATATTTATATTTATTGTGTATATATATAAM 

ACAAACACTTAAAGTTTATACATATACACACTAGATATGGCCTGTGAATTGTACCTCGGC 

CGCTCTAGAACTAGT 

Sequence 567 

TACTACAGGGCGAATTGGAGCTCNCCGCGGNGGCGGCCGAGGTACTCCAATGCGTGGCTC 

TCCTGTGTGATTCTTNCTCTCCACATGGTTGTGAGTCTTGACAATCGCAGCCCCTGCTTT 

TTCCCTTCCCTGGGAGGCTAGAACANAGAAGCCCTTAC 

Sequence 568 

GAATTNGAGCTCACCGCGGGGGCGGCCGCCCGGGCAGGTACATTTTTAATCTGTTATAGT 
TNTTTGCAGTATTAAAATGGTGAGTAAAAANAAAAAAAAAAAAAAAAAAN NGGGCTTGTA 
CCTCN 
Sequence 569 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGGGCGGACCTCATCCGCCTCCT 
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GCTGAAGCACGGGGCCAACGCAGGTGCCAGGAACGCAGACCAAGCCGTCCCGCTCCACCT 

GGCCTGCCAGCAGGGCCACTTTCAGGTGGTGAAGTGTCTGTTAGATTCGAATGCAAAACC 

CAATAAGAAGGACCTCAGTGGAAACACGCCCCTCATTTACGCCTGCTCCGGTGGCCATCA 

CGAGCTTGTGGCACTGCTGCTACAGCACGGGGCCTCCATTAACGCTTCTAACAATAAGGG 

CAACACAGCGCTGCACGAGGCTGTGATTGAAAAGCACGTCTTCGTGGTAGAGCTGCTTCT 

GCTCCACGGGAGCGGTCAGTTCAGGTGCTGAACAAGCGGCAGCGCACGGCTGTAGGACTG 

TGCTGAACAGAATTCAAAAATAATGGAATTGCTTCAAGGNGGTACC FCGCCCGAGGGCTC 

CTTCCTTATGACTCCATTCAAATTTACGGGNTATTAAAGCAGGGGCTTCAGTGGCCCGGG 
GTCCCT 

Sequence 570 

CCGCGGTGGCGGCCGAGGTACTAAAAACACAAAAGCACTTATCAGACACCCTATGGCATT 

TGTTGCTGAAATGTCTACCGGCTGGAGATTAGCCTTGAGCCGAAGCAGCCTCAGAGCCAA 

CACTGGCTTCCTCCTGTAAATCATCTCCAGCAGAGGACTCTGACTCAGCCGGAGAATTTT 

CTTCTTCTAGTTCAGCATATTCATCAGAGGTTTCGTTCTTTGTTGTATCTTTATCATTAT 

CGATGGTGACAGCTTCATCCAGGGCAGCATTTGTAACCTTGGGGGTTGTTTCAGGGGTTT 

ACTTCCGTGGGTTTCCCTCGCCGCTGCATCTGTGACCTCTGGCCCCGGTTTCAGCACTGA 

CTGCTTGTGGGTCTCCGGAGCAGCCTTCCACGTGACCTGGGACAGGCAGATGCCTCTTTT 

ATGACCACAACTTGGTAGGTACCTGCCCGGGCGGCCCGCTCTAAAACTAGGGGGGGATCC 
CCCCG 

Sequence 571 

GTATAC GACTN CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGAAAAGTT 

TTATTCCTTTTGAGGACAAACAATTAAACCACATCCAAGGTCTTAACTTACAGACAGAAA 

CCAAAGTAGCCATTTAAAGCGTTAGATATCGGATACAAGACATACACTGGGGAGAATGCT 

TCACCATCTGAAGCTCACACCACAATGGCCCAGTGGACAGCTGTGCACTCTGCTTGTGCT 

TAAGTGCCTGGGTGTGGCTGAGGGGAAGGCGTGTCTGCAGAACAGAAGAACAGCTGTGTT 

TCACAAGTACCTGCCCG 

Sequence 572 

CCGCGGTGGCGGCCGAGGTACCTCCTAAACAACACAAACTCAAAATGGCTCTAAAAACCT 

GCAGTGCCAATCAGGCTGTGCATAGCAATTGCAACCCGTCCAGCCAATCTGTGATGGATG 

TAAAAGGTCTGTAGCTCCAAGTCCACATGAAGTGAGTGTCTTCAATGTGTTTAGAATAGT 

GAAGTAATACTAAGGTCACTTCTGAATAATATTTTTGTC I HUM \ GTTTGTATGAACA 

ATCCATTGGTCACTTAGACCCCTATGTAAAAATAAAAGAATCCTCAAGGCATGAAAACAC 

CAGTAATCTGTAATCTGCAGGACTCTTCCCCTGTCTGGGAAAATATTTGCCACAATTTGC 

CTCAAGC AAATTC TTCTTGGTCACTCAAGGCTAAGAAAAAAATTCAGGATTGTAAGGAAG 

TGGAAANTTTTTTCTCTCTCTC M I I I I I II I I I I I I I U 

Sequence 573 

ATACGACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACG 

CATTTAAAATAAACCATATAATTTTACCAGTAAGAAAGCCGATAGTTAAGTTCAATTCAA 

ATT GGAG TTCATMGTTAGTAATTTAMTCCTTAGACAAAGTTACAGAAAGTGCATCTTC 

TTATTTTCCATCTTCATACAATGTTAA 1 1 I I I I I I I I GGTGTTTATACCTTTTAAAAAAT 

AAAAACAGCCAAATACTTAAGCAATATGTACCTCGGCCGAGGTACGCCAAGGGCGCCCTG 

GAGACGGAGGATCCTGCGTTAATC 

Sequence 574 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGTCTTATCG 

CTCAACAACAGAGCTCCCATTAATTGGGAACAGTTCTAACACTCCCCCAACTTCCTCTCC 

AACGCCTAAAAGCTTCAGAATGGTTATGTGGCGCAGACCAAAATTGCACGCCTGACCACT 

GAGGGCAGAGAGCTGCTGTGAATAGGCCCAGTCTTCATCGTTAGGAGCAGAGATCACCAG 

CAACCTCCTCCTCCACCGGAACCTGGATAGGAAGTTCTCCAGGGACTGC I I i I I GTCCTC 

TTTGCAAACAATGCCCTCCTTCTTCTGCTTCTCCATATCTTTGATTCGGGACTGGAAAGT 

ATCGATCAGATCAAACACAGACTTCATTGTTATTGGTACCTCGGCCGCTCTAGAACTAGT 

G 

Sequence 575 

CTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACACAAGGGAGAGGCTCCT 
GGGCAGTGACGGTGGAAGCTCCACTACCTCTGGGATTAGGGGCACTGTTTCCAGAGTCTG 
TAAGGTCGTGAGGATGTCACTTATGCTGTGCTCCTGTGGGTGGTTCTCCCCTCCAGGGAG 
TCTTTCCCCAGCCTCAGCTGGTTCCTGCCCAGCCTCAACCCCAGGCTTGCCTTCAGCACC 
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TCGCTCCAGGCCTGGCTGTGAGAGCCNAAAGCGTAACCTTGCCCGGGGCGGCCGCCCGGG 

CAGGTACAGGGGGTGCTGCAGGTGGCAGAGTGAATGTCCCCCATCATGTGGCCCAACTCT 

CCTGGCCTGGCCATCTCCCTCCCCAGAAACAGTGTGCATGGGTTATTTTGGAGTGGTANG 

NNGAACTTGTTTACTCATTGAAGCAAAMTTCTC 

NGAAGGAAAGGGTCAGAATGCGTGCCCAACTTCTTTACCCCCCAA 

Sequence 576 

GGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATGAACTGTAGAGTCA 

GAGTTAATGTTGACAAGGTATTTTTTGATTTAATGAGAGAAATTCGAGCGAGAAAGATGG 

AAGACAGCAAA GAAAA GAATGGAAAAAAGAAGAGGAAAAGTTTAGCCAAGAGAATCAGAG 

AAAGATGCTGCATTTTATAATCAMGCCCAAACTCCTTTCTTATCTTGACCATACTAATA 

AATAATTTATAAGCATTGCCATTGAAGGCTTAATTC 

AAATTGTTGTATATCACTAAAAGCATGAATTGGGAACTGCAATGGAAAGTCAAATTTACT 
TTAAAAAGGAAATTAATATGGCTTCACCAAGAAGCAAAGTTCMCTTATTTCATAATTG 
CTACATTTATCA 
Sequence 577 

TTAGGGCGAATTGGAGCTCCCCGCGGGGGNGGGCGAGGTACTCATGACGGANGTTGCCGC 

TCGCCCACTTGTGCAGCAGCGTACTTGTCATTGTCCAGGTCACAGGTCTCGAAAAAGCGG 

GTGGTGCAATGCTCCATGGGGATGAGGGGAGCACGCAGTGGAGCCAGCTCGGTGTGGGAG 

AGGTACCTGCCCGGGCGGCCCGAGGTACCAGAAAGATAACGGAATGTAAAAACTGGAATT 

ATGAAATCTGGAGTTATTATTTGGGAAATGGCAGAAAAAAAGACAGTAAATACGAAACAT 

ACTTGA AAAG AACAATGCAAMTTTAGTAAGATGAAAGGGGAAATCTATTAAMTGTTTG 

AG AACATTTTANAGCACGGGATTTGG NAAATTTCCCC ATAGGACAGATGG A 

Sequence 578 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCATAAAGAGTGTT 
GAAGTTTATTTATTATAGCACCATTGAGACATTTTGAAATTC 

AACAAAAAGCATTTGAATTGTATTTGGTGGAACAGCAAAAAAAGGAGAAGTATCATTT^ 
CTTTGTCAAATTATACTGTTTCCAAACATTT^ 

CACTTGCACTGGTTGGCAAGATTAGAACAAGAGGAACACATATGGAGTTAAA I 1 11 M M 
G 

Sequence 579 

CCGCGGTGGCGGCCGAGGTACAATCTCAGCTCACTGCAACCTCCGCCTCCCAGGTTCAAG 

TGATT CTCCT GA CTCAGC CTCCCGAGTAGCTGGGATTACAGGCATGCACCACCATGCCTG 

GCTAATTTTTGTATTTTTTGTAGAGACAGGGTTTCACCATGTTGCCCAGGCTAATCTTGA 

GTTCCTGAGCTCAAGTGATCCAACCACATCTTGGCCTCCCAAGGTGCTGGGATTCCAGGT 

GTGAGCCACCGCCCCCAGCCAACAATAATTTTTAAGGAGAAATACCAAATCCAGAATACT 

GATTACCTCTGGGGAAGAAAACAAAAGAACAGAATGAGAGAGAGATACACAGGGAACTTC 

AACTGGTTTGTCTAATATTTTA 

Sequence 580 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAACATCTTTGAAGG 

GATGGAGCTGCGCGGGGCTCCTCTGGTTGTCATCTGCCAGGGCAAGATCATGCTGGAAGA 

TGGCAACCTGCACGTGACCCAGGGGGCTGGCCGCTTCATACCCTGCAGCCCGTTCTCCGA 

CTATGTCTACAAGCGCATTAAAGCACGGAGGAAGATGGCAGACCTGCATGCCGTCCCAAG 

GGGCATGTACCTGCCCGGGCGGCCGCCCGGGCAGGTACCAAAAACCAACATGACACACAG 

GAAAAAATAAAAGTGCAATTTTAATATAGTGAATGTGATACATGTATAATTCCTCATAAC 

AAAATGGTCAAAACCTTTAAAAGATCCACAATAGATATCTGAAAATCTTAGCAATGCTGT 

ATATATTTTGAGGACTAAATGATGAATTTATATTCAAAATTGGTCAAATATATT 

Sequence 581 

CCGCGGTGGCGGCCGCCCGGGCAGGTACATCTCCTCGATCATCTCGCGGAGGGCCAGGAA 

GTAGGGCCCGTTTTCATTTGGGGCTCGGCGCCAGTCATAGGGAGCCCCTCGGACATCCTC 

ACCCCGTGTGTAGCCCCAGCCCACAAGGCTCTCCACCATGGTGTGGAAATAGGAACCCAC 

GCTGCTTTTGCTGGGGTCCAGGAACTCCAGTGAGAAGGTCTTCCCAAAGCCAGGGACACG 

TACCTCGGCCGCCCGGGCAGGTACTCCAGGACGAAGAAGGAAGATCATGCTTGATACTTA 

GATTGGTTTTCCCAGGGAAGAGGGCGGAGCAGAGCAAAGTCACTGTGAACCCTGGGCCAG 

GCCCTGGCTGGGCCAGCTCCTGAGAGCGTCTCGTGTTGCANACCCTTGCCCACTTTACCC 

A 

Sequence 582 
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CT ATAGGGCG AATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATCATTCCACTTCAAGA 

AATTTTTTTTTCTTTTTT1 IGGAGACGGAGTCTTGCTCTGTCACCAAGGCTGGAGTGCAG 

TGGCACGATCTTGGCTTACTGCAACCTCTGCCTCCTGGGTTCAAGCAATTCTCCTGCCTC 

AGCCTCCCAAGTAGCTGGGACTACAGGCGCACGCCCCCACACCCAGCTMTTTTTGTATT 

TCTAGTAGAGACGGGGTTTCACCATGTTGGCCAGGATGGTCTCAATCTTTTGACCTCATG 

ATCCACCCGCCT CGGCGT CCCAAAGCGTTGGGATTACAGGCATGAGCCACCGCACCCGGC 

CTCACTTCAAGAATTTTTTACAAGCACAGAAACTATATCTCAAGTGTATGATAAACTGGT 

ACTATAATACTATATTGGATTATAAATATACAAGCTATTTGAGGGGGGGTGATAGCTTCA 

CTACCTTCACCAAGCTTAGGAATATATATAATCTACTTTG 

Sequence 583 

ACTATAGGGCGAATCGGATCTCCCCGCGGNGGCGGNCGCCCGGGCAGGTACCACTTCTGA 

TGATGGAAGCAGTGACCTGGATCCCATAGAACACAGCTCAGAGTCTGATAACAGTGTCCT 

TGAMTTCCAGATGCTTTCGATAGMCAGAGAACATGTTATCTATGCAGAAAAATGAAAA 

GATAAAGTATTCTAGGTTTGCTGCCACAAACACTAGGGTAAAAGCAAAACAGAAGCCTCT 

CATTAGTAACTCACATACAGACCACTTAATGGGTTGTACCT 

Sequence 584 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTAAGTGAAAGAAGCCAGAC 

ACAAAAGGTCTCATACAATTCGATTTATATAAAATAGCAAGAATAGATAAAACCATAGAG 

ACAGAAAGCAGATTGGTGGTTGCTGGGGGCTGGAGGAAGGAAGGAATGGAGAGTATCTGC 

TTMTGGGATGGGGTTTCCTTTTAGAGGAGMCATTTTTTTGGAACTrr/^ 

GT GGTTA TGCAACATTGTTAACMTACTAAGTGGCACTGAACTTCTCACTTTAAAATAGT 

TM7TTTATGTTATGTGAAGTTTACCTCAATAAAAAAATTCTTAAAAAAAAAACTCANTG 

ATATCCAACCATTAAAAMTNTAAGTTACATAGCTCGTATATAATCAACTTACTTATTAC 

TGCACTATGAGGGGCCCCAGATATGTAAGACAGAAGTCCCAACC 

Sequence 585 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACAACATCCTTTCAA 

ATATTCTTTCTTTATTCTCCAATTCACTTTTCAGGAGAATAGATAACCTCAATCATATTG 

ATTCTCAGCCTAATGGTCTCCCTTACATAAATTATAATGACATAAGCATTTTGTATCACT 

TTATTGTTACCATCGACCTCCTACAGTTTGGGGGAACCCAATCATTCTTTAAAACATTAG 

AATGACAACTCCAGTTGATTTGGCATATTGTATAGGTGAGTGGGGAAAGACTTGTCAAAT 

CCCAGAAAGGGGGCGTGGCACCAGCACCACCCAACCAGAAGAAAAATCTCTCTTTGGTGC 

TAGTGTCATAGCTCCCTATCTGTTCCGCTAACCCATTAAATGATGTGGTGGAGGAGGATC 

CTTTCACAATCTTTGTTCATTTTTTTACAGA 

N AAATTC A I MM 1GGGT 

Sequence 586 

NCGGGGGGGGCGGGCCGCCGGGCCGGGTCCTTNAAAAANTNGAGGAANTTNTTAATGGNT 

CCCANTAGGGCNTTTNAMNCTNCAANCGGCTTTCTTTNTTNAATTNNAANGCTAATNCT 

CTNGCAAATCCAATAAAATCGGNCNTTTNNTATNCATTCCNGCTTTATATTTTNATATTT 

NAMNGANGCCNCCNGTAAATNCTATTTTNTTTGGCAAAATACCAAAAGGGNCAAAAAAC 

CTTTAAAAATTNCNGTAMNCATAANGTAGTTCTNAGGGGGAAANGTTATCATTCCCAAN 

GTCCTNGNGTNGTTCCTTCCTTAAT 

Sequence 587 

CCGGGCAGGTACTACTTTGAGGGCATTAAACAGACCTTTGGCTATGGCAGCACGCTTATT 

TTAGGGCCAGCGGCAGGGGACTCCAGCCTGCTGGACACTTCAGTCCAGCTCTCTCCTGAC 

TGGGGCTTGCGACTCACAGGATTGCATCGTCCCAGCTGCTAACTTGGGGCCGGGGCCCCT 

CCCTTCCACATATACCTTGGGTTTGTGCATGTTTTCTGCTGGGTGGGTTCAGAGGGCAGG 

GGACTCCAGCCTGCTGGACACTTCANTCCAGCTCTCTCCTGACTGGGGCTTGCGACTCAC 

AGGATTGCATCGTCCCANCTGCTAACTTGGGGCCGGGGCC 

Sequence 588 

CCGGGCAGGTACGTAGTMGAGAAACATTGCCACAAAATCCTCTTTTAAAAGCTCTTTCA 

GGACACTAAATTAAGTCATTCATTTCTGAGATTTTAAAGGCCAACAATGTCTCTATTTTC 

AGACTATGTCACAAATGACATAAATGCCAAAATTATAGATCTTAAGAAGACAACACATAA 

MTACAGACTTAGACAAMCAAATGACTAAAGCTGCATAGTTCTGCTAGACTTTATTCAA 

AACTGGCTCTATTCTTAATTATTTGAGAGACCTAAAATTCTAGTGTTCAAATGTAAATAC 

CCATCATATACTGAAAAGTCATCTTCAAATTTGT 

Sequence 589 
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AGGTACCACTGAATCCAAGGCTCTCTTGGGTAGCCTATGTGCCTCTTGGATGGTATGTGA 

ACAAGGTAGGGATGAAACTGCAGATGTCTGAAAAGAGCTTAAGCATCCTGACAGCTTGCA 

rTTTTTTTTTMTTrTTTn 1CCTCACCGTATTCCACATACCTATGGCTGGGGAAGGTAA 

TGTTACCTCTGCAAACAGATGCTCTGGTAAGAGAAGAAAGACACAGGGAAGGCAGAGAGA 

GCAAGACAGAGGGCTCAATCACAMGCTGGGACAAGGAGAGCAGCTGTTTTTCACACATT 

CCTGCTCCTCTCTCAGCTGGGCTCATGTTCACtCCGTTTCATTCCAATCATCAG 

Sequence 590 

CCGGGCAGGTACAGGACAGCCAGCGTCATCATTGCTTTGACTGATGGAGAACTCCATGAA 

GATCTCI I 1 I ICTATTCAGAGAGGGAGGCTAATAGGTCTCGAGATCTTGGTGCAATTGTT 

TACTGTGTTGGTGTGAAAGATTTCAATGAGACACAGCTGGCCCGGATTGCGGACAGTAAG 

GATCATGTGTTTCCCGTGAATGACGGCTTTCAGGCTCTGCAAGGCATCATCCACTCAATT 

TTGAAGAAGTCCTGCATCGAAATTCTAGCAGCTGAACCATCCACCATATGTGCAGGAGAG 

TCATTTCAAGTTGTCGTGAGAGGAAACGGCTTCCGACATGCCCGCAACGTGGACA 

Sequence 591 

CCGGGCAGGTACACGGAAATCTGGACAGTGCTCCACAGATTGATACATTAGCCTTTGCTT 

TTTCTCT1TCCGGATMCCTTGTAACATATTGAMCCTTTTAAGGATGCCAAGAATGCAT 

TATTCCACAAAAAAACAGCAGACCAACATATAGAGTGTTTAAAATAGCATTTCTGGGCAA 

ATTCAMCTCTTGTGGTTCTAGGACTCAGATCTGTTTCAGTTTTTCCTCAGTTGTATATT 

GACCAGTGTTCTTTATTGCAAAMCATATACCCGATTTAGCAGTGTCAGCGTATTTTTTC 

TTCTCATCCTGGAGCGTATTCAAGATCTTCCCAATACAAGAAAA 

Sequence 592 

CCGGGCAGGTACGGTGTGNTGTCGATGAGCACGATGCAATTCTTCACCAGGGTCTTGGTA 
CCAGTGCCCCTTTTCAGACAGANTTTGATTCGCTCTAGACI ITTTTTTTTTTI IAATAGG 
G AAAAAAN TTG ATAATTTTTC i I I I I I I CTACATGCACTTAAGACTAAAAACACAGGTTT 
GGATTAATM I I ATTNTGCTTNC I Mill I CCGCNTTTTCNTTNNCGCNCGAGNCCTGAA 
NGGNAAGAATGTTCCAGGGCAGGGGNAACCCACA 
Sequence 593 

CCGCGGTGGCGGGNCCNCCCGGNNTTNNAAANNNGGGGNNNTNTACATAGAGGTGAGGGT 
CATGCCCGTGTTTCAGCTCATNCAGTNCAGGGACTTCGCCTTGCCCCACCCATNATGGGT 
NAGGCCGGGAAGGGGGCCATTTTGGAAGCCAAG 
Sequence 594 

AGGTACTCTGAAGGTAGGATGTGCAGCTTTTTCTTAGGCAGGATAGATGTAACATAGATG 
ACTGCATAAAAAAGAGGGCAGAAAGGGCAGGAAGCCCA I rTTTTTTi I AACACTTCTNCA 
TTGTGTCTCTGCCTCACCCATAATGGCTGGTTTATTCAATTATTCATATATTCCCGTTTG 
TGTTTNCAANNNAGGTCTACACAGCACCAGCCTACAAAGGCAAGGTCCAATCCGTGTAAA 
AAAT 

Sequence 595 

CCGGGCAGGTACCGTCGCCCGGCTCTCCGCCGCTCTCCCNGGGGTTTCGGGGCACTTGNG 

GTCCCCACAGTCTNGGTCCTGCTTCACCTTTCCCCTTGACCTGAGTAGTCGCCATGGCAC 

AGGTTCTCAGAGGCACTGTGACTGACTTCCCTGGATTTGATGAGCGGGCTGATGCAGAAA 

CTCTTCGGAAGGCTATGAAAGGCTTTGGGCACAGATGAGGAGAGCATCCT TGACT TTTGT 

TGACATCCCGAAGTMTTGCTTCAAGCNCCCAGGAAAATCTTTTTGCMGCTTTTrAA^ 

AACTNTTGTTTGGCAGGGGATCNTTNTGGGNTGACCTTGAAATCAAGAAANTAACCTGGG 

AAAAATTTG 

Sequence 596 

GGAGATTATGTCTACTTTGAGAATTCCTCCAGCAACCCATACCTAATAAGAAGGATAGAA 

GAACTCAACAAGACTGCAAGTGGCAACCGTGGAAGCAAAAGTAGTATGCTTTTATAGACG 

ACGTGATATTTCCAACACACTTATAATGCTCGCANATAAGCATACTAAAGAAATTGAGGA 

AGAATCTGAAACAACAGTTGAGGCTGACTTGACCGATAAGCAGAAACATCAGTTGAAACA 

TAGGGAACTCTTTTTGTCACGCCAGTATGAATCTCTGCCCGCAACACATATCAAGGGGAA 

A 

Sequence 597 

CCGGGCAGGTACTCATGGAAAAGGAGTTCCCTGGATTTTTGGAAAATCAAAAAGACCCTC 
TGGCTGTGGACAAAATAATGAAGGACCTGGACCAGTGTAGAGATGGCAAAGTGGGCTTCC 
AGAGCTTCTTTTCCCTAATTGCGGGCCTCACCATTGCATGCAATGACTATTTTGTAGTAC 
ACCTAGAATCTTTGTGAATAATGTGAGGCCAGTTCTTCCATAAGGAAGGCTGGTTATGGA 
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TATTCATAAGGTTATTTCAAAGTTAATAAAGACAAAGTGGCAACTGTAGAAAGTGTTGCC 

TCCAATCTTGGTCCGTATTTCCAAAGC 

Sequence 598 

NCGGNGGCGGCCGAGGNNNNNNNNNNTGGTTGAGGGGGAGGAAAAACCCGCCACCCNNGC 

GGGAGNGCAGNAGCACGACCNCANTTTACNGCAACCGGCAGCNCCCNCNCANAANAGANN 

CNCCNGCCNCAGCCNCCCAAGNAGCNGGGANNACAGGCANGCGCCANCACACCCANTTAA 

NNTTTNTNTTCCCAGGGGNGAGGGGGCCCCACCAAGAAGGCCAGAATGNANNTGNNCCCC 

NGACCGCAAGGGANACACCCACCNCAGCCNCCCAAAGGGCNGGGANNACAGGNGNGAGCC 

ACCGNGC 

Sequence 599 

CCGGGCAGGTACAA AGCT ATAMGGAACGTTT1TAGAGAAAGCACTGAAGACACACATTT 
TGCTGACCTAAAAGATTTTAAAATGAATTAGAATAATTTACATCATATAAAGAGGTATTT 
AGTCTTTAAGTGGAGAAAGT 
Sequence 600 

NGNGGCGGCCGAGGNACNNCANAANTNNNTTCGGGGGNCANAAAAACCCCCCACNCCCCC 

NNNCNAGGACACCNCGNGGAAGTTTTTNAAAAGCCNCAGGGGGGGGGACCCAANAAAGNG 

AAGNGACCGNCNCNNCCAAGGGCCNCCAANANGNCCGANGNGANGGCAGCNGCNNTNNAA 

GANGANTNNTTNTNNNGCGGGGGNAACGNCCCCNANAAAAGAGGGACCNGANCGGNCNCC 

CCGCCCGGCGGAAGCACGAGGAGNNCNGNGNCCCCACGNCCACGGNNCCCGCNACNGNGN 

CCAACAANGNGNNNNNANCCGANNNCAGCANCAGGGANCNGCAGNCCAACGNGGAGCACC 

NGA 

Sequence 601 

CCGGGCAGGTACTGGTCTACAGGGACAAGCAGTCTTTAAAACGAAACTCACCTTCAGACC 

TCACTCTACGGACAGTGCCACACATAGAAAGATGACTCTGTCACTTGCAGATAGGTGTTC 

AAAGACACATGAAGATTAGAATCTTGCCAATGGCTGGTCGTGACCCTGAATGCCAACGCA 

CAGAAATGATTAAGAAAGAAGAAGAACGTTTGAGGGCTTCCATACGTAGGGAATCTCAGC 

AGCGCCGAATGAGAGAGAAACAGCACCAGCGGGGGCTGAGCGCCAGTTACCTGGAACCTG 

ATCGATACGATGAGGAGGAGGAAGGCGAGGAGTCCATCAGCTTGGCTGCCATTAAAAA 

Sequence 602 

AACNCCACCGCGGAGGCGGCCGAGGTACTGCATCAGTCGTGAGGAGGAGGGAAAACCCTT 

GGGATTGNAAGACTATTAAGATCCTAGMAAGTCNNTGGTAGNTNACNGGGATGNGACCAT 

NGAAGANCCCCCCGNN GNCA CTAATCACACAAATAACAANNNNGAGAAGNAGGCTATAAA 

ACAAATTNAGGNTGCTNTTTTCNANNGGGAAANAAACCAAAAAAAAAACAGAAAAGANGN 

GAATCCCCCCCAGNNANGGGGGAAAGGNAAAGCAACACCAAANAAAAGCCCA 

Sequence 603 

AGGTACCACGCACAGGAGCCGTCACTTCTCTTGATGCAGGCGAAGAACTTGGCCTGGTGC 

CCGTTGATGTTCTTNTNTGTGACCCANTCCATCCAGAGGCACTCGTCCGGGGAGGAGATG 

TAGCACCGGGATCATGGGGCAGCTCGTGATCTTGCACTCGCAGC C C ATNTGGTACCTGCC 

CGGGCGGCCGACCGGGCAGGTACTTACTTTCCCCATAGGAAGACAGACCATAGGCAAACT 

CTGTTTTGGGATCTNAACTCCATCACCTTTGTTTCAATAf I M I I 1 I I CTCTCTTGAACA 

AAACTGAGATAATTTAN 

Sequence 604 

CCGGGCAGGTACAGGCACAGCATATATTTGAGAAAACATCTTACAAATTTCATTTACTAT 

AGGTTTCTCAATMTCTTTACATTTAATCAATGAGAAAAGTGATTCAGTCTCTTGAATTT 

TAAGTTAAAAAAATTAAAAAGTATTTCCAGGGACTCTTAAAGCTCTCTCCCAAAGTATAA 

AATATTATGTACTGTGGGGTCAGATTCAGC^TACTTGAGGGAACGAAAGACTTTTCGTTG 

GGGCGTGACCCGTTTGATGTTTGGATGATC7TCAAGTGTTAGCAGCCCCGCTTTCTGTTT 

TTCTTTTATCTGAA 

Sequence 605 

AGGTACAGACATTTTCAAAGTTGCCAGTGTTACTTTAATTGGACTGCCTTCGTAATTCAT 

TGCCTCTGCTTCAACAATGTGCAACTCATCCTTTGCACCAGCCCCTAAACTGACCGTTCT 

TAAAGATAACTGGTGCTCATTTTCATCATTATCCACCTTAAAGTGATAATCTTTGTCGGC 

CTTTAGTTCACAACCGAAAAGATAGTTCTGGGGCCTCAGGGGGCTCATGTCCATGTCCAT 

CACAGAATTCTTTAATATAAATACGAACGGATGGGGATAAATAATATTTAAAACTTAGAT 

TATTCAATGCTTGCAAAAAA 

Sequence 606 



Page 84 of 635 



WO 01/51628 



Table 1 



PCT/US01/00798 



CCGGGC AGGTA CATATGATCCTTAGCCACCAGGGCACAAGCTTACCAGTAGACAATACAG 

ACAGAGCTTTTGTTGAGCTGTAACTGAGCTATGGAATAGCTTCTTTGATGTACCTCGGCC 

GCCCGGGCAGGTACCTTAACATTCACATGGAAGTAGTAAAAATAAGATTCTGGGTGCAGT 

TCTCCAATGACAGGAAAAAAAACAAAGAGAATTTGAAGAATACCGTCAGAGACAAATACA 

TTACAACCAAM7TGACTTCAAGGCACTTTTGAAGGAGATCAAATTTATAACAAAATAAT 

TTAGTGAAAGTGAAAGCTT 

Sequence 607 

CCGGGCAGGTACCTGGACCTGCTGTCCCAGCCCTGCCGCGCTGTTTACATCTTTGCCAAG 

AAGAACGACATTCCCTTCGAGCTGCGCATCGTGGATCTGATTAAAGGTCAGCACTTAAGC 

GATGCCTTTGCCCAGGTGAACCCCCTCAAGAAGGTGCCAGCCTTGAAGGACGGGGACTTC 

ACCTTGACGGAGAGTGTGGCCATCCTGCTCTACCTGACGCGCAAATATAAGGTCCCTGAC 

TACTGGTGCGGCGCACCCTTCAGGGTCTTGAGATTGAGCTGCAGTCACAGCTGAGCATGA 

AAGCTGCCTTGGAAGACACACTGGC 

Sequence 608 

CCACCGCGGTGGCGGCCGAGGTACCCAGGATGNTTTCTGGCAGGAGGGAGCNTGTGTTCC 

TGTGACCTGTGACCCACCTCCACCAAAATTNCATGGGCTCTACCAGTGTACCTGCCCGGA 

CTTCCTCTCCTACCTACAGTCCCTGCTCCTATTCCCACCTCAGCAACTGAAGCCGCCCCC 

AGAGCTCTGCCTTGAGGATATAAACACATTTAAATTACTGTCATATGCTTCCTATTGCAT 

TGAGCATGGTGATCTGGAGCTAGCAGCAAAGTTTGTCAATCAGCTGAAGGGGGAATCCAG 

ACGAGTGGCACAGGACTGGCTGAAGGAAGCCCGAATGACCCTAGAAACGAAACAGATAG 

Sequence 609 

AGGTATTAACACATGATGGAAAAGTCATTGTGACGCCAATGAATTTCATTGAGTATAAAC 

TCATCTACTTCAAATTTATTTTATAAGACAACCTAAGATACTCMGATAATTATTTAATG 

GTTAGCTCTTAAGTTGAATTGGTCTACATAATGCGTGGGAAGAAAACCAGATTTTTAGCC 

TTCTTGCCAAATCCAGACCTCTGGTTGATTTTTCTTTGACAGAAGATGCAAGTTATTTTC 

CAATTTCACAATTAAATGTATTTAACATGAACATTATTTTGCTTTAAAM 

TGTAGGAGAATTATAGCCAGTCTTCAGTTATAACCACT 

Sequence 610 

AGGTACTTCGAGGTCTTCGGCGAGATCGAGGAGGCGGTGGTCATCACCGACCGGCAGACG 

GGCAAGTCCCGGGGCTATGGATTTGTCACCATGGCTGACCGGGCTGCTGCCGAAAGGGCC 

TGCAAGGATCCCAATCCCATCATTGATGGCAGAAAGGCCAACGTGAACCTGGCATACTTA 

GGAGCAAAACCMGGATCATGCMCCAGGTTTTGCCTTTGGTGTTCAACAACTTCATTCA 

GCCCTTATACAAAGACCTTTCGG 

Sequence 611 

AGGTTTTGCAGCTCGTGC CATC ATTTCAGAGCTGGTGAGCATTTCAGAACTAGCTCAACC 

ACTAGAAAGTGGCACCCATTTTCCTCTCTTCCTAC7TTGTCTTCAGCAGTTAGCTAAATT 

ACAAGATCGAGAATGGTTAACAGAACTTTTTCAACAAAGCAAGGTCAATATGCAGAAAAT 

GCTCCCAGGTAAGAGAATGCTGACTTGTTTGTTTTG I I I I I I I A ATATTATATCTAG AGA 

TTTCAGGTACCTGCCCG 

Sequence 612 

CCG GGCAG GTACANATGN GNNGTGTCTN NCAACTTTCATN AGAAAANGCCATATCTATAC 

CAT ATTTT ATTCGGAGTCACTGANGATGTAATGATATATCTTTTTCATTATTATAGCAGA 

ATATTTTTATGGCANGATATTTAGANGNCTTGANCATNCCTATTAAAATANTGCCAAACA 

CCAAATATGAATTTTNTGATGTGCNN 

Sequence 613 

AGGTACAGAACTTGGTCATAATATCTTGCATTTTATAGATTTATTAAAGATTAGTTTCAA 

GTTCACATTCGCTATTCAGTTGTAAACCGAATGGATGGGAGGGGAGAAAATATAAGCTCT 

CCACACAGGTATGCTCCTCTCTTTTCTGAGAGAGAAGGCATGGGATTTTCAGCATAAATT 

CCATGTTATGTGAGTGCTGTTTGAGTTCTGAAGTTCCTATCAATATCTGTTCCTGCAAGT 

GATCTCTGTAAGACCACCTTACATGCTGGTCTTAGTTATTGTTAAAATTGCMGGTTTCT 

TCACACCCTCTTTGATAAGAAGTGTTTAGCTGGCAGAGC 

Sequence 614 

CCGGGCAGGTACCTCTTGAGTGTCCCTTGAATTGGTAACCGGTTTGTTGTCATTATTTTC 
TAAGCGAATATGCCGTAATtGGTTATTGGGAACATCTTTGACAAAGATCCATTTAACTTC 
AAATTTGCCCTTCCACTTATCCTGAGACCAGACACCAGCATACGCATTATAGTCCACAAC 
AGACTTCATTTCAGCCACTCCACAAAAATGTCCACTGCCATTCACACTGAAGAGTAAATA 
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GAGTGGGCCTTTCCCATTCAGGGAACGGTAAGCTGCATCCAAACGCTTATTACCATGCTC 

AGTACCT 

Sequence 615 

CCGGGCAGGTACTTTTGATTGTGCACGCTTTTAAATAGAGAGCAGAGTTGCCCACTTGAA 

ACTACTCTCTTGCATGGGATATTTCAAGCTGTTTTACTATGGGCAAGGAGCAGGGACCAA 

AATGCTGCCAGGGCTTAAAAAGAGCCGTGATCAGATTAAACAGAAGTTGGAGAAGTGGAG 

GGATGTGGGCCACACACGAGAAGAAA7TCAGGAAGTAAGAAGTAAGAGTGACCCTATTAT 

GCTTCTCAAGGACAGGATGGTGAACAGCAATCTTGCCAGTGTGGGAAGAACTAAAGGAAA 

TTGATGTGGAAGTGAGGAAGGAGATTGAGGATGCTGCCCAGTTTGCCAC 

Sequence 616 

CCGGG C AG GTACATGTAC ATTATAAAAAAAAAAGTTCAACTAGTATG AAAGGG TTATAAA 
GTMCAGAAAAAAAAAAAAAAAAAAAAAAAAGGANTTAATCACAGTGGAAGCTTGCACGG 
TGGGCCACG 
Sequence 617 

CCGGGCAGGTACAMTGTGMGAAAGCTTTGTGAAAATTCCTGGCGAGAAGGACTCAGTG 

ATCTGCCTTAAGGGCAGTCAATGGTCAGATATTGAAGAGTTCTGCAATCGTAGCTGCGAG 

GTGCCAACAAGGCTAMTTCTGCATCCCTCAAACAGCCTTATATCACTCAGAATTATTTT 

CCAGTCGGTACCTCGGCCGCCCGGGCAGGTACCCAAGGGATGTTTCCAAGCTTCTTGATC 

TTTAATCTTTTAGCAACCTTCTGATACTGGATTTACCTTCCTAGGTGTGGTGCCCTCCTT 

CTGGAATTTAGATCTCTGGGGCAAACCCTGTCAGGAT 

Sequence 618 

CCGGGCAGGTACCCTGTGGCAAAATTAGTCACTGGTAATGAGAAAGATATCACTGAAAGC 

CTCAAGTGCCTGAGAAACAGTTTACTCATCCATGGGATCTCGCCAATTGTGAGGAAACAG 

CTCAATGGCCATTTCCAGTTATAAGCAGCTTATTTTTACTGATTGGACCTGGTTACCTAT 

CATTTCTAAAMTAACTTCTGATACAATTTGTACTTCCAATTTATMTGMTACTTT 

AGATTTTAGGTAGGAGGGGAGCAGAGGAATTATGAACTGGGGTAAACCCATTTTGAATAT 

TAG C ATTGC CAATATCCTGTATTCTTGNTTTAC AT 

Sequence 61 9 

GGCGGCCGAGGTACTCAANACATCTTTCAATAAAAATGGGGNTCGGCTTCTCTGGAGTCA 
CGGACAGTTGTTAAGGGCACCCCAAAANAGTTACTCTNCCAGGTTGCCTTTGNGGTGGAT . 
GGCCGGNGTTTGGGCTTTGGTTTATTAGNGTTCCTCCTTAGGCTGCGAAGAATATTCCTC 
NCTCCGAGGGTCTTCGNCTGTTTCGTATGGAATGACAGCATTGTCCCCTTTATAACCCTG 
GGATGCCTGATCCTNCTCTTTCTTNCGGATGGGCCCCAGCTCATCATCACTCC 
Sequence 620 

ACTATAGGGCGAATTGGAGCTCACCGCGGTGGCGGCCGCCCGGGCAGGTACTGAACAAAG 

GCATAGCCCTTGTGCACAGAACAGCCGGCCACACGGCCATACTTAGAGAAGATGGTCTCC 

ACATCTGATTTCTTCACCAGAGCTGTGTTGAGGTTTCCAATGAAGACTCGAGAGTTGATG 

GACTTGGGGTCATTCTTGTTGGTTACATTGCTTGCCTGAAGCTTCAAGGACATGGTGCCC 

ACTTAACAAACATTTCTTGAGTGTCCTCCTCTGTGCTGGCACTGTGGAGATGCTGGGAAA 

CACCGTGCTCACCAGTCTTCTGCTCTACTTGCTTTTCAAGAAGCCCCCTCTCTGGGAACC 

AGTAACAATGATGAGCTTAGCCAGCTGTTTCCTCCT 

Sequence 621 

TTTAGGGCGAATTGGAGCTC CCCGCGGTGGCGGCCGAGG I > Ml II I I ll ) Hi I I I ) I I 
I U I I I M I I I I I I H II II I I M I I AACAAANAGTTTGATTTTATTG ATATTN AAATAT 
ATNAAATNTTTCACTGAAANACATGGTTNACCATCCTCCCCCACCCCCACAGGGGTTACN 
TTNTAAAACCAAAGCCCNCGGCCTCCCACCTNCTGACTNCTTTACCAACTGGGNGAGGAA 
AGGGACAATGGTNNCCCANGGGAAGGGCATGGNTGGCNCTNNGGNACCTGCCCGGGCGGC 
CG NTTTAAAACTAGGGG ATCC 
Sequence 622 

CGACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAGGAGAGCACA 
ATTGGAGCGGCCTTCCTCACACAGACTGTCTGCCTGGATGACACAACAGTCAAGTTTGAG 
ATCTGGGACACAGCTGGACAGGAGCGGTATCACAGCCTGGCCCCCATGTACCTGCCCG 
Sequence 623 

CCGCGGTGG CGGCCGA GGTACAT TATATAAGGGAI I I I I I t I AAG7TGAAAACAACTTTC 
TTTTCl l \ \ IGTATGATGGTTTTTTAACACAGTCATTAAAAATGTTTATAAATCATAAAA 
AAAAAAAAAAAAAAAAAAAAAAANAGGGGGGGACCTGCCCGGACCTGCCCN 
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Sequence 624 

CCGCGGTGGCGGCCGAGG \ I ' l H M 1 I I 1 1 I H 1 I 1 1 ) M 1 I H H 1 M M 1 1 11 I 11 M 
1 1 ' n I M M M 1 M 1 i 1 1 1 I 1 1 1' 1 1 1 > > N M M 1 1 M I M 1 I I M i I 1 I M M | M 1 1 
TTTTNNNNAAAAANNNNN 11 111111 fN NTTNNGGGGGGGGGNNGNCAAAAAAAAAAAAA 
NNNNCCCCCCCNNAAAAGN I I I 1 1 I I I IANNGGGNCCCCNNAAAAAANNCNGGTNNTTTT 
TTNGGGGNAAAANCNNNNTCCCCCCCTNNTT1TTTTTTTTT 
Sequence 625 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAACGGCNNNAATGACA 
ACACTTAATTCCAGCTATACGNGGCAAAAGATGTTATGGNAGGGAANAGAGAGGTTNAAA 
TACANATGAAATAAAGGGTCACCATCTCCTCA 
Sequence 626 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACACAAAAAA 

MCATTTGGTATCAATMTTTGGTTGTGCATTCATTTATTCAGTCMCAAATATTTAGCT 

GAGCACTGGCTAGCrGCCAGGTATTGCACTAAGGACCCAAAGATGGAAGAGATGATGTCC 

CTGCCCTCATGGAGCTTGCAGTCGTGTTGAGCAGACTGTCAAACAGATTTAGGTAAGGCA 

ATGTGACCAGTGCTATGATACAAACAGGATGCTACAAGAGTACCT 

Sequence 627 

GGGCGAATGGACTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATTTCGAGCAGCGGAAG 

AAGGAGGAAGAGGAGCTGGTTGCCTTGAAGGAGCGCATTGAGCGGCGCCGGTCAGAGAGA 

GCCGAGCAACAGCCGCTTNAGAACTGAGAAGGAACCGCGAACGTCAGGCTAAGCTGGCGG 

AAGGAGAAGATGAGGAAGGAAGAAGGAAGAGGCNCAATGAAGCCGGTGCAAGAGGATTGA 

TGCCAAGAAAAAAGAAAGGGTGCTTGTCCAACATGGGNGGGGCCCTATTTTGGGCCGGCT 

ACCCTNGGTTCAAGNGCAGTAACCAGTAAAGCCNTGGNTAAAGGCGGTCAAGACCGGGGG 

GCCGGGTAGATTNAAAGGGTGCCGCATTCCCTTCTTTCCCGGAGCCGTTAANNAAAGCCC 

TTCTGGGACCATTTGGACTTACCATTGGGGGGGGAAGGGAAAACCAGGCTTTCNCGGGGG 

ANGAAAAAGCCCCCAAG 

Sequence 628 

GAACCGCCAGGAGCCAGAAGGCGAGCTTGGCGAGTTTTGTGNGCTTTTGACCTGCCCGGG 

CGGCCGAGGTACTTGTTATCMCACATTTGTATCAGAGTTGCTTTTCTAATCTTGTTAAA 

CTGCTTATTCTAGGTCTGTAATTTATTAACTGGCTACTGGGGAAATTTACCTNAATTTTT 

CTGGATCTAATCTGNNATTTTTTCATTTNAACCTCCA 

Sequence 629 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGNACCANNTNCTTNTTNG 
AGNNNCNTACNAGNTGTACNNATCGCNGNNCNTAAAGANNTNCCCNNACCACNTCCTNTT 
GNGCTTNTTTATTNGNATTGTTNGAACANTCTNNNNTNTGNGANTCNTTACATTGAATCA 
TCTNGNCNANTTNATCCN AN NCTTNACTGACTCAN ACTCNTTATGCTGAACTN NTNAATN 
CAAAATC N CTATTGCTTTAGG ACTCTGGGAAGCN G NTGATAAAAAGATAACACGAGCTAC 
TGATATGGGCAGGANATGTTTNCTAANCAAATTGGAGAAATATAAACATGAAATGTGGCA 
ATGATCC 
Sequence 630 

CGGC GGTG GCGGCCGAGGTGC 1 1 11 I I 1 1 I I H 1 i M I II I 1 I I hi I I 1 1 \ NCCAAAAT 

TAGCATTTTATTC AAAACGCTGGGGTCAGAGG NGTATN ATTTATATTC NGG N NGGAATNG 

TATTNACCTTTNATAATTATTATTAGNGGGGGTATGGCTAANGAGTTCTGATTCACCCAC 

GAAGGTTACATTTATGCTGAATACATTNGNAAATNCTCAGTTTTATNCNGNATGTATATG 

ATGGCTACTCTAAAGGTTNGGATATATGTATNGNAATTAGAATTGTNGGCATGATNACAT 

TTCATTNGNGCCAAAAATATTAAAAANG 

Sequence 631 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAACCAGTTGAAATTTCCAG 

TAMGCCTGCTGATCTTAAGAAGAGAATAGAATCTTTAATTGACCGGGACTACATGGAAA 

GAGATAAAGAAAATCCAAACCAGTAGGCTGGAAGAGGATCTTGGGCGCCGCCAGTCTTTA 

GCACCAGTTGGTGTAGGAGTTGAGACCTACTTCACAGTAGTTCTGTGGACAATCACAATG 

GGAATCCAAGGAGGGTCTGTCCTGTTCGGGCTGCTGCTCGTCCTGGCTGTCTTCTGCCAT 

TCAGGTCATAGCCTGCAGTGCTACAACTGTCCTAACCCAACTGCTGACTGCAAAACAGCC 

GTCAATTGTTCATCTGATTTTGATGCCGTGTCTCATTACCAAAGCTGGGTTACAAGGTGT 

ATMCAAGTGTTGGGNAGGTTTGAGCATTGCAATTTAACGACCGTCACAACCCGCTTGAG 

GGAAAATGAGCTTAACCGTACCTTGCC ■ 
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Sequence 632 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCTGAGG 

ATCGATAAAAGAGGCAAAGTAAAAGGGACCCAAGAGATGAAGAATAATTACAATATCATG 

GAAATCAGGACAGTGGCAGTTGGAATTGTGGCAATCAAAGGGGTGGAAAGTGAATTCTAT 

CTTGCAATGAACAAGGAAGGAAAACTCTATGCAAAGAAAGAATGCAATGAAGATTGTAGC 

TTCAAAGAACTAATTCTGGAAAACCATTACAACACATATGCATCAGCTAAATGGACACAC 

MCGGAGGGGAAATGTTTGTTGCCTTAAATCAAMGGGGGATTCCTGTAAGAGGAAAAAA 

AAACGMGAAAGAACAAAAAACAGCCCACTTTCTTCCTATGGCAATAACTTMTTGCTTA 

TGGGATATAMGAACCAGTTCCAGCAGGGAGATTTlTrrAAGTGGACTGNTTTCTTTCTT 

CTCAAAATTTTCTTTCCTTT 

Sequence 633 

CTACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTCGGGACCGGGGCCGCTG 
GGGTCTGGACGGGGGTCGCCATGATCCGCTTTATCCTCATCCAGAACCGGGCAGGCAAGA 
CGCGCCTGGCCAAGTGGTACAATAAAT I I I I I I I GGTCAAATTTAAAAAAAAAAAAAAAA 
AAAAAAAAAAAGGAGGGGGGACCTGCCCG 
Sequence 634 

GGCGA ATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCANGGN' l rTTTTTTTTTTTTTn 
IIMIMlll Hl l lMIHlMllimilHlllllHl l lllllllHIU I lllH 

rTTTTTTrnrn iattaaaaccccnnccnaaaaaaaaaaaagggancccccngnnnaaan 

NNNAAATTANAANAAAAAANTNAAAANCGCCCCCCCCANNCCCGGGGGGGGGCCCCCCCC 

CCAAAANNTNNNTTCCCCATAAGAAANNNNNNAAAAAAAACCNCCCCCCNGGGGGGAAAA 

AAAANGGAAAAAAAACNNNTTNTCNNGNGNGAAAAAAAAAGGGNTTANNNCCCCANAAAA 

GGNCCNAAAAAAAAAAAAAAAACNACGGGGGGGGNIAAAAAAAAAAATANANAAAAAGCTG 

NNGGGGGGGNNCCCCCAAAAAGGNGGGCCANCNCCCCNNNNCNAAAAATTTGGGG 

Sequence 636 

CACTACTATA GGGCAAT TGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCTAGTCGGTTAA 

TTTACACTTTA I Mill I AAAAAGTTGATTTAAAAAAGAAACAACACAAGTTTAGAATCC 

ATAAAATGTCAGCAATGCTGATGTGCACTGGACTGAAACATCTTGATCATCTTCTGATAG 

AAGTAATATTCCATACAAAAAGATTCTTAGATTCCA I I I I I 1 GCTTCATTATTGTTTGTG 

GCTTGCTTTCTTTGAGCAATAAAGGGGTACCTGCCCGGGCGGCCGCTCTAGAACTAnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnGAGGGTTAATTGCGCGCTTNGGCCGTAA 

TCATGGGTATTAGCTGTTTCCTGTGTGAAAATTGT 

Sequence 636 

CTTAGGG CN ATTGGA GCTCCCCGCGGTGGCGGCCGAGGTACAAGC1 1 11 1 I ( 11 1 1 II N 
1 1 M M I I f I 1 1 I 1 1 I I i I I 1 1 (I M M II I I I I I I I II M I I I I I II 1 1 1 1 I I 1 1 1 1 1 I 
TTTTNCCAAANNNNAAAAAAMNCCCCCCNCCCNTTTTTNANTTTTTTNAAAA^ 
GNCNTNNNAAAACAAANCGGMGGNTNNAAANTNCCCCCTTTNACNTTTCCNAANAAAAA 
AAAAAAAAANTTNCNNNNCCCCAAAAAAAAAAANCCCCCAAAAAAAAAAACCCCCCCGGG 
GNNAGNCTNAAAAAAAAAAAAATGGGGGGCAAAAANANNNCCCCNAAAAAAAAAAAAAAA 
AAMCNCCCCCCCANAAAAAAAAAAAAANNGGGGNNATAAANNTTTTTTACNCCCAAAA 
Sequence 637 

ACTTAGGGCAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTAAACACAGAT 

GGGCACACTCTAGATATAGAACACGTTTCTTGCTGTGGTTGTGTGACAAATGGCTAGAAT 

TTTAGGAGGCTCTGGGACTGCACACTGTATATGCAATCCATCTCAGACTTTGGGATGGGA 

ATCCATTTCATCATCCAGCATGGCGAAACTTGGTTCATTCGCCCCAATGTGAGTCCTTCC 

TGGGTAAAATTAAAGGCAGACCGGNTGGGTGCTAAGGATCCCTTGGTGTGGAGAGGCCAT 

TTCCGTTGAGCAGCTGGACTGTGAAATCAATACCAAAGGGGATGGTGGGGTCTGGGCGTT 

GGGGAATTCTTTGACAACTTTCTTGTCCTTCCTTAACCATGGGCTGTATCCCCNGGAGGG 

TCTTTCCGGCTTGTGGTTCTTGGGCCTTGGGGGGGTC 

Sequence 638 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCGCCGCTGGGCCTGCAGGTCTCT 

GTCGAGCAGCGGACGCCGGTCTCTGTTCCGCAGGATGGGGTTTGTTAAAGTTGTTAAGAA 

TMGGCCTACTTTAAGAGATACCAAGTGAAATTTAGAAGACGACGAGAGGGTAAAACTGA 

TTATTATGCTCGGAAACGCTTGGTGATACAAGATAAAAATAAATACAACACACCCAAATA 

CAGGATGATAGTTCGTGTGACAAACAGAGATATCATTTGTCAGATTGCTTATGCCCGTAT 
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AGAGGGGGATATGATANGTTTGCGCAAGGGTNTGCACACAAACTGCCAAATNTGGGNGAA 

AAGNTTGCCCTAACAANTTTTGCNGAANATTTTGTCCCTGNCCGGGNGGGCCNTTTTAAA 

AAACTAGGGGGGCCCCCCCGGGGCNGGNGGNAATTTTNAANANAAANTTTTANNNGNCCC 

CCCNCCCCCCTNGGGG 

Sequence 639 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAAAAAAAAAAAAAAACCTCGGCCG 

C CCGG GCAGGTGTCTTGCGCCACCATGCCTGGCTAAI 1 I < I GTATATTTAGTAAAGATGG 

GTTTTCGCCATGTTGGCCAGGCTGGTCTCAAACTCCTGGCCTCAAGTGATGCACCTGCCT 

CAGCCTTCCAAAGTGCTGGGATTACAGGCATGAGCCACCGTGCCCGGCCAAAGTCAGCTT 

TC AAAATCC AAGCC ATMTTGGTGAG GG GGG AGTTTCAGMTTAC ATATAAAMTTAATA 

TTTGAAAAAATAATTCTGAMTTTCGAATTTAAAAACAGATGTGCTGCTTCTGGGTG 

GTAGTAAAAGTNTAGGAAMGGAACTGTTTTCCTTAGAAGCGGACTGTGGAANGCCTTAT 

GTAGANTGTCAAAGGGCANCANGAGCCCGNGTTTTTAAATGGCATTACTGTACCCTTCGG 

CCGGTTCTAGANCTTTrrGNAANACCCCTTTACNTNCCCNAATTTTNATTNTNAAAGGCT 

TTNTCGANTCCCTCNTANTTCGNGGGGGGGGGCC 

Sequence 640 

GGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCCGAGAAAAGAGCTAGGGTAGGC 

AACTAAAACTTACACAGTGCCAGTCTCAGGAGGTCAGTAGCTCACAGAACTCAACAGATA 

AACTGGATTAAAACTTAAAAGTCTTCTTTCTATTTGAGCCCATAATGACTATTTT 

TGGCTCTTTTGCTGCTGCCTATATATAAA I III! I ATTAATTTTCTTGTATTGGGAAGAT 

CTTGA ATACGCTCCAGGATGAGAAGAAAAAATACGCTGACACTGCTAAATCGGGTATATG 

TTTTTGCAATAAAGAACACTGGTCAATATACAACTGAGGAAAAACTGAAACAGATGTGAG 

TCCTAGAACCACAAGAGTTTGAATTTGCCCAGAAATGCTATTTTAAACACTCTATATGGT 

TGGNCTGCTG I Mill I GNGGGAATAATGCATTCTTGGCATTCTTAAAA 

Sequence 641 

ACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACCTCAGGGGTGGTCTGT 
GGAAGCCTTAAACTCTCCACACTCAGAGTCCTTTGTTTCCCCAGAGGCTGTTGCAGAACC 
TCCTCAGCCAACGGCAGGTATGTCTCCCTGCAACACAGCTGTTGTACCTGCCCG 
Sequence 642 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGNCAGGNACAAGCTNTNT 
I I ( I I IN ( M i I I I I I I 111 I I I CGGAACCGTACCAGAAAATTTTATTAAAAAATTAAAA 
CTATT 
Sequence 643 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGTCAACATGCTAGA 

GGTCCAGCATTTCACAAATACGCCATGCAGTTACATGAGGACTGCTACAAATTTTGGAGC 

AATGGCCATCAGCTCTGTGAGGAGAGGAGTTTAACTGATCAACACTGTGTGCACAAATTT 

CACTCATTACCTAAATCAGGAGAAAAACCAGAGGCTGATAGAAATCCGCCTGTGCTATAT 

CACAATAGCCGAGCTCGATCTACTGGTGCTTGCAACTGTGGAAGGAAACAAGCACCTCGA 

GATGATCCCTTTGATATCAAAGCAGCCAATTATGACTTCTATCAGCTTCTGGAAGAAAAG 

TGTTGTGGAAAATTGGATCATATCAATTTCCCAGTATTTGAACCAAGTACCTGCCCG 

Sequence 644 

TTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGGTACTTCAGGGCTGT 

GAGGAAGGGGTGCCTGGAGTTCTGCAGGACGCGGTTCTCGGTGAGTGTGTGGGCCACCTC 

GTCCTTGGCCACGATGACTTCCTTCTTGAGGATCTTCATGGCCGTAAGTAGCGGCCTGTG 

GCCTTCTCCTTCACCAGGATCACCTTGCCGAAAGTGCCCTTGCCCAGCAGCTTCAGGTAC 

CACTCTTCCAACTCCTGAAGGAAGCTGGCTGTGCCCGTTGACTCTGACACCTTTCGAGTC 

GCCAGTTTGCAAAGCCGCTCAACAAAGCCAGGAATGCTGAGAAGGGCCGCTTGATTGATT 

TCAGCTGCAGAAGGCCCCAAACTCAAGCTCTTCTTCTGTTGAGCATTTTTGGCAAGAAGC 

GTGTGCTTGTCATCATTCCCTGTATCCATCTCTGAAAATGGTGACAGCCAAGAATCCGGC 

AGAAATTAGCGAGCTGCTTGCTGATCGAAATTG 

Sequence 645 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCACGCAGCAACGCGGAG 

GAAACGGGAGTGAACGGAGAGCGTAGTGACCATCATGAGCCTCCTCAACAAGCCCAAGAG 

TGAGATGACCCCAGAGGAGCTGCAGAAGCGAGAGGAGGAGGAATTTAACACCGGTCCACT 

CTCTGTGCTCACACAGTCAGTCAAGAACAATACCCAAGTGCTCATCAACTGCCGCAACAA 

TAAGAAACTCCTGGGCCGCGTGAAGGCCTTCGATAGGCACTGCAACATGGTGCTGGAGAA 
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CGTGAAGGAGATGTGGACTGAGGTACCTGCCCGGGCGGCCGAGGTACCTGCAGAAGGCCT 

ACAGGGTGCCAGGCACTTCTTTMTGTGTTCTTTCTTTATGTGATTATTTGATTAATCTC 

TGCCTCCCCCACTAGGACTGTAAGCTCCCTGAAGGCAAGAATCCTGTGCTTATGCTCAAT 

ATTAGCTCNTCCCTTGGCACAGAGTAGGCACTTCAACAAATGCTCCCCA 

Sequence 646 

TTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCCATCTACAAAGA 

CCTCTATTACATAAACACTGCTTCTTACAGGAAACAMCCTCTTCTGGGATCTCC7TTTG 

TGAAMCCAGTTTGATGTGCTAAAAGTAAAAAGTCTATTTTCCAGTGTGGTCTTGTTCAG 

AAGCAGCCAGATTTCCAATGTTGTTTTTCCCCTCCACTCAGAAACCCCTGCCCTTTCCCT 

TCAGAAAACGATGGCAGGCATTCCTCTGAGTTTACAAGCAGAGACTCACTCCAACCCAAA 

CTAGC TGGGAGTTCAGAACCATGGTGGAATAAAGAAATGTGCATCTGGTCTCTTCTGTTG 

TTTTTATTTCATATCAGATTAMTTTCTTTACCATGTTGGCTAAGTCTAAATATTAGAGA 

TGAGGGCTGTGCCTACTCCCTGGCCAGCTCTNCTGATAGCCTATGATGGGGTTCCAATGG 

GGAAATGACTCTTTACTATTAAAAGACAAGGGAAAGGC 

Sequence 647 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGGGATTACAG 

GCGTGAGCCACCATGCCTGGCCAGAAATCTATGTTTTCTTAGAACATGTGGAAGAAGGAA 

AAAGACAAAAAAGGAAGTCTGGATTCTGAGGACCACGTCTCACCCAGGGTGACATCAGGA 

ATGGGTGCTAGCCTCTGCAACACGACACCCAGTCTGAAGAAGCTCTATACAGGGTACCCT 

NGGNCCGTTTTAGAAACTTAGGTGGGATCCCCCGGGCTTGCAGGAAATTCGAATATCAAA 

GCTTAATCGATACCGTCGMCCTCGAGGGGGGGGCCCGGTANCCCAGCTTTTTGTTTCCC 

TTTAGGG 

Sequence 648 

CCGCGGTGGCGGCCGAGGTGTAGGGTGTTATCCCTGTGACATTGTCTCTTTAGTTTGCTC 

TTTCAAGAGATACTTACAGATGTTGAGATGGCTGCCCTGCATTTCCAGCTAATCTCTTCT 

GCTCTAAATATTTAAAAACAGTTCTTCTCAAACATTTTCATTCAGATAGCTTTCTGAA^ 

TTCCCTATCCCTCTTTACCATAATTTTTTAMTGTAGCCACATTGTAATAGTAAACTTCA 

TATATAATGAGTGCTTCATATTTTTGTTATGGGAAAGCAATATATTATGCAGCCAGTCTG 

TAGAAACATTCAGATCCCTCTTCCTTTACTCAAATACAGTTTCAAAAGGAAGACTCATGA 

GAAAmCATAAAATACAANGTTTTTAGATGTTT 

NTTTTCCTGCTNTGGAGNCNCTAACTCTGGAAAATTTAAAAATATGTAAACNAAAGGGGG 
GTTT 

Sequence 649 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGCTATACATG 

CAAAACTCTTAGCACCTGAATTTGTAAACATTTACACGTATCCGTGTATTCCAGAATATG 

AAGAAAAGGACCATGAAGTTGCACTCATTTTTCCTGGACCTCAGTCTATCTCAATAAAAG 

ATATTTCTTTTCATCTGCAAAAAAGGATTCAAAATAATGTTAGAGGCAAAAATGATGACC 

CTGACAAGCCATCTTTTAMCGCAAAAGAACTGAAGAACAAGAGTTCTGTGATTTGAATG 

ACAGCAAGTGCAAAGGCACAACACTGAAAAAAATTATATTTATAGATAGCACCTGGAACC 

AAACAAACAAAATAT TCACT GATGAGCGACTTCAAGGGGTTGTTACAAGTTGAGTTGAAA 

ACAAGAAAMCTTGCTTTTGGCGCCATCAAAAAGGGAAAGCCAGATACT7TCCTTTCTAC 

AATTGAAGCCAT 

Sequence 650 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTGAGGGGCTGCTGGA 
CAGAGCGGTAGCGATCAATGCACAGGATGAAGACACTGAAAATGGACGCTGTGCTGGCCA 
CATAGTCCATGGAAAGCCAAAAGAGGCAGAGAGGACGGCCCAGTGACCACTTGACCTGCC 
CGGN 

Sequence 651 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAGCTCACTAAAAGA 

TTGAGACCCCCTCCTAGGACTATAGAACCCTTCCTCTCCCTCCATATTTTATCACCACAT 

TACTAAAGGCCTGTTTGCAGCAGTTCCTCTTAACTTGTGTGTCATGTCCAGTAATCAAGA 

AAAAAATTACAAGACATATTAAAGGCAAGAAAAAACAGTTTGACGAGAAAGAGCAAGTAT 

CAGAACCAGAGTCAGATATGGCAGGGATGTTAGAATTATACACCAAGTGGGATTTATCCC 

AGGTATTCAAGGCTGGTTCAGCATTCATTCAAAAGTCAATTAAGGTAGTCCATCATCACA 

TCAGCAGGCTAAAGAAGAAAAATCATGATCATAGCAGTAGATGCAGAAAAAGCATTTGAC 

AAAATCTAACACCCACTTATGATCTCAGCAAACTAGGAAGAGAGGGGAACATCCTCAACT 
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TGATAAAGGAATATCTACCAAAACACCTATTAGCTAACATCACACTTGATGAAAAACT 
Sequence 652 

TTAGGGCGAATTGGAGCTNACCGCGGTGGCGGCCGCCCGGGCAGGGTACTTTTAGTAGAG 

ACAGGGTTTTACCGTGTTAGCCAGGATAGTCTCGATCTCCTGACCTCGTGAGCCGCCTGC 

CTCGGCCTCCCAAAGTGCTGGGATTACAGGCATGAGCCACCGTGCCTGGCCACGTCCCTA 

TTTTAGAAATGAGAGGAGTGACTGCACATAGGAAAAATGCCACTTTTAGCAATTCAAAGT 

GGAAAAACTTCTTTTATATAAAAATTATCCCAACTCCCACCCCTTGGCTCTCAGTGTTGC 

ATCTCCCACAGAGGTAAAGTTGTGCCATTTTCCCACGGCTTTAAACAAAGCAAAACAAAA 

CCACCAATCCTAATAACCCC 

Sequence 653 

CCGCGGTGGCGGCCGCCCGGNCAGGTACAGAGTTACAGAGCATAATTTTAAACACTTAGT 
TGCAG GTATA TAAGTATGGTATGTTTCATGGACTGGAGACTTTGCCTGTTCATTTTTTAA 

ATATTATTTTCAGGTCCTTTGCTTACCAAAGGAGGCCCAATTTCACTCAAATGTTTTGAG 

AACTGTGTTTAAATAAACG 

Sequence 654 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCCGAGACT 

GGGAAGAAAAAGAGGTTTACTTGGACTTACAGTTCCACATGGCTGGGGAGGCCTCAGAAT 

CATGGCGGGAGGTGAAAGGCACTTCTTACATGGCAGC^AGAGAAAATGAGGAAGAAGCAA 

AAGTGGAAACCCCTGATAAGCCATCAGATCTTGTGAAACTTATTCACTATCACAAGAATA 

GCATGGGAAAGACTGGCCCCCATGATTCAATTACCTCCCCTTGGGTCTCTCCCACAACAC 

GTGGGAATTCTGGTAGATACAATTTCAAGTTGAGATTTGGGTGGGGACATAGCCAAACCA 

TATCATTCTACCCCTGGCCCCTTCAAATCTTATGTCCTNACTATTCAAAA 

Sequence 655 

CTACTATAGGGCGAATTTAGCTCCCCGCGGTGGCGGCCGAGGTACACGGAGCTTGAAGCC 

GAAAAGGGTGAGGATCACTTCTACAGAGAAGGAGCCTCGGTCCACAAAGTCACCATTAAA 

TATATAGGGGTTGGTCTCCGAGGGTAAACCGTTGAGCTCGAATATGTTGAGGAGGTCATA 

GAACTGGCCATGGGTGTCCCCACATACTGTAATCTTCTCTGTCTCTTTGAGTGTGGTTTC 

CACGAGCGTGCTCAGCTTGGAGAGGACCTCTTTGACCTGTACTGAGTTTTATTATCTCCA 

ATAATAATGGT CTGGAA CTCAGGAGGATCATAGTAAAACCTCCAGTGAAGGTTAAAATTC 

AGGCCTGTrGATTTTTTCTTCGTTTTATGTTTTCCAGCAAGGATATCATAAGGACCAACT 
AAT 

Sequence 656 

ACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGNCAAGGTACTCTCCCAC 

TGGAACTCTGGGGCCCACTGAGGCACCATTATTGGGGATTTCAGGGTGGGCTGGGCACTG 
CA 

Sequence 657 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAGAACTGGGAAACAACACTTGGTTAGTCTCTT 

TTAAGTTACAAAAAGCCAATTGATGTTTCTTATTCTTTTTAAATTTTA 

TAAATACTCACAGGATACCTTATTTCCCTAGCTATCATCTCCTGACTTMTGTTTTTTAA 

ACCCACCAATATAMTTTAATTAMGATATATGTTGTAAAAAAAGAAAAAAAAAAAAACC 

TTGGC CG CCCGGGC AGGTACTCTTTTCCCACTCTGCGGTTCTGCCAAGCACCTTGCCTGG 

GCACTTGTGTGTGTGCAAGACTCTACAGTGTGGTGTCTGCTGC I I 111111111 I CTTCT 

TC 

Seq uence 658 

AGGi uilllll IGTCTGAGTCCTCTGACGTCCTGCCCAAGNGAACAGCTGCGGCAGCCC 

CTCCCAGCCTACCCCTNCTGCGCTGCCCCAGAGCCTGGGAAGGAGGCCGCTATGCANGGG 

TANCACTTGNAACAAGGAGAACCCACCTTGAAGCTTATTNCCTANCCCNTNAGCCCACCT 

TGGGGNGGNTTAANTTACCCTG GGGGGACC CCTTNCTTGCCCNATNGCCNTTCCANTTTA 

C AAAAAACAAATCC AAATTTG C NTTTTTTTTTTTTTTGGTTC C C AAAAN NTAAAAAAAC N 

CTNAGGCTTTATNCNTCTTGCCCNANNNTTTTANTCNGNANTATTTGTATTCNANAAAGT 

NGGGGGGGGGGGGNNACNCNTTGCCCCAGGGACGGGCCCNCNTTCTAAGAAACCTAAGAC 
GNGAATC 

Sequence 659 

CACTACrTAGGGCGATTGGA GCTCCCCGCGGTGGCGGCCGCCCGGGCAGGf 1 I t 111 1 (T 
TTTTTTTTTTTTTTTTTTTTTTAAC AG G AGG AAAAAAAATTTAATAATATATGTTTG C AC 
AGGAGTTCCACGAAATATGAGACTCCAAGAAGGGTCAGATGATTGACACTCATACNCCAT 
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CGTGAGCTATCGAAAAGAACGGCAGTTTGGGAGTTCTGCAGGGAGTTGACCACANAAGTG 

GGAGAGTGAAGGGAAGAAGTGTGTCGTGAATAAAGCTTGGCTGGTTTTCANATAAAAGGT 

CTTGCCGAGTGGCCAGGTGTGGTGGCTCACTCCTGTCACGTCCCANCACTTTGGGAGGCC 

AAGGCGGGCGGCTCATGAGGTCAGGAGTTTCGAGACCAGCCTGGCCAACATAGTGAAACC 

CCGTNTTTACTAAAAATGCAAAAAAT 

Sequence 660 

CNCTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCGGGCAGGTACTGTCTGG 

GCTGGGGGGGACAC TGTCCAA GGGAGTGGCCCCTATGAGTTTATATTTTAACCACTGCTT 

C AAATCTCGATTTCACTTTTTTTATTTATC CAGTTATATCTACATATCTG TCATCTAAAT 

AAATGGCTTTCAAACAMGCAACTGGGTCATTAAAACCAGCTCAAAGGGGGTTTAAAAAA 

AAAAAACCAGCCCATCCTTTGAGGCTGA I I I I \C\ \ I I I I I I AAGTTCTATTTTAAAAGC 

TATCAAACAGCGACATAGCCATACATCTGACTGCCTGACATGGACTCCTGCCCACTTGGG 

G 

Sequence 661 

GACTACTATAGGGCAATTGGAGCTCCCCG CGGTGGCGGCCGAGG I rTTTTTTTTTTTTTI 
M u N M l I I I M ! M I I I I M I I I I I I I I I I NGCCNCGAGGGGGGTTTNATTTTCATT 
ATTCANACAAANAATTTTNTANAANATCCCGGGNCAAACCGGAAAATTTGGCAGNCCNAT 
NGGGGGGGG 
Sequence 662 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAATTTCAATAATAAATTA 

AAAAAATGAGTAAAGTAAGTCTGAGAGAAGTTCATAGTCACTATATTACTCAATGACATT 

TTTCTGCCTTCATTTCTAAGAGTTCATCTGATCCAAGGTCTGATCATCTTCTTGACAGAA 

AAACTTGTTTTGACAGCACATTTGAAATGCCGAGTGTTTTGTTATGTCTAAGC^ 

TGATGTTTGTTAATAACAGAAACACTTTAAAAAGAATCAATTGCAATCMGMGTGTTGA 

TGAACATTTGGACAANGNTACTTGGCCCGGGGCCGGCCGCCCGGGCAGGTACAGCAAGTT 

TTCATGTCTTTTTTTAATAAATAGGATTTCTAGGAGTCAGTATATATTTAATACTCTT 

TCCTTAAGAAATMGAAGTTTAGGNCCMGTGTTAAGCTTTATCCTTGACACTGGCCTTA 

TCTCACATGGAGGGATTAAAAGGGCC 

Sequence 663 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACGAAGAGAGAAATTCA 

AACAAAATATTGCTGTTCTTCAGTTTTGTTTGTGGAATTTGAAATT 

TAAATTACTGGACTGTGGAAATAAAAAAAAAAAAAAAA^ 

GACCNGCCCG 

Sequence 664 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTCAGTTTCATAT 

TACTCTAAATCCATTACAAATCTGCTTAGCTTCTAAATATTTCATCAATGAGGAAATCCC 

AGCCCTACAACTTCGGAACAGTGAAATATTAGTCCAGGGATCCAGTGAGAGACACAGAAG 

TGCTAGAAGCCAGTGCTCGTGAACTAAGGAGAAAAAGAACAGACAAGGGAACAGTCTGGA 

CATGGCATCAGAGATCCACATGACAGGCCCAATGTGCCTCATTGAGAACACTAATGGGCG 

ACTGATGGCGAATCCAGAAGCTCTGGAAGATCCTTTCTGCCATTACACAGCCTATGGTGG 

GGGGGTGGCAAATTGTGGGCCTCTACCGCACAGGCAAATCCTACCTGATGAACAAGCTGG 

CTGGAAAGAAA 

Sequence 665 

TAGGGCGAATTGGAGCTCACCGCGGTGGCGGCCCGCCCGGGCAGGGTACACNGTGGATGG 
GGCAGTCGGTCTTNTCTGGGCAGGTTGTTGTTTCTNTCGGTAGTGTCCGGCGAGATGACC 
TCCTGCTGCTGGGCCTGGATGGCCCCCAGGATTCCCCAGAGCAGGAGCACGGAGCAGGTG 
CCCTGGAGCATCTTGGCAGCACCTGTGGCCCTGAAGTCCTGGTCCCGAGGAGGCACCT 
Sequence 666 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGI ITI I I i i I I I I I I I I I I 

GGCAGTGAGGACTTTATTGGGGAGGGTGACTTTGGCTTCCAGAAAAGAGGGAAGGTGGGG 

ACCTAGTAACAATAACCATTTATTCCAAGGAGGCCCTGGCCCTGAACCCGGGGTTCCCAC 

AGGAATCAGGGAGGCACCTGAGTCCCCCAGGACCAGGGCATCCCAAGGCATCATGGCAGC 

TGCGTTGTTCAAAAGGAAGTTTCATTGAGCTTCATCTTGGGAGGTGTGAGGGGGAGTGCC 

GAGACCGCTGGAGGGCCACGGGGCTGGTGTGGGTGGGCCGTCGGAGGTCTGGCCCACTCC 

GCACCAGTTCCTGGCAGCGTTCCACAAAGCTGCCCCCACCACGGCGCCGGGCCTTNAGCC 

TGCGGGGGGCTTGGGCTCCCACGGTGGCCAGAAAGGGAAGGTGTTNCTGNAGGCTNAGTG 



Page 92 of 63 5 



WO 01/51628 



PCT/US01/00798 



Table i 



GAGGCTG 
Sequence 667 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCCTCCTCGTCCTGCCG 

CAGGGCCAGAACCCCTGACGGTATTCAGCTGCGCGTAAGTCTGGCCGGTGCCATCTGTCT 

CCGCAATGCCCCCCAAGAAACAGGCTCAGGCCGGGGGCAGCAAAAAGGCGAAGCAAAAAA 

AGAAGGAGAAGATTATCGMGACAAAACTTTCGGTTTGAAGAATAAGAAAGGAGCAAAGC 

MGAGAAGTTTATCMGGCTGTCACACATCAAGTTAAATTTGGTCMCAAAATCCACGTC 

AGGTAGCACAGAGTGAAGCTGAAAAGAAATTGAAGAAGGATGACAAGAAGAAAGAATTGC 

AGGAGCTAAATGAGCTGTTCAAACCTTGTAGTTGCTGCTCAAAAAAT 

Sequence 668 

CCGCGGTGGCGGCCGAGGTACCCCTGTTGGTATCTAGGAAGTAACTAGCTTGCTTTTGAT 

TTTAC AGGCTCATAGGTGGAAGGGACTTGCCTTGTCTCAGATGAGACTTTGGACTGTGGA 

\ [ l » i G CGTTAATGCTGAAAAGAGTTAAGACTTTGGGGGACTGTTAGGAAGGCATGATTG 

GTTTTGAATAGTGAGGACATGAGATTTGGAGGGGCCAGGGGTAGAATGATATGGTTTGGC 

TATGTCCCCACCCAAATCTCAACTTGAAATTGTATCTACCAGAATTCCCACGTGTTGTGG 

GAGAGACCCAAGGGGAGGTAATTGAATCATGGGGGCCAGTCTTTCCCATGCTATTCTTGT 

GATAGTGAATAAAGTTTCACAAGATCTGATGGCTTATCAGGGGGTTTCCACTTTTGCTTC 
TTCCTC 

Sequence 669 

TACTTAGGGCGA ATTGG AGCTCCCCGCGGTGGCGGCCGAGGTACCTTTTGGGAATCTAAT 

GTATT GTAA GGTATTTTACACGTGTCCTGATTTTGCCACAACCTGGATATTGAAGCTATC 

CAAGCTTTTGAAATAAAATTTAAAAACCCCCAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

Sequence 670 

CT ACTTA GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAACTGGATCTATAG 
ATATTTTGAACTGGACCAGGTGGGTATTGAAGTAACCCATCAAAATATGCTCTGCAGTGA 
T TCCGCTTAATGTTTAAATTCAGTA ACCGTC I 111 H 1 11 1 I M I I HI II I 11 1 I H IN 
Ci I M I M I I 1 I 1 1 N M I 1 M M 1 I 1 t t 1 1 I M 1 M 1 M 1 1 1 1 I 1 1 1 
Sequence 671 

GCTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGGCCTGCACCAG 
AAGATGTCTGCAUACTCATTGCTAAAAATGTGTAG 

TGTTTACAAGAAGAA ATTT AAACTCTACGTTTGGTTTTCACATACAGCAGCTCTATTGAA 

TAACATGCATCTGAATTTTAAGTTGCAAAGGTATCTGAATAAT1TTTCATGTGCATCTTT 

TGTCGAATGTTTTGGTTCAAGAAAGAATGTTTAAAGCTTTTTAAAAGACTTCA 

ATGTAACTGTACAAAAT TAGTT GTAAAAAATAACATAATTTACCAGTAAACCCACTCATA 

TAGAAATGTGCAAAGCCTTTTGATATAAAAAGGTTTGTACCTGCCCGGGCGGCCGCTCTA 

GAACTAG 

Sequence 672 

AGGTACTGAAGCCAGCCACGCTGCGCCCGGCCCTGCCCCGGGCCTTCCTCGTGCCTGGGA 

GGTCGTTCTAGGGATGCTCCTGACCTCCGTCTCTTGGACCTAAGATGGAATGTGTCCCCA 

GCTCAGGGATTGCCTGAACCAAGAGGCCAGGAGCCCCCATGGGCCGCCCAGTACCTGCCC 

GGGCGGCCGCCCGGGCAGGTACAAAGTGCTGCAGTAGCCGGTGAGCAAACTCATGTGTGG 

CTCCATCTCGGCTCCCTGTTCTTCCTCAGGAATCCACACAGCTTCCCAAAGCACTGTTGA 

TGCAGGAAATCTAACCTGCCTATTCAGCCCATC 

Sequence 673 

CCGGGCAGGTACAATAAATCCTACAGGAATAATAACAACAGAAACATATTAACAGTTTCC 

TTTGGCAMmMCGGAAGTGAATGCTGATGTGAACATAGAAAAAATAATTACAAAAAC 

ATGAAAMTGGAAACTGACATAAMGTTAAAATAAAGGGATATTTCTAACTTTTAGACTG 

ATGAAAGAAATGTCTGAGGCCATTAAATGCAAAAAAGAAATACATTCTCTTTAAATTATT 

GNGTACCT 

Sequence 674 

AGGTTTTGC 1 I 1 I 11 I 1 G AAAAAG GT AAGTTGCTG ATT AAGTCT AATTGG AATTG AT AAT 
TCCATAGTCTTAGATTAAAATGAGGATATTTTCTCCTAGATTTTCTCATGTTATGCCATG 
CATTTATATATCTAACCATTAATTTCACACTAAGGATGCTTCACCATATAATAAAAGGAG 
CAAGATGGAAGCACTTTGAATTTTCTrrCATTGAGAATAACTGTTTTATGTAAGAATCTG 
TATTTATAACACCAGATATTAAGATAGGCTTCCATTTTTTAATGCAAGCCACTTACTTAA 



Page 93 of 63 5 



WO 01/51628 



Table 1 



PCMJS01/00798 



TCTTGNATTCTTTTTC 
Sequence 675 

A GGTA CACTTCTGTCTTCAAACTGTATCTTCTTTGGATGGAATTAAGATGTMCTGTATA 
GTTTTAAGATAA ATAAA TGGGAAGTTGGTCCAACTAAGATGACAGCAGATATATTACATG 
CAGGATTTAATATTTTCTAATTCTCTCTTT^ 

AAAAAAGTCAAAAAAGAACCCAGATTCAATATATAAAAATGTCCCACAATAGGTCGACAA 
TGGCAGTAAAAATAAGAGGAAAAAAGTMTCTTTCTGGAACATCATTTTTCAATAACATA 
GAAACACTAAGTGCCAGGAAGATTCCTAT 
Sequence 676 

CCCTTTCGAGCGGNCGNCCGGGCAGGNAC MINI NTTTTTGTTAACTTTATTTTTAATT 

CATGACACTTTCACTGATCATTAGGATGGCAGGGNATCGNTAAGAACCAGGATTATAAAG 

ATGAACAGGCCTCAGGTCTGTNTNTAACANGCTTATAATCTATGGTAAGAGGCAAGGNAT 

TTAATCTCCTTTGCTTCTACGATNATGGCTCANATATGTTTGCTCTGCTCTAAGAAAAAT 

GCATCAGAAGCACCTTAGGTTTGGCAGGNAGGGGCTCTGATGTTAACAGGATTGGGTAGA 

GAAGCTCTTGATGCTTCTGGAA 

Sequence 677 

NCCCGGNNNNNNNNGGGGNNGGGGGNNNNNNNNNNNNNNGGGGGNNNNNNGGGGGNNNNN 

NNNNNNCCCNNNNNNNNNNNNNGGCCGAGGTACCATATCAACCAGAGCCAGNCCACCACC 

ATTGG AAATAAAGNCCCAGCCAGTNAAGGTGGCCAAATTATTCCCACCATTAGGAAACAT 

TTTTTCAGGCATTAATGGACCCAATGGAAAAATTTTCCTGGTTGGCC 

Sequence 678 

AGGTACTGCTTAAATGAACACAAGATGGTGTCAGGATTCCMGTTTCTAGTTCTTTCTAG 

GGAGTTACCTAAGGTCTAAGTTTTATCCTGGACAGAATCTCTTGGGGAACTAAAATGCTA 

CCAATGAAAAGCTCTTGATACTCCTACATCATGAATATTATTGAGCTCTAAAACTTGAAC 

TGGTCGTGTCAGGAGAACCTTTAACTGCTCATGGCAGTTCTTGTGGACTGAGAGGAAGTA 

CCTGCCCG 

Sequence 679 

AGGTACTGAACATGTTCCAAACTCAGATAGTGATGATTGCACAACTCTGCGAATACACAA 

AAAAACATCCTCGGAGGGAGTCTGAAGGTATGTAAATTACATCTCAGTAAAGCTGAAAAA 

CTGCTTTGGCTAAAGTGGCTATCCCTCCATGGTGCTGGGTCCTGCCCGnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCGTCGA 

CCTCGAGGGGGGGGCCCGGTACCCCAACC I I 1 I IGGTTC 

Sequence 680 

TAGNAACTAGTGGGGATCCCCCCGGGGCCTGGCCAGGGAAATTTCCGGATTATTNCAAAA 
GGCCTTTTAATTCCGGAATACCCGGTCCGGACCCTTCCGGAGGGGGGGGGGGGCCCCCGG 
GGTAACCCCCAAGCC I Mill IGGTTTCCCCCI I I I I AAGTTGGAGGGGGTTTNA 
Sequence 681 

AGGTACAMCTATTTCATTATCAGCAAAAATTAACATCGACTGGACCGAGAAATGACTCC 
TTTTTACAAACGGGGAAAGAAACGGAGACTCCTGGAGGAAAGCTGCTTCTGCTGGACATG 
TTCTCCCGCGTACCTGCCCG 
Sequence 682 

CCGGGCAGGTACTACTCAGAGAATTTCATTGCTACTGCCTGTTCGTTAACCTAGCAAGTG 

AGTAAATTGAGGCTTTAATTAAAAAAAAACGGTTGTTGTTCTTTGATATACATT^ 

TAGCTCAAGCTTCAACCCTGCCCAGTTCCTATGCTCAGACCCCATCTTCAAGGATGCTCC 

TCCTGCCTAACTCCTATCTCAGAATATTG M I I I I CCTTTCCCGTTTAATCCCCAGGCTC 

AGCTCCTGGGACCAATCCTGGTTCAGGCTCCTGTTGCCTGCCTCATCCCACAGCCGTCCA 

TMTTTGGTGGTTACGTCATGCACCTGAAGTGGTAGTGAGCATTTATAGTAATAAAAA 

Sequence 683 

AGGTACATTTTGAACAATCTACTCTTCAAGTGGATCATTTACAAAACAAGAACATAATTC 

CCTGAATCAGTAAATTTTATTTTCATGAAAGGATGACAAAGAAATTCCATTTACACTATC 

ACGATTTACATTTTCCTACAATTCATGCAGCTTTCTTCAATTCACAATCTACAAACCACA 

ACAATGCACAAATTTGATGTTAGAGGATACTTTAATGTTTAGTTCTCATCTCAACAGTCA 

GTGTTGCTAGTGTCTCTCATTCTCACTCTTCACTGTTACTTAAAGAACAAAGAACTAAAG 

AAGCCAGAAAGAACTAGATCAAGCGATGAGACCAATTTTCAGGGTAAGGCATAATAA 

Sequence 684 

AGGTACTACTGGCACTGAGCCAATGTATGCTATCAAGGAAAGCTTTATCTGTCACTGAGC 
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AAAAGGTGAAGT TCAAT TAGGTCAGTTTTATCACTTCTTTTCCTACACACAAACTATGAG 
GAGAGATC ATTTCTTTTCTTTCTTTTATTTATTT A 1 111111 1 GAGACAGGGTCCCACTC 
TGTeGCCCAGGTTAGAGTACCTGCCCG 
Sequence 685 

AGGTACCTCCTGCCTTGAGTGATTTAMGCATTTTMGATATGCTTTGCTGCCTCTAGAT 

ATTTCTCTGGTTAGATCCCCACAAAGATACTGTAACATTAAGTTCTGGGACCATTTGAGA 

ACAAACTAGCTGCTCACAGATTCCTGAAAGAAATGCAAAACCTTTTCTCTGGCCCCAAGT 

ACCTGCCCG 

Sequence 686 

AGGTACTC \ I l I ICAAMTGCCATATTTTGAAATTGCTATTTGCTATTATATGATGCTGT 

GTCCTTGAGAAAGAGAATACATGGATCTGGCAACCAGGAGTGTGTGTAGGTAGAACTGGC 

TTCTC TCATCATAT ACCCCAGGGATCCAATGGGAAATGTTTrCTTCCTGTCCTTTTAACT 

TCAGG! 1 1 I I I I I ICGGGGGGTGGGGGTAGCGGTAGGGGGTTAGTGAGGATCTTCTAGCA 

GAGAATACCTTACAAGGATACCACTAAATCTAAAGCTCCAACTACCTNCTGGNCACTTNG 

GGCTCCTNCTGACACTAGNCCAGCNGGCAAAAAGGNTACTGAACT 

Sequence 687 

AACCCACCGCGGNGGCGGCCGCCCGGGCAGGNACTNCTTGCNNGGGATTAAAAACCCGAC 
AGCAGCAAACNGCAGAANCNNCAGACNGCAGGNTTGCNGANGGNGAGAGGGAACNCCCGC 
CCNAAACCCACNGCCACNGAACCNGGCNGGGANACCAGNGGCCCNGGNGGANGCACCANA 
G ATGTTTGAGCC NGG NTN NCTGGCC AGGGGG NGGCCGCC ACC N NAN N N 
Sequence 688 

CCGGGCAGGTACGCGGGGGGATTCAGAGTGAGAAGGCATAAAGGAGAATCCCCAGCTGAC 

TTGTGCAGTGGTTAATTGAAATTATTCAGGCAAGAGATGATGGTGTCTTGGACCAGGGGA 

TGAGGMGGCTACAAAATGTGTCTACCTGTATTCTGTGAGGAGAACGTGTTCCCTGGTTT 

TAGATACTGTGAAGATGGATCAGGAGAGAGTTTATCTAGACTGTTGGGGAAAGGTGTTGC 

GATTCCTTCAGCTACACAGGATTGAAAGGAGACATTTCTGAAGGGGAAAAAGGAAATGAA 

AAANNNNANANNNNNNNNNNNNNNGGTACCTCGGCCCGCTCTAGAACTAGTGGATC 

Sequence 689 

AGGTACTGCAAACTAATATTAGTAATAATTTGTTGATATCAGCTTGTGAGCTTTAAAAAG 
C CTGGT CAAACAAGTTTTATGTATTATGTTTATGCATGCTATGTTTAAAATGAATAATTA 
AATT TTGAGTAATTATTAGGATTTGGCTGGGACAGGGGATGGATGAGCTTTCTGTTGTAA 
TTTTGCTCTTAGTGAGTAGTAGGTAC CTGC CCG 
Sequence 690 

AGGTACCACTTGTGATGTGTTTMTAAATTCTGTGTGGTTTTTACATAGCCCTGAGAGGT 
AAGCAATGAGTCAGGAGTTTCCTCCAAGCACCTGTTAAGTGGCATCATATAAATAGCAAT 
GTCAGGAAATCAGGTACCTGCCCG 
Sequence 691 

CCGGGCAGGTACTGGAATGAAAATACAGGTAAAGCATTTGAAMTTATTTTTCTGGCTTA 
GACTGTTTACTGGCTCGATTCTAGCTCTGTCCTGGTAAAGGCCTCCAGAACCTTCACCTA 
CTATTATCAAGTCTGGCCTCATAGGGAGGGACTTTTACTGTACCTCGGC 
Sequence 692 

GAATTTCCATCGGTTTGCTGAGAAGCACAACTTTGCAAAACCCAATGACAGCCGTGCTCT 
CCAGCTGATGACCAAATGTGCGCAGACTGTGATGGAAGAACTAGAGGATATTGTGATCGC 
GTATGGACAGAGTGATGAGTACCTGCCCG 
Sequence 693 

ATTGGAGCCTCCACTCGCGGTGGCCGGCCGCCCGGGCAGGTACCATAAGAACATTAAAAG 

CTGAAATTCAGGTTGGTCTGCAGCCTGGGCAAGGGGAACAGCTCCTGGCCTCTGAGCCCA 

GCTGGTGGACCACTGGAAAATATGGAATATTTCAGTACCTnnnnnnnnnnnnnnnnnnnn 

nnnnnr.nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnT 

CATGGTCATAGCTGTTTTCCTGTGTGAAATTGGTTATCCCGCTCACAATTCTCACACAAC 

ATACGAGCCGNGAGCA 

Sequence 694 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCAGCGGCCGCCCGGGCAGGTACAACAAAAG 

TTCATCCTAGTAATTCTTAAAGCTCTTCAATTCCTATAGAGTTTAGCCTTTGTCAATAGC 

CAAAATATGTGCTTGAAAAATAACTTCTTTGAGTTTCAAAGCAAATGAAAACATAAAACA 
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AGCAAAAAAG GCTTT TTTGTTGGTGNTTTTCT 

C ATAM TACGGTTTTCAAAMGCATTGCCTCAGCCAAAATTATTGCCCTTTTTAAAAATG 

CTTTTCATGTATACACTTTCTACATAACTGCTTTTCTTTACAC 

Sequence 695 

AGGTACATGAAGGCCCCCA GTTC CCCCATGCTAGACACGTCCCCAGAAGCAGCACCTAAT 

GGGCAACACTGCGGAATCATTTTCCACCCAGATCAGGGGCATCCCACGGACACTTATTCC 

AGAAAACTGAAGCTGGGCCACAAAGAAGGCTCCCATCCTTGCTGCTATTTGCCCTGGACC 

ACTTCAAAATGTGACACATCGGGCTGCAGTGAGCTGAGATCGTGCCACTGTACCTGCCCG 

GGCGGCCG 

Sequence 696 

CCGCGGTGGCGGCCGAGGTACTGAACATGTTCTAAACTCAGATAGTGATGACTGCACAAC 
TCTGCGAATACACAAAAAAACATCCTCGGAGGGAGTCTGAAGGTATGTAAATTACATCTC 
AGTAAAGCTGAAAAACTGCTTTGlGCTAAAGTGGCTATCCCTCCATGGTGCTGGGACCTGC 
CCG 

Sequence 697 

CCGGGCAGGTACAGTGAACACGCCTGTAAAACTATCACTCATATGAAAGAGAGGACAATA 

TCGGCGCAGAATACATACTAGCAGATACTGCTTTTTTACTGTGATTTGGAMCTTTTGGT 

ATTCAAGTAACCACTAAGGAATATAACTCTAACATGGACAAGGCTTAGATGCATCATGAA 

AGTCTGTCATGGGAAAGGTCTGTGGACAGAGGATGTACCT 

Sequence 698 

AGGTACAAGGCAGCAGAAGCAACAAAGGCTGTGGCCAGCAACGAAGGCAGAGTCCCCGGT 

AGGCGGAGGTCCCCGTGCACAGTGTGGTGCAGCAGTGAACCCAAGATTTTCCTTCACAGG 

GCAGGGGAACTATTTCCTAATAAGCATTGGCATTCAGGCCACAAGGGCAGCCATTACTCA 

GCACTGCACTGACCTTGGGAAT7TGCTGGGCAAGGAAAATGACGTGGCCCTCATCATCGA 

TGGCCACACCCTGAAGTACCTGCCCG 

Sequence 699 

CCGGGCAGGTACTCTAACCTGGGCGACAGAGTGGGACCCTGTCTCAAAAAAAATAAATAA 

ATAAAAGAAAGAAAAGAAATGATC TCTC CTCATAGTTTGTGTGTAGGAAAAGAAGTGATA 

AAACTGACCTAATTGMCTTCACCTTTTGCTCAGTGACAGATAAAGCTTTCCTTGATAGC 

ATACATTGGCTCAGTGCCAGCAGTACCT 

Sequence 700 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTCTGAAAAGCCTAGTG 

CACTCCAGAAGGCAGGTGGAAGGTGTTGTTTGAAGTGAACCCTATTTTATTTTTCTCAAG 

AAATTCTTGAGCCGATTCTGTGACATGAACATCTAAACAGAGTAACCAGATTCCAACTGC 

ATGACTCGMTAGGTTTACACATTTTTCTTCTGTCCCGCCTTTCAAACAAAATGGTAAAT 

AACTTCATTTCCAGGCCAGTTTAGCATTCCCA I (IN I ACCCTGATG AGTCCC ATTTTAA 

TGAGCAATTTGCTTTATCCAGTAGTGCATTAGTGMGTGTTTCCTGATTTTATGCGGCAA 

GTGTTTAT 

Sequence 701 

ATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCTTCTAAAAATGAGTGAACATGGTGAC 

TTACCAAAGTGTGATTTGAGTTTCCTGGACCCTCTGCATGAAATGTGAACATGAGCTATG 

CTGAGATCACAAGTTAAATTTACTTTTGAAACCAGGTATACAGTTGATGGATGTCAAATG 

ATAAACGTATGCATAGTGATGAAACGGTCTTTTAAGTTTTAGGTGTGCATGTTTGCTTTT 

TTCCTGAACCTGATTCAAATAGTTGTAATTTGTACCTGCCCG 

Sequence 702 

AGGTACCTGCAATG AAGGA GAGGAGAGGACAGGGAGCCAGATTTCCAAGGACGAGAGGAT 
GGATGCGGAGCAGGTTTTCTGTAGCATGTGGAGGAGGGTAGGAGGTGGCATGTGATGACC 
TCGTGCTCCAAGAATAGTGCTCAGTACCTGCCCGGGCGGCCG 
Sequence 703 

CCGGGCAGGTACATCCTACTCCAAACAGCCTAACAAAGAGTCAATCAAAGGAGTTCTTTG 
ACCTGCTA CTTG ATTTGAGAGTTCCTAAGCAGTTTTCGTGTGTGGTTCTAGAATACCCAG 
AATGCTGATTTTGGTAGGAGGCTGTGGATATGTACCT 
Sequence 704 

GCGGTGGCAGGCGAGGTACGTGACAAATGTCATCTAATAACCCTGGGAGGGCTGGGTCCA 

CCAGGCTGTCCCAGTATCCACCACCCTCTTCCTATCCTCAGGGTGCAAAGGCCAATTCAC 

TCCCTCTCCCGTTTCCTTTCATTCTTTCAGTGTTGCTGTATTATGTAGTACCTGCCCG 
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Sequence 705 

GCTNATTGGAGCCTCCACCGCGGTGGCGGCCGAGGTACGATGCAAACCAGCAAGCCAGGA 
AGCACCTCCTTCCCGTCACCACAGCAGGCCTGAGCAGCAAGGAGCGTCACCGGAGCCCAC 
TGGAGAAGCCCCACAACGGNCTNCTCTTCCCCCAGCACCGGGGACTATCAGTACCTGCCC 
G 

Sequence 706 

CCGGGCAGGTACCCTCCAGGCCCTGTGTATTATTAAGATCTTTTAGTAGCGAGTTGCTCT 
TTCTCTGGGAAATCGGCTGTTAMGTCAGAGGGAGCTCTTAATAGTTTGCATGGTATTTG 
ATTAAATGGAACAGTTGGATCAGTACCT 
Sequence 707 

ATTGGAGCCTCCACCGCGGTGGCGGCCGCCCGGGCAGGACGCGCTCTCCTTCGAAGTCCG 

GAGGAGTTTCCTGGATTTGGCACTCTCGTGCAAAGCGGTCATATGCTGCAGAGTGTCTCC 

TCTGCAGAAGTCTGAGATAGTGGATGTGGTGAAGAAGCNGGGTGAAGGCCATCACCCTCG 

CCATCGGGAGACGGNGCCAACGATT 

Sequence 708 

AGGTACATCTCTAGC TGATG ATTCAAAAAAGAAACCTTTTAATCTCACTCCACTGATCAG 

CTATGATACTTAAATGTTTTAGCTGTGAGCAAAATAATATGCATTCTCAAAGAGAGTATC 

TTCAGACTCCAGTG GCCG AGAATCTAGAGTTAGCAATGGAAAAATTAGTCTCGGGCTTCT 

GTTTCTGCCCACAGTTTTCAAATTAAGAACAATGTGTTTGCACTTAATGAMCAACCTC 

ACTGCTCTTCAAGAGGACTCAGGATACCGATTCTCGAGGCCCCTGGCGGTCCCCTGTAAG 

TACCTGCCCG 

Sequence 709 

AGGTACAGCATCGCTGGTGGTTTCAAAAAACGTAGTCATTCCTCTCACTGCAACAATGTA 
AGATAAGCAGAGTAGATCTGTTATTTCCAAATTAAAGGTGATTAAGATATATGGAGAGAG 
AACATGGCATGTGAGGTTTATAGGGCTAGAAACTGCAG AACCATGTAG AACCC ACATTTA 
ACTACAGTACCTGCCCGGGCGGCCGCTCGA 
Sequence 710 

AGGTACATTTGAGATGGTCTCACGTGAGACATCAATACGGCTTGCTGGGGGGCACAGGTT 

TAGGGCAGATGAAACTCACAGGAGGGCGGGTCTGGGTTAACTGAGCTAAAGAGCTTTTCA 

AGCCACTAGAGCAACANAGCTGCCCACAGTTGAGTCAGATTAACCTGGGAAGCCTCCAAG 

TGAATTGGNTACCAGCACCACATTCACAGATCTCAAAATTTATTGAAACTGATTGAGAGG 

NTGGATTTTGATAACTAAGA 

Sequence 71 1 

ACCCCCTATAGGGCGAATNNTGGNAAAACCGCCCCGGCGGCCGAGGNACCCAAACCTNTN 
C ATGG NCAAGAGAAAACCNTTTGTG AGGG AAANNTNTTAAAAN N ANGACATGAAAATGGA 
GACGGANATTAAGAGAAACAAAAAGACTCNNCNAGACCAGCATGGACAGNACCTGCCCGG 
GCGGCGT 
Sequence 712 

GGGCGAATTGGAGCTCTTCGCGGTGGCGGCCCGAGGACTCAGGATCTGCAGAGGCNAGGC 

TGCTTGGCTCCCATCACNGGAGG7TTGCTTNNCAACCCAGAAGGAGCTTTCAAACAAGCC 

TAGATATTCCTGCCTGGCCTGCTGCCATTCTCCMGAAGGCMGTAGGCACMTAMTTA 

GACTGGACGGTAACAGTTCACATGTTGCCCTTTMGAAGTGTTGATATTTTATGGCAAGA 

GGTTTGTTTGTACCTGCCC 

Sequence 713 

CGAGGTACTCAGGATCTGCAGAGGCCAGGCTGCTTGGCTCCCATCACAGGAGGTTTGCTA 

GGCAACCCAGAAGGAGCTTTCAAACAAGCCTAGATATTCCTGCCTGGCCTGCTGCCATTC 

TCCAAGAAGGCAAGTAGGCACAATAAATAAGAACTGGACGGTAACAGTTCACATGTTGCC 

CTTTAAGAAGTGTTGATATTTTATGGCAAGAGGTTTGTTTGTACCTGCCCG 

Sequence 714 

CCGAATTGGAGCTTTTCGCGNTGGCGNCCGCCGGCAGGTACTGTATGTATGTTGGTAAAA 

TGTTGTGAAATTTMTGAGGACTATTTNTTGCCATAAGACCTTCCGGACAGAATCTATCT 

CTGAAAAAGAATTCCCACTCCTAAAATAGAGGTCTGTGAGCAACAGATGTGGTAAATGAC 

CCTTGTTCAACTGAACCAAGAATATATTGTGATCTGTATGAGGCACAATTCAGAAGAGAG 

GGGAGGTGGCTACAGCACAGCCTCAGTGTTGAGTTTGGTTGTGGGGTTTAAACATATTTA 

TTTATTTCATAAAGTTTCTCTCATAAGGCACAATGCAAATGAGGTTTTAATTAATCAATC 

TAATGTTCAGAGGATGGTCTTCATGCTGTTTATAAAACAATTTACAI I 1 1 I ATTTTGTAG 
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TCTAAAGAACGTGTGTCGAAAAGCATGTCAGTGAGTAGCCATAACCTTCAGAGGGGATCT 
ATGCCTGGCTCTAAAACTAGNGGATCCCCCCGGGCTGNGANGAATTCGATATCAAAGCTT 
ATCGATCCCNCGCCTCGGNGGGGGGGNCCGGGNCCCANCTTTTGGTCCCTTTAAGG 
Sequence 715 

CGCCCGGGCAGGTACATACC CACA TACATACCCACACATAAACATATAAATAATATTTTT 

TAAAGAACCAAGGACAATAAATTTTCCTGATGAGCCAACATTTTAAGGAAAAAAATAAGA 

AAATTGTATCTTTGTGACCTGACATAAATACCCAAGGATAACTGCCATCCAGCCTTCTCA 

CGGTGTGACTGAGCACAGTGGGTCCTCCCTTAGCTGAATTCTAAGAGTGAAGAATAGTTT 

ATTGTTTGAAGCAGTGAGGAGAGCACTAGGGATATATTTCATAATACATGAAGAAAGGAG 

GGAAG ACTAATGCAAG TTTG TACC T 

Sequence 716 

AGGCGAATGGACTCCCGCGGTGGCGGCCGCCGGCAGGTACATACCCACATACATACCCAC 

ACATA AACA TATAAATAATATTTTTTAAAGAACCAAGGACAATAAATTTTCCTGATGAGC 

CAACATTTTAAGGAAAAAAATAAGAAAATTGTATCTTTGTGACCTGACATAAATCCCAAG 

ATAACTGCCATCCAGCCTTCTCACGGTGTGACTGAGCACAGTGGGTCCTCCCTTAGCTGA 

ATTCTAAGAGTGAAGAATNtTTTATTGTTTTGAAGCANGTGAGGAGAGCACTAGGGATAT 

ATTTCATAATACATGAAGAAAGGAGGG 

Sequence 717 

TTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTAATTTAATGTTTAATAATATTAAATAT 

CTTTAATATTAACGAAGTGATCTTTTAAGAGATATTGGTATTGCAAATGTTTACAAATAG 

TTTCTTAGTTGAATCAAGATTGGTATGAGTTGTAATTGTGGAGCTGGGTGATAGATCATG 

ATAATTCATCATATT GCTCA GTTTATATTTGAAATTGTATATTAAACTTGAAAAGACACT 

TTAATTTGAATTACTATTTTATCTTTTAAATGTGGTATTTCTCCTTMTT^ 

AAACACTATAATTTTATCACAGTACCT 

Sequence 718 

CCGGGCAGG TACA GAGACTCCATCTCAAAAAAAAAAGAAATGAAAGATAAAAAGATTATC 

ATTGATCCCATTTTACAGTTGAGAAAACAAAGACAGTTCAGTTCCTCACTGAAAGTCACA 

CAGATTGTNAGCAGTCACACTGGGAACATAAGCTGGGCACTCCTAGTTCATGATCCTAAC 

CACTGTGCATTAATGCGGCACAGACATGAAATACTGATGAGACTATTCTTGAAAAAAAAT 

CACTCAAATAATCAAGTGTGTGTTAATTGGATTTGACTTTTTTCAGTACTCCGTCATACA 

GCAAAGGTAAATCAATGGGTTGTT 

Sequence 719 

GGAGCTCCCCGCGGTGGCGGCCGAGGTACCCTTGGTTTCTCAAACAACTCACTGATTTAT 

GGTCTTGAGACCATAAACTCATTTTCCTTATATGAATGACATTTCCACATGCACAACANT 

ACCACCAAATATATGTATCTAGTTCTTACTAACTGCAAATCCTCAAAGTGAACTGCGTGC 

ATTTTAATGTTGCGTAGTTTGCTGATTTATGATTACCCTTAATGTACCT 

Sequence 720 

NCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCTTGGTTTCTCAGAC 

AACTCACTGATTTATGGTCTTGAGACCATAAACTCATTTTCCTTATATGAATGACATTTC 

CACATCCACAACAATACCACCAAATATATGTATCTAGTTCTTACTAACTGCAAATCCTCA 

AAGTGAACTGCGTGCATTTTAATGTTGCGTAGmGCTGATTTATGAmCCCTTAATGT 

ACCT 

Sequence 721 

C TACT TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACGCCTCTGGATTCA 

CTTTTGATAATTATGTCATGAATnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnTCGGCAATCAGTGGTAGTGGCGGTGGCACACnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnGTTTTAAAATTAGNGGTATCCCCCGGGNNTGNAGGGAATTTNNATA 

TCAATNTTTATCCACCCCCCCCCCCCCCGGGG 

Sequence 722 

CCGCGGTGGCGGCCGAGGTACCCTTGGTTTCTCAGACAACTCACTGATTTATGGTCTTGA 
GACCATAAACTCATTTTCCTTATATGAATGACATTTCCACATCCACAACAATACCACCAA 
ATATATGTATCTAGTTCTTACTAACTGCAAATCCTCAAAGTGAACTGCGTGCATTTTAAT 
GTNGCGTAGTTTACTG ATTTATG ATTTCCCTTAATGTA 
Sequence 723 

TGTAGCTCNCCGCGGTGGCAGCGGCCGCCCGGGCAGGTACTCTTGTTTAACCATCAGAGG 
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TGATCCCATCACCTTCACAGCCCCAGCCTCTGCTCCAGTCCCTCCCCAGCGAAAAGGGCC 

GCC CATG CCATCCTGCTTGCTTGGTGATTTGCTTTGTGGTCATGGACTCAGTTGGACATC 

ATTATTTTATTAACCGTGTGGTAGGTTT^^ 

GTGGTCCATCTTTAGGCTTCCGTTTGTTCTTTGGTACCTCGGCnnnnnnnnnnnnnnnnn 
nnnnnnnnnnnnnnnnnNGAATTNTNATATCAAAGCTTATTCNATACCGTCNAACCTTTN 
ANGGGGGGGG 
Sequence 724 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGCAGTTAGAGC7TCAATCTCC 

AGTGTGATGGTATTAGGGTTAGATCTTCAATCTCCAGTGTGATGGTATCAGGGTTAGAGC 

TTCAGCCTCCAGTGTGATGGTATCAGGGTTAGAGCTTCAGCCTCCAGTGTGATGGTATCG 

GGG7TAGATCTTCAATCCCCAGTGGTGGTGGTTAGAGCTTCAATCTCCAGTGTGATGGTA 

TTGGGGTTAGAGCTTCAATCTCCAGTCTGATGGTGTTTCGGGATGGGGCTTTTAAAGATG 

TAATTAGGGTTTAAAGATCATAAGGGGACCTGGTCTGATGGGGATTAAGTAGGCTTATAT 

TGAAAGAAGACACAAGAGGG 

Sequence 725 

AGGTACI" rrTTTTTTTTTTI ICACGTGGTCCCAGCTTGAGTTTACTGAGCCTCCTCCAGG 

CTCAGATA TGTGC TTGCCTGGAGAAACTGNTTCCAAAGGCCAAGCAGCCCTTGCCTTTGG 

AGAGCTGCTTTTTGTAACTTACMTAGTGTTTCCCAGAGTGCATNCCATGAAAATATTAG 

TTCTTTCAACATGCTGAACAATGAAAGAATCCATGGTCAAATCCTGAGAACNTATTGTCC 

CTNTTTTAGAGATTCATAAGGTACCTGCCCG 

Sequence 726 

AGCTCCCCCGCGGTGGCGGCCCGAGGTACAGAACTCCAAAAGAAAATCAGGCCTCATTGC 

CAAAGCTCAGGGATAAGTCTAAACAGAAAGGCATTTATACAGCAACAAGAAAGTTACTGG 

GGGCTGGGGATGAGGGAGGGCTGGGGTAGGAGGATACTAAAATATTTTCTGAGGTGCCCA 

ACTGCTTGTCTTAGAAGAGGCTAAACTGAGCCAAGCGTCTCTGTTTGTCTCCTCCACCCC 

CTCCTCTACAGCTTTACAATGTTCTCTAGCAGAAGCAAAAACAGGGTCACTGCCATCATA 

GATAAAAGGATG 

Sequence 727 

AGGTACACGTTTTATATCAATATTTTATCAAGCAAAAAGTTGAAGCCAACACCCAAAGCT 
GCCAGATATGGAGAACCCAGTGACMTTTTGMTTMCCTGAGTGACCCTCTTTCTTTCA 
AA GTTGC TAAATTGTTTCACA GATTG CTTCTGTGGGATGTAAATAACAGAGTATATATGA 
CCll l l lAAAAAAAATTCTTTCTTTTCTGTGTTCTAAGCAACTGGAAACTAAAAACTGCC 
x CTGGCCTATCATAAGGGAGAAGATGGGAGTCTTTTGCTGTCCATCTTGAATGTAATTCAC 
ATTGTCCATGAGTGCTGATCTAACCCCATATGCCTCTGCCCGACTGCCTTTATGAAATAT 
GGTAATTTAATGCTTAMCMCTGTTTCATTTGTGTACCTGCCCGGGC 
Sequence 728 

CGAGGTACCAGATCAAMCCTGGGMCTTCGTATTTGTCCTTTTCTCTCTGCCAGGAATA 

TCGTCCTCTCCATTTGCCCAATGAGCCCCACCTNCTCAACATTTTTACCTNTGTGGAAAT 

CCCTTCCTTCCAGAAGCCTCCACTCGCTTTNTGCCAAGGAGCTTCTGCGGCGCCCTGCAC 

GCACCTTTACAGATGCAGGTGGCTGTTTCCTGTGTCAGACTGCAAGCTCCCCCCGCGTAC 

CTGCCCGGGCGGC 

Sequence 729 

AGGTACTCCACAAGCTTGCCTGCCATGGGCTGTCGGGATGTCCACGCAGCCACAGTCCTT 
TCCTTCCTGTGTGGAATCGCCTCAGTAGCAGGCCTCTTTGCAGGGACCCTGCTTCCCAAC 
TGGAGAAAATTACGACTGACCATTCAACAGAAACGAGAAGAACCTGACTGTTTACACAGG 
CCTGTGGGTGAAATGTGCCCGGTATGACGGGAGCAGTGACTGCCTGATGTACCTGCCCG 
Sequence 730 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGATCCAACTGTTCC 

ATTTAATCAAATACCATGCAAACTATTAAGAGCTCCCTCTGACTTTAACAGCCGATTTCC 

CAGAGAAAGGGCAACTCGCTACTAAAAGATCTTAATAATACACAGGGCCTGGAGAGTACC 

TGCCCG 

Sequence 731 

ATTGGAGCTCCCCGCGGTGGCGGCCGAAACTTTATAATCTTTTAACTAAATGTAATTGTC 
ACCATAATCTTATAGACAAAGCATTGAGGTTTATTGAGCTAATGCTGAAGGTAGTAAGTG 
GAGGAGCCAGGATGAGGTCAGAATCTGAGATTTTAACCATGCCTATGCTGTCACTTCTTA 
CACTTTAGAATACCTCCATGCTCATGTGGACACCTAGGAACAAATGAATATTTCTATTCT 
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TCTCCCAGAATTTTAAAACATTAAACATGTTAAACTGTATTTTTGTTTACCATAAAGCCT 
TCCNAGGAGGAACAAGCACTAMCACAGTCTCTGGCTTAAGGATTTTGGATGAACATATT 
TCAAAAGCCATCTGCTTCNCAGCAATTCATAATCCATACCCCTTTTCCTTTTTGGCCACT 
TATTCAC CAAG AATCTN CANTAGTTCCCTCGGGCCGCTNTAN AAACTNANTGGGATCCCC 
CGG 

Sequence 732 

NCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGAGA 

TCCCT CTTGT TGAGTAGCTGCAGAGACGGCAGATGGAAAATATGCTCAGAAGTTGTTTAA 

TGACCTTTTTGAAGATTATTCCAATGCTCTTCGTCCAGTGGAAGATACAGATAAAGTCCT 

GAATGTGACCCTGCAGATTACGCTCTCTCAGATTAAGGATATGGATGAAAGAAACCAAAT 

TCTGACTGCTTATTTGTGGATCCGCCAAATCTGGCACGATGCCTATCTCACGTGGGACCG 

AGATCAGTACCT 

Sequence 733 

AGGTACCCAAAGAAGTAAACTTAGCAACCTCATAATTCTAGGCACCTCTCTCAGGTAATC 

TCAGCTGGCTTCTCTAACACCTCTGGTTCCTCCCCCTACCAGCATGAGGACATGAACAGC 

CATGGATGGTGTTCTGCAAATGTTATCTCAGATAGTCTCTGGACAGGACCTGCGAGCAGA 

CTTGCTCAGGGTCTAGAAAGTCTTCCCTGGACAAATTCCAGATCTCTAGCTAGCTAAGAC 

AACTGCAGCCATTTTCCTGTACCTGCCCG 

Sequence 734 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCTTATACA 

ACGATGCTATAAATATTTGTATGCATAATTCAACAGTMTCAGTGGTGTTTATCTAAACT 

AACTGATAATCTACAGATTGCAGTGCATTTATGATTTCAATGGAATTAATCTAATTCTCC 

ACACTTAATTGTGAGAATAGCTATAAACAGATTGTCAAGAGGAGCCTTTTAGTGCCAATG 

CTTTAC7TGAGGAAAAAMTTTCTTTTGGGCAAACCCATCTTTATTCATTGCAGAATACA 

ACGATTCTCAAAAGTAGCTTAACAACCCCAACTCCGCTGGGTAAGTGTGGCGGCACACGC 

CTGTAATCCCAGCTACTTGGGAGGCTGAAGAGGGAAGGCTGCTTGGGGCCAGGGAGTTTG 

AAACCAGCCTAGACAACATAACAAGAGTCTGTCGCGAAAAA 

Sequence 735 

CTACTTAGGGCNATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTCTAACCTG 

GGCGACACGAGTGGGACCCTGTCTCAAAAAAAATAAATAMTAAAAGAAAGAAMGAAAT 

GATC TCTC CTCATAGTTTGTGTGTAGGAAAAGAAGTGATAAAACTGACCTAATTGAACTT 

CACCTTTTGCTCAGTGACAGATAAAGCTTTCCTTGATAGCATACATTGGCTCAGTGCCAG 

TAGTACCT 

Sequence 736 

G CGAATTGG AGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCACCATCCTGTCATAAT 
TCI II I I M IGGCCAGGGGGAGACAAGGGTCTCACTCTCTTGCCCAGCATAAAGTCCTTT 
TTAAAACTGTAAAATAGTTTATACATTTGAGCATTATTATTATAAGCTTTTGTTT 
CTCAGAAGAATATATTTTCAAATGATAGACTTCTGGGACTTTTGGTACCT 
Sequence 737 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATN N NTCAAACATACNAC 

NTAATG NGGGCATGANC ACATACTTTGCTGATANC NGACNAATGACTGAAGATGANTGGT 

GNAACCANGGCATNAAAATGAATCTATTCAATGACATGACAACAGNTCTCATTTCAAAAA 

TATTGCCNTACCTTTCCATAGCCCCCTTTACCAAGTNCCTGCCNGnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTTG 

GCGTAATCATGGTCATTAAGNCTGTTTCCTGTGGTGAAAATTGTTATCCCGCTCACAATT 

TCCACACAACATACGGANCCGGGGAGCATTAAAGTGTAAAGCCTGGGGGT 

Sequence 738 

CCGCGGTGGCGGCCCGAGGGTACCACCCATGTGGAGGAGACTGCAAGGAAGCTGTTATTC 

AMGTAGAACAGTCAGCCTTGTGCTTGAGTCCTGCTTTATGCTTGCGTGTTTCATAACAA 

AACACAAAGGCAAGTCTTCATATCAGCACTTAGTCTTGATTCAAGTAGCTGACTACTGTA 

CCTGCCCG 

Sequence 739 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGCCCGGCAGGTACTTGGAAAGAATT 
GACCCAGCTGAATTGGAAAATGTGGGAAGGGGATGGGGAAGAGGCTGCTCCACCTGAGAT 
CTGGCTCCAGGACTTACAGCAAGGGGAACTTGGGCAAGTTACAGACTGCTATGCCTCAGT 
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TTCTTTTAGCAAAACAGAATCATCCCATAAACTATAAGGTCGATGGTATCAGCGGGTCCC 

CAAACTGACTGCACATCTGAGTCATGTTAACAAACACATTCCAGGCCCCACCTGAGCCCT 

CTGAATCAGAATCCCTGTAAGGAGGACGATGAACTTGAATTTGCACTGACTTTCCCAGCT 

GTTTC7TACTCTGATCAACTTGGGGATAGGGACCCATTTGAGCTGCATCACATCATTCCA 

AAGCCAAAACACAACAGCAGGACAAGAATATTTTCAAGGCAGTCTCTAAAGCAGAGGAGA 

AACTGTTGAGGGAACCTAGAAAGTAAMGGAGATCTGGCTTGCTGGGCTCCATTTGAAC 

Sequence 740 

TATAG GGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTAC 1 11 1 1 1 1 1 1 1 11 1 1 11 I 
MM tNGGGACCAAGTTTGGCTTACNTGAGAAGTGCAAGGTTGGTTCAAAATACCTACTG 
TTTTCACCACCCTAACAGATTAAAAGAAAAAAGCCATATAATTACCTCAAAAGATGCACA 
GATCACTCTGGGG TTGGGG GAAGGGGAGGACTGGGAACAGAGAAGGTGCATTTCAATGAA 
AACAAAGACTTGGG I Mil I AAATTTGCCTTTATTTTTA I i 1 II i 1 ATTTTAGAGACAGG 
GTCTTATTAGGATGCCCAGGCTGGGGCTTTCTTTTCTTGCAAGTCA 1 11 M 1 11 1 CAAAC 
AATTCTATATGATAATCAACAGGGATGAACTAATGAANGGCNTNTTAAAAGAGGNGGTCC 
AGGTAAAAATGATTCAAGNGAAATNCCTAAATCTGGGGNGNCATGGTTTACCGGGGACCA 
GAAAMCAAAGGGGGGGTCCTTTTACTGNTGGTAAAAAGTTANTCTNTACCTGGTGGATA 
AA AATTT AAAANANCAAAGCCCCCAAAAAGGGGAAATCCCTACCAATGCCAATNCCNTTG 
NCTTTTNAGGGAATNNGNCTTNAAC Tn 11 11 1 M 
Sequence 741 

CCCGCCGGCAGGTACGCCCTAGCTCCAGCTTCCNTTGGGAGACTGTGCATCTCCTGGCTN 

NACTAACATNACCTTCTTCTGACCTTCCAGCCTAGAGATGATGACTCTGCCAGCCTAGAT 

GGGCTCTGGGTTGCTCCTATTCCTGTTTGTTTGAGATTTCCCATTATGCTGTCACCAACT 

CCGCAGCCTAAGCCCTCTCTATTTTAMTTCTCAAGTGGATTATGTTCCTGATTAGTCCC 

TGACTGATATACCACTCTCCTCATGATCTCTGATTAGTTTTCCTGTTAGGTTGTTGCAGT 

AAAAAAAAAAAAAAAAAAAAANNNGGTCCCTCGGCCGCTCTAGAACTAAGTGGATCCCCC 

NGGG 

Sequence 742 

AATANCCCGAATTGGAGCTTTNCGCGGTGGCGGCCGAGGTACCTTCATGCTCTAAATCAT 

NATGATCTACTATCTGAAAAGGAAACGACAAATTTTTNAACAAAAGAATTTCACAACAGA 

TAGGCAGTTGATAGCATGAGGCACTMCATTAAACCCAAGTCTTFCAATGGCACTTGGAG 

TCCCAGGGCCTGCCCG 

Sequence 743 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGCCCGGGCAGGTACTGTATCTCCC 

ACTAGGATGTCAACTCCGTGAMGTAGGAACTTACTGGTCTTGTTCATGGCCCTATTCCC 

AGCTCCTAGAATACAGCCTGGCATGTTGTAGGTGCTCAATAAAATCCTTGTTGAATGACT 

GACTGAATGAMCCTGGCCAAGGTCTAAGCATCACTTCAATGGGATTCACCTGGAAATTG 

GCACCCCCCTCCCCCGCGTACCT 

Sequence 744 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTGTATCTCCCACTAGGATGTCAACTCCGTGAA 

AGTAGGAACTTACTGGTCTTGTTCATGGCCCTATTCCCAGCTCCTAGAATACAGCCTGGC 

ATGTTGTAGGTGCTCAATAAAATCCTTGTTGAATGACTGCTGAATGGAAAACCTGGCCAA 

GGTCTAAGCATCACTTCAATGGATTCACCTGGAAATTGGCACCCCCCTCCCCCGCGTACC 

T 

Sequence 745 

GCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAACCTGATTACCTGAAA 

CAGCAGCNNAGNCATGCAGAGGAGGATCAGCAGCAAGAAGGGGTGAGCCATGAGTTCCTG 

AAGCCAGGAGGGCTCCATTCTTCTCAGAGGTCCTGAGCTCTGGAAAGCCTGAGAGAACTC 

CCCGCGTCCT 

Sequence 746 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTCTCCCACACTCTTCCTGACAA 

AGCAGNTGAGGGACTTTCTCTGCCAAAATGAGAGAGTAAACTTAAAAAGAAGACAAGGTG 

GGATATTGCAMCAAGAGATACAACACAGGAGGGAGCCGAAGAGAATCCCTTGATTTGAG 

GTGAAGCAGCTGTACCTGCCCG 

Sequence 747 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTTAGGCTCTCCAGGGAATGC 
CATCAGTAAAGGCAGCCTGTTCTGATGTCATGACTATAGGGAATGTGATGCCTTATTTGA 
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ATTAGGCGCTGTTTGTTCCACTCTTTATC { 111! [ CCTCCAGAACTGAGGCTAGTTGATC 
TTCCCTTGAAAGTTCAGTCCCCCTGAGTAGCTAAGCCCAATCCTGTCTGTACCTGCCCG 
Sequence 748 

NATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCTAACCTGGGCGACAGAGTGGGACCC 
TGTCTCAAAAAAAATAMTAAATAAAAGAAAGAAAAGAAATGATCTCTCCTCATAGTTTG 
TGTGTAGGAAAAGAAGTGATAAAACTGACCTAATTGAACTTCACCTTTTGCTCAGTGACA 
GATAAAGCTTTCCTTGATAGCATACATTGGCTCAGTGCCAGTAGTACCTGCCCG 
Sequence 749 

ANGGCGAATTGGAGCTTNCCGCGGTGGCGGCNCGCCCGGGCAGGTACAGTAGTCAGCTAC 

TTGGATATCAAGACTMGTGCTGATATGAATACTTGCCTTTGTGTTTTGTTATGAAACAC 

GCMGCATAAAGCAGGACTCAAGCACAAGGCTGACTGTTCTACTTTGAATAACAGTTCCT 

TGCAGTCTCCTCCACATGGGTGGTACCT 

Sequence 750 

GCTNCCCGCGGTGGCGGCCGCCCGGGCNGGTACCCACCGTTCATCTGTAACATGGCTGTA 

ATAATGGCACCTACTCGTTGCTGTTAGCATTGAAGGAGCCAACGTATGCAAGCCAATCAC 

TTAACAAATGTCCATTGTTCTGCTTGTCATTTCTTTGCTTATACTTCCTGACAACAAGCA 

GAAACCTTAGAAGAGTGGTCTGTTCACTTTAGGGAAGGCTAAGTCAGCATTAGGTGTCTT 

CCATCTTAAAGCACACAANACAGACTGG'GTAGAGCAGCANCGTCACTGATCGCCACTCTA 

ATAGCTCTCTGGCTAAGTTTCTCA 

Sequence 751 

GCGGTGGCTTTCGCCCGGGCAGGTACTCGGGCCTAGAAATTATTTAANNTGGCGACTGAT 

ACGTCTCATGGTGAACTCGTTTNTNCTAAGGCACTCCCACTTATAGTAGGAGCTCAGCTG 

ATCCACGCGGACAAGTTAGGTGAGGAGGGGCTGACATATTCTTGANTCACTCTTGGATCA 

ATCCCTGACTTCAGGCCCTGCTGGGTCCTTCTTGGTACCT 

Sequence 752 

CCGCGGTGGCGGGCCGAGGTACAATATAGGCAGACAGTTTGCCTTCAGAAATTCAGAAAT 

GCAGCTTTTGAGGGAGGTCAGCATCATTGGTCTCAGCTACTAGAGTTGAAGATGATTCAG 

CCACTTTTATTCCAGCCACTCCATTTCTAGCATACACTAAGAGAAGTGTATATTTAAAGA 

GGCCATTTTCCTGCAGGATGTTTATAAATAGTTCCTGGCCCCCGCGTACCTGCCCGGGCG 

GCCGCTCG 

Sequence 753 

GGCGAATTGGAGNTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGAAATACTTAAGAAA 

TACTTTTCACAAGTGNCAATAGATTTATGTGTTATGGACTCATTGCAGANGGCTGAGGAG 

CAGTAAAGGAGGGAAGGTTCAAGAAAGCCTTCAAAAAGGAAATAACACTTTACCAGAGCC 

CTGCTCACCAGTAAGAGCAAAAATTGTGCAACTTTGATATTTAAGAAATGGAGGGTTACT 

GCTAGTCGTAGNGGTGGGTGAAGTAATTCTATATGACTAAAGCATCAAACTCTAATAAGA 

G C ATTATAAAAAG AGG CTAG AAAATTG G AC AG G C AAC ATTC AACAT ATCTTATAATT AAA 

GATTTGGTAGTNTTATGTTAGAGGTGATAGACACATAGGAGAGTTTTTTAAATG TGGGAA 

GTAAAATTGGTCAAGACTTGAC7TTTTAGMCAGTAATAAAGGAGTGCCAGNTATTTTTT 

GGGAATTCATTTATGGTGACCCTTTTAA 

Sequence 754 

CCGCGGTGGCGGCCCGCCCGGGCAGGTACCACAGGGAAGAATACCTCAGTTATTCACCTT 

TTGTTTAAATGTTTTGGAATAACACAGACACAGCMTTATGMGTTTTTCTCTGCTCTAC 

TTAGATGAACAGATTCTGTTAACTAAAATGAAGGCAAATGAATGAAGAGAGTTCTATGTT 

ATATATCTGAGAGCAGGGTCTCTATTTTTGATACACTGACCATTGTTT CTCTGT AAAAAA 

TACAATTAATACTTCATCCTCTCCGTAACTCAGTTTTAGCTGMTACTATTTTTTAGTCC 

ATTGTATATGGCAGACTTTATGTTAGGCACGAGGGACATAAAGATTCATTCATTCCTTCA 

CGCCAGTAAAATANTTGAGTTGGGAATCTTCC TATG TTCCGAGGCACTGAGTTCAAGTGC 

TGAAGGTTTAACAATGAGCMCACAGACACAGTTTTGCCTTNCTGCAGGGTACCTCGGCC 

CGCTCTAGAACTAGTGGGATC 

Sequence 755 

CGAGGTACTGTAATTTTGGGGAGCAAGCTAACACATTTGACTTGCGGCTGAGCTCTTAAC 
TAAGCAATACCTCAGTATGCTCCTTCGGGAAAAATTAAAGGTTCAGTA GTCAA ATACTTT 
TGGAAATGCTGGGCCATTATGCACAGAGAAGGCCGCAGTMGGAACATTTTAAATTTGAA 
CAGAGAACATCCAAATCTMTTCATCTTAGAATCCATTTGCTATGGAATGTACCTGCCCG 
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Sequence 756 

GTAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGAAGCAATGACGTGGGGAGAGAGCGC 

NGAGGAAGGGAGAGATCCAGAAAGGTAGATTCTTCTGTGTAGGTGAGTGGATGAGATAAT 

GAAGCAAGTTAATTTGGGCTCTGCTGACTTATTTATCTGTCCCGGGCACTTTTNTCTGTA 

CCTGCCCG 

Sequence 757 

CGAGGTACTGCGGGCNCTCTTGATTCCAGTCTACTGGACTGCTTGTTGCTTTGGGTATTT 

GAGTATTTATGTGTTTCTTTTGGCTCCTGTTTGCATGGTCACTGGCTGTCATAGGGAATG 

GTGATCGGCTTTTCCAGTTGCTCAGGNCACAAAAGAGGTTGGCTGGTGTCATGGACAGCA ' 

CATGGGATTTGGAGTTAGAAGGTCTCAAGTTAAAAACCCTCAGGCCCCTGCATTCTAGCT 

GGGGGACCCTCGGGCAGGTTGCTTGAACCTCTCACCTTGACTGGTTTGTTGAACTTGCAG 

AGTGCTAGCAGCACCTGCCCCCAGCGTGGGGGAGTGGTTTTGCAGATTAAAAGTGNTGTG 

AATGCACTTCCTCGATGGAGGGCCTCTANCACCATGGTTTCGTCATCCTGTATCTGTCAC 

TCACCCTGTANGTNAAGGGGTTTTTCACATTTGNNGCATCAAAAAAGACCCCTA 

Sequence 758 

GCGGCCGAGGTACATGGTGGCTACCACATCCTGNTGTTTGTCTATACTGGGAGAATCAGG 

CNTTCCNAGTCATCTTGGCTGCCCTAGCTCTGAGCTTCAGGGGATATATGACTCAGAAAT 

GCTATGCTTTCTGGAATTTGGATATTTCA ! I I I I ATTGTTCTTGGTAAATTCTCTTTTGA 

CTTAGGAGAAGCTMCTATTTGGAAAGGTCTCTCAGAACTCTAATTACAAATATATGGTA 

CCTGCCCGGGCGGCCGCTCG 

Sequence 759 

CGN CCGAGG NACTGGNTTGACANTGTGTTTAAAGTCAAAAGATTAGGCTTGAG ATCTCTT 

TC TAGTGNGATGGTTTTACAAGTATATACCGTATGTTAATG NTTAAAAATTTCAC ACCCC 

AAAMTGTGCAGNATACCAGATGTTAATTATATCTCATAAAGCTATTAAAATTTTATCTC 

AAAATTATAGCTTTATTGCATTTAGGGCATTATCCMTTTTGAATCTAGTCCAGTTATCA 

TAGCTTAA TGCA GTATTATGAAAATAATGCCTATAAAGGGCCAGTTCCTCAAACACCCTT 

GGAACCAATTTTGCCATCTATATTAGNNACCTTGGGCTGCNATAATGAAGTACCCTCGGC 

CCGCTCTAGAACTAGNNGGGATCCCCCCGGGCTGCAAGGGAATTCCNATATCAAGCTTAA 

TCGATACCCGNCGACCCTCGAGGGGGG 

Sequence 760 

CGCCCGGGCAGGTACTTGTAGCAGTCCACAAAGAGAACAGCCAGAACATTCTCTATGCCA 

CTGCCTGCTTCTGGGTGAATCCCAGGTGTGAGTAATGAGGCTCACATGAGTGGGTATATC 

ACTAACACGGCTTAGGAGCCCCATCTCNAGTCATTATTTTGCTTGACAACCATGAGCTTC 

CAGGATCCCGACAAGGCACATGGCAAATTACAGGAGTCCTCACACTAAGAGAAGCCCTAA 

G CTATGG CCAGCACAGGTATTTATAGTTCCTCCCAGTCCTTTCGCAGTTTACCAGTGGGT 

CATTTTTACCATAAGCAGTGTTGCCTAGTAACATAGCTGACATTCTGCCTGTATGGTTTC 

CACGGGAACAGAGCTGATAGCTGGGTTAAACTGAATCCAAGCCCAA 

Sequence 761 

TGGCGGCCCGCCCGGGCAGGTACAC I ill! I GGCTTATGGGTATCTTAGTTTAACCTTTT 

CTNTTTGAGNGAACTGTGTCATTTCAAAAGCCTGAAGACATTGTGATGACTGCTGCCTCC 

ATAATGGCTACATrCTAGGGGCTTTGCCCTGAATCGAATATTAACTCAAAAAGCAAACAG 

TACCT 

Sequence 762 

CTCCTATTTGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGAGCCACACCAAAA 

TCTGGGAGAAGACTATTTCAGGAAAGGAGGAGGAAATTTGAAGACCTGGGGCTAGAATGA 

GCTCAACATATTCTAAGATCGGCAAGTTTAGTGTGGCTGGAAAAGAATGAGTAAAACAGA 

GAAACCTGANATAAGATGAGGTCACGGAGAAATGCATGGGTCAGACCACGTATAACCTTG 

CAGGTCATGGTAAAGGCATCTGGACTTTATACTGAGTACCTGCCCG 

Sequence 763 

CTTAGG GCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTAC IMIHIIIIl 

TTTTTTGGGTTTATT TGCTTA GGATAATGGCCTCCAGCTGCATCTATGTTGCTGGAAAGG 

ACATGATTTrGTTCCTTTTTATGGCCTTGTAATGGTTCATGGTGTATATGCACCACATTT 

TCTTCATCCATTCCAACGTTGATGGACATCTAGGTTGATTCCACGTCTTCACTATTGTGA 

ATAGTGTTGCANTGAACATATGAGTGCATGTGTCTTTTCAATAGAACTTTC! Ml! VCNT 

TTTGGGATATACCCCCCANTAAATAANAATTGCTGGGGTCNAAATGGGTATTATCATTCC 

TCTTTTTAGTAAACCAACTTATAAAAAGATTTTAAATCAAAG 
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Sequence 764 

TTAGGAAT TTNTAT ATCAAAGCCTTATCGAAACCCGCCNACCTCNAAGGGGGGGGCCCGG 

GANCCAAC I I 1 I IGGTCCCCCTTAANGGAGGGGGTAAAATGCGCCCCCTGGGCGGAAAAC 

AAANGGCAANAAACNGGGNTCCCCCGGGGGANAAAAAGGGNATACCCCGCACANATTCCC 

CACCAAAAANTANCCCGGCCCCGGGGAGCAANAAAAGGGGAAAAAAGCCCNGGGGGGGGC 

CCTAAATGAAGTGAAGCCTAACTTCACAATTAATTGCGGTTTGCGGCTTCACTNGCCCCG 

CTTTCCAAGTCCGGGGAAAACCCTGTCGNGCCAAGCTGNATTAATGAAATCGGGCCCAA 

Sequence 765 

CCGGCAGGTACATTCAAATTTTTGTTATCCTCTTCAAGAAAGTNACCTNGGTAGANTNCA 

TATATATCCCAACATNNTAGGAACTCATAAAATGCATACTTTTTTGGTGTGATCAGAGAT 

GGGTMCTCCTTATTACTTGTGGGTAGATAGTTTTGGAAATAGCCTGGAGTCATTGAGAG 

CCAATCCTTATCAACAAAATGGCATAMTGGACAGGAATGGGTTCTTCTTCACTTTAGAG 

ATAGCTCTGAAGACATGTCTCAANTGGGATNAGTTCAGNTGCATATATGAATAGCAGTC 

Sequence 766 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGQCCGCCCGGGCAGGTACGCGGGGGGATTCA 

GAGTGAGAAGGCATAAAGGAGAATCCCCAGCTGACTTGTGCAGTGGTTAATTGAAATTAT 

TCAGGCAAGAGATGATGGTGTCTTGGACCAGGGGATGAGGAAGGCTACAAAATGTGTCTA 

CCTNATTCTGTGAGGAGAACGTGTTCCCTGGT7TTAGATACTGTGAAGATGGATCAGGAG 

AGAGTTTATCTAGACTGTTGGGGAAAGGTGTTGCGATTCCTTCAGCTACACAGGATTGAA 

AGG AG ACATTTCTG AAGGGG AAAAAGGAANTGAAAAAAAAAAATN NNANNNNNN NAN AAA 

GTNCC 

Sequence 767 

TCGCCACCCATNTGTGAGGCACAAACTGTNGNNGNCTNCATTTCACAAATGCAGAAACTT 

AGGACCATGATAGATGAATGATTTGTC 

Sequence 768 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTACATGGGGACCGCCAGG 

GGCCTCNAGAATCGGTATCCTGAGTCCTCTTGAAGAGCAGTAGAGGTTGTTTCATTAAGT 

GCAAACACATTGTTCTTAATTTGAAAACTGTGGGCAGAAACAGAAGCCCGAGACTAATTT 

TTCCATTT 

Sequence 769 

TTAGGGCNAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGCTTAATGGCAAAAGAGA 

ATCTTAATCCCTMGAGCTTTTTCTAATTGATGGACATCATTTTCCAATTGAACAMTTG 

GAATTTATTAGTGGTGACTGTGGCTTTGMTCTGAGCTAGTTTATTCTTGCAGTCACAGA 

GAGTGTCCTTATAGAAAATAAAAAGGGAAAAGCTCAGTTCTGCTGAATATAATCAATATA 

CACMTCAATTGATAATTCACAACTACCTCCTTTGCCTTCTTGCACACTCTTCCTGCCAC 

TCAGAAACATC CATAC TTACTCTTCAGGATGGGTTTTTTGAGCCTGAATTATCCAGATGG 

GCTGTTTCAACTTTTTCAGCCATACCTTAAAAGTTGAAAAGGGATGGCCCTAATTTCCAC 

CTCTAATTCCCCTTCCAATTTCTGGGTTTGAAGCAAGACTNGAGAACTTGGG 

Sequence 770 

TTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACCTGATTTCCTGA 
CATTGCTATTTATATGATGCCACTTAACAGGTGCTTGGAGGAAACTCCTGACTCATTGCT 
TACCTCTCAGGGCTATGTAAAAACCACACAGAATTTATTAAACACATCACAAGTGGTACC- 
T 

Sequence 771 

TNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCCTGCGCCAATCAT 

CACAGCCTGGTTGTATCAAGTGTGTGCCAAACTGCCTGCGCTAACAACCAAATGCAAATG 

TCATACGTCTCTGTCACTAGCAGCATAGTTGCTAGAGAGACACGTGTGAGCCACACCTCA 

AAGACAAGCCTTCATCATCTTTTCCTGGAGATAGCTGACTAGCAATCAGATGTTCAAATA 

AAGCTACTTTCTTGTACCTGCCCG 

Sequence 772 

CCGGGCAGGTACTGNTTGTGATCCAGCTACTGAGGTCTTGAACCGAACAGACCTGGGAAG 

NGTCTCANAGCTGTCTCTCAGGCTCTGAGAGAAAGTAGCCAACACTACAGTAGACAGAGA 

GAGACAGAGATCAAGACAGGAGACAGAGACAGCATAAGACAAAAATACTTTATGAAATTG 

CCAAGTCCTCAAGACAATTCTAGNTMCNTCTTTTCAGCNAAANTGGNTGGAACCCAGGG 

CTTTTAACCTCCAAAGGTNNCACAAANNAAATNAAAACTAAATNANNAANAATNGCTTCN 

CTNNGGGCGCGNCTTCTTATGTAAACNTATNGNGGGGNTATACCCCNCCCGTGNGCCNTG 
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GCCATGGGGAAATTTTCCGCMTTTTCCANAGGCCTTTTATTANTGNATNATCCCCGNTG 

CANAAACANCTANTNNAGGGGGGNGG 

Sequence 773 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCAGTCACTTAATGGCCCCATT. 
ACCGCTCCTCTGTCTCCCGTGCTGCTAGCTCCTGCTTCTAACTCTCATCCCTAATCTTAG 
TGTCTATGCCCTGTTCTTTGATGTTTAATCAATTGCTTTTGTGA I I 11 H H I 1 1 1 II 1G 
CTATGCTAAATGCCATATTTAAGTTTCTCAAGA 1 I I I I I I I I A ACCTCAGGAGCAGAAAG 
GTG MTTAAT CAGACTCTTTAGCCTGTGGCAATTTTGTTGTATAAGGTCCACCTGTCTTG 
TCTTATTTTTCCCTTTATGCCAGATTCCACCACCCATCAGTCCCTGGCCTATCCCAACAA 
GGCACCCTACACTTAA 
Sequence 774 

C TNCTT AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTCAATTAA 

TATTTTATTGAGGTGCCCTGTMTCTGACATTCCAGGGAGCTTTCAAGGAAACTTTGTCA 

CACTAAAAAAAATTGAGGATTGTTTTGTGGGTTATTCCGCTCCTTTGTCTTCTAACCCAC 

AGAAGTTCTGTCATGGTTCTTAGGGGTTGTAGGGTGTGTCCACTTATTTTAAAATGTCAA 

AGACCAGAAGAGTGGAGAATGGGAAAGTGGCAAAGTGTTAGCACTGGAAAGATGTTTAGA 

GATCACCGTAGTCCAACCCTTAAATTTTATAGATGAATAAACCTGGGTATCTAAGAAAAG 

CTTAAGTGACCCTGCCCTGGAAATCACACCAACTAGGT 

Sequence 775 

ACAC TACT TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAGCT 
GAGATTTTGGAGTCTGGTGGGTTCCAGTCTTGCTTTCCACCAATAATTAGCTTCACTTGA 
CTTCTTTGGGTTTCCGTTTCCTCATTTGTATGAGAAAATAGGGATCATGAGAGTACCT 
Sequence 776 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTACTGGCACTGAGC 

CAATG TATG CTATCAA GGAAA GCTTTATCTGTCACTGAGCAAAAGGTGAAGTTCAATTAG 

G TCAGT TTTATCAC TTCTTT TCCtACACACAAACTATGAGGAGAGATCATTrCTTTTCTT 

TCTTTTATTTATTTA I MUM IGAGACAGGGTCCCACTCCGTCGCCCAGGTTAGAGTAC 

CTGCCCGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnGAGGGTTAATTGCGCGCTTNGGCGTAATCATGGTCATAAGCTGTTTCCTGNGG 
Sequence 777 

CCGCGGTGGCGGCCGCCCGGGCAGGTCTCAAACCCCAAGACTGAAGTGATCCTCCTACCT 

CAGCCTCCTGGGTAGCTAGCACCATGGCATGCGCCACCATGCCCTA I 1IIHIIH 1 AAA 

GACCCTCACCTCTTAAAAAAAAAAAAGTAGATTAAAAAATACCACAATTGCTCAGGTAGA 

TTGAAAAACAGGCATATAGTACCTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnCGTCGACCTCGAGGnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnTAGTGAGGGTTTAATTTGCCGCCGCTTGGGCGTTAATCATGGGTC 

Sequence 778 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGACCATAGATATCC 

CTGGGGGATGTAGTGGATTCAAAACATGAGTTGGATTCTCCAACGTGCCATCGGAATTCT 

TCTAATATTCAGCAGCTTTGATTGTTGTTTTACATCTGACCTGTATTTAGATTTCTCTGA 

TTCCTGGAATATTTGTGGAAAATAAGTACCTGCCCGnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnGTTAATTGCGCCGCTTGGCNGTAATC 

ATGGTCATAAGCTGTTTCC 

Sequence 779 

CGACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGTTATTCCAGT 

TTTGCAGAtGAAGAAGCTGGGATTTAGTGAAGTTAAATAACTAGTAAATAGAGAGGGCTA 

TGTTCCATCCTMGTTGCTTTTGATTCCAAGACTGCTATGGTATTTTGAGACGGCTGATT 

TGGTACCTGCCCG 

Sequence 780 

CTATANGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGTTTGTAGATG 
GAAGGAAGAACTTGTGTGCTTAGACCTGACGCTGGGAGGAGATGCTGCCACCTAGGTTAC 
TTGTAGGACCCTATACNGCAACCTCCTTTGCCAGGAACTATTTATAAACATCCTGCAGGA 
AAATGGTAGCTGAGACCAATGATGCCGACCTCCCTCAAAAGCTGCATTTCTGAATTTCTG 
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AAGGCAAACTGTCTGCCTATATTGTACCTGCCCGnnnnnnnnnnnnnnnnnnnnnnnnnn 
nnnnnnnnnnnnnnnnnn nnnnnn nnnnnnnnnnnnnnnnCNTCGACCTCGAGGGGGGGG 
CNCCGGTACCCAGCTTTTTGTTTCCCTTTAGTGGAGGGGTTAAATTGCGCC 
Sequence 781 

CTTANGGCGAATNGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTCTGGTGAGAA 

GGATGTTCATCAGGAAGGGCATCATATTGTGTAAAGGGCACAGGATTGAGAGGAAAATTC 

ATMGGCAATGMTGTATATTCACCTAGCCAGCCCAGTCATACCGTCAGAGACATTTTTA 

ATTCCAA TATG TTTGGTTACGTTTCTTAAAATTCCAAC C TATGCTCCTTATATGATACAT 

TCACCTCTTTTGTAAGCATAATCTCTTTACCATTACCAATTAATTGCAGCCCATCCTATT 

AGCTGTAGAAGAAAGATGTGGCAAATTTGGGGAAGTAAAGAAAAAAGGGGATCAAGGAAT 

AGACATAAAAGATTTTGTGATCACCTGGCGTATATCTACCCAGNTACCTTTGGGTGGTTA 

CTTCCTTTGGAAAG 

Sequence 782 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAGGAAAAGATCCC 

AAACCTGAATCTAATAATGAAGAAATACGATACAAATGAAAACTGAGAAACATTCTACAA 

TGTAACTGAACTGCAATATTCAAAAGTATCAAGGTCATGAAAATCAGGGGAAGACTGAAG 

AACTGTTCTAAAATGAAGAGGATTAAAGAAACATAACTAAGTATATAACCTATAATTCTG 

ATTAGATCCATTCCCTACACAGGGCATTGAAACAAAAGCCAAAATGTGTACCTGCCCGGG 

CGGCCGCTCG 

Sequence 783 

CGACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAGGAG 

GCAAAAAAGCAATCAGTATGACTTGAACTGCTGGGTTTAATTACTCAATATAACACTTGC 

CATTTAAAAATCCATATGCCCATCAGCATGGCAACAGTCTCTCATAAAGATTCCGGTATC 

ATATGGCACAATTTGTACCTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnn nnnnn nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCGGTACCCAGCTTTT 

TGTTCCCTTTTAGTGGAGGGGTTTAATTGGCGCCGCTTTGGCCGTAATCATGGGTCTTAA 

GNCTGTTTCCTGTGGTGAAAAATTGGTTATCCGCTCACAAATTCCACACCAACATACCGA 

GCCCGGGGAGCATTAAAAGTGGTAAAAGCCTT 

Sequence 784 

CCGCGGTGGCGGCCGAGGTACGCGGGATGTGACAACCCTGCTGAAGTGCCACCTACTATA 

TTTGGTCTCAGCGATTAAACGAAAAGAGATGGTAAGGCAACAAAAATTATCAGCATATAT 

TTTGAGCTTCT1TGAGTTTTGCAGATTAGTATAGTTCAAAGGATAGACAAATTCACTTGT 

TTTCATTTTGTCTTTAAGATAAATAAATTTTGGTACCTGCCCG 

Sequence 785 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCTTTCACTCTCATCAAACTAGACTACCGACTG 
TTTCTCGAAAAACl I I I ICCACCTCTTCACCTTTGCTTATGCTTATCTTTCTGTCTGGGA 
AGGTTCCTTCCACCATTCATCTCTCCATTTAAAAATAl Nil IGAGCACI I I I lAATTTT 
GAAAAATTTCAGTGCAGAGAAGTTGAAAGAGTAGTACCTnnnnnnnnnnnnnnnnnnnnn 
nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 
nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnGAGGGGTTAATTGCGCCGCTTGGCCGTAA 
TCATGGGTCATTAG 
Sequence 786 

GACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGGGAT 

TATAAGCATATAAGCCACTGCGCCTGGCCCTATATACCTTCTTTATTCCTTTATTTTAGT 

GGCATrrCAAGAGTGTTGAAAACATATGTTAGATCTGGCATGTTTAAATAGAAATCCTTC 

TGAGCCCTACTTTGGGGGCAAAGGGAGCCGGCCCATTTATAGAGCTCTTATGATGTGGTA 

GACAGCGGCAGTACCT 

Sequence 787 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTGCATGTGCACGTGCGTGTGGTAGTAGGCAAC 
TGCTCAAATGTCAGTTTTGAGATGTTGTTGGTGGGCCTGTGTTTATCTGTAACTGTATTT 
ATGTAGGTGTCTCTTCAGAAATATGTCTGAACTGGGCAATTTGTTGTACCT 
Sequence 788 

AGGTCTTGAGTCGACCCACGCGTCCGTGCCTTACAACTGGATCTCATGAGACATTTCCTC 
AACTGTGGCTTATAGTCTCTGATGAATCTANGTTTGTGCCAAAG7TAATACAAAATCAGC 
CAATATGCCTGGATTTTAAAAGTAATCTATTTAGGCTTTAC \ I I I I GATATTTTGACCTG 
CCCGGCGGCCGCCAC 
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Sequence 789 

AGGTACTTATATGCTAGGAGCCTGGGACATACTTTTATTATTTCATAATTTTACATCTTT 

AMTCTTAAAGAA ACAT CGGTMTATCTGCATGAATCAGAATTCAAACGCAGCCTTGCTT 

TCACAATCATAGATTTTCAGGATCAGAAATTTATTCTGTCAAGAAGAAGGATGACCAAAA 

ATGCCGGACGCGTGGATCGACTCAAGACCTGCCGGGCGGCCCGCCAC 

Sequence 790 

GGGCAGGTGTCTTAGGTAGGATCTTGCTACTCTGTGTGCACTTATCTTGGGAGTCAGAGT 

AAGTTCMTTT GTCT ACTTTATTCTGTGGTAGTTTTTCTTGTATTTTTACACTACG 

GTGTGATTTCATTTTCATTGTAATTTAGTC I I I I I GTTTATTGGG I I I I ICTTCTTTCAT 

ATTGGCTTCATCTMGGAACAATAATGTCATTGTTCATTTCAGAACCAAGAAAATATAGA 

ACACCTAGAAAATCAATCAACCAAACAAACAAATGAATAGTAGAAAATCAAACACACATG 

TATTGCCTCTTGTTTCTTGCTGNAATTTTTCCCTC 

Sequence 791 

AGGTACACTTTTTACACCATAACTGCCATGTGGTGATAAGCTAAGTCATCTCATTCCTTC 

TCCCTTTCTCTGTGCCTGTCTCCATGGTCATTCGGTCCTATCATCTGTTAACAGGTAGGA 

GAAGGCATGTTTGCATTTGAGAATGATGGTTACTATAAGCAGTCAGTTGCTTATACTGGT 

CTATGAAGCATCTAGGACTGTCCCACAAACAAGAGAAGGAAAAAAAGCCAAATGGAGCGG 

ACGCGTGGGTCGACTCAAGCACCTGCCCGCGCnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnTGTTCCCTTTTAGTGAGGGGTTAAATTGCGCGCTTGGCCGTA 

Sequence 792 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAGTTTACT 

ACCCAAATTGATATTACTACCTGAGCATTTTTCATTTTCTTACATAATCTTCCAAAACA^ 

TCATT TTATC TACCACAAAATATTAAATTGAATAAAACATTTTTGCTGAATT^ 

TGGGCTTTTAGACCGGTTTTMCATTTCAGTGTAATATGTAATGCAGCTCAAAAGCTTTG 

TGCAAGTGGT 

Sequence 793 

AGGTACCCTCGGTTGCAAGCACAAGCAAATGTGCCAGGGTGGTTGATGCAGCTGTGGTCA 
CAGGTCCTATCCAAAGAGCACTCGTCCACATCTTGGCAAGACTTCTCATCTGTTAATAAT 
TTAAATCCTTTCTT 
Sequence 794 

CGAGCGGCCGCCCGGGCAGGTACATCATGCCTCCAGTTCTGGAGCTAACACAGATTTCTC 

CAATTTCATCTGTTTTGCAGAGCTGGGGAGGTCCATCTGGTTTCACAATGCACATCATCC 

CACCAGGCATTACATGCCCTACATCCTGGACCGTCAGTGCTGAATTTTTATCTTCAGTAT 

TGACCCGTATTACCCCATAGCTCAATCCATTCATTGAGAGAATGGCTCTTCCTGGCAAAG 

GGGCTCCTGGAACTCCAGGCCTGCGGATTGCTACAGTCATGGCTTCAGCAGACGTGGCGC 

ACGGTCAGATGGCCTCAGGCTTCAGTCCATGACTTTGGAACAG 

Sequence 795 

AGGTACCCTCGGTTGCAAGCACAAGCAAATGTGCCAGGGTGGTTGATGCAGCTGTGGTCA 
CAGGTCCTATCCAAAGAGCACTCATCCACATCTTGGCAAGACTTCTCATCTGTTAATAAT 
TTAAATCCTTTCTT 
Sequence 796 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAGGGCAATCAGAGGGTGAATGATACGCACACC 
TGTGTATCTCCCATATTCTGCAACTTCTG I I I I I ATTAGAAGTGTAGAATAAGTTCCCAT 
CTTGTCTTCTAAGCTTTCAGGTTCCCAGGGTGTACCTCGG 
Sequence 797 

CCCTTTCGAGCGGCCGNCCGGGCAGGTACTTCC I I 1 I I I I I I k I I 1 I I I \ CTTTTTAGCT 

ACCTGGGTATCTAGCTTGGGACCCTGTCCTCTTCCCACTCCACTGTAGGGTAAGATNAAA 

CTTTTCTAGGATTNTTATTAATGAAATTATATAGCATGGATACTTTrATATCTGGCTTN^ 

TTTGGTTAGCATAATA I I 1 1 I G AG ATTTATCCACGTTGCTTGTATC ATT AGTTTG GTC AT 

TTTGCATTGGTAANCAGCATTTCATTGGGTGMCATAATAAAACTTTATTTn 

TGG AC ATTTAAATTGGTTT N CGATTTGGATAACTGGAAATAAAAACTGCTATGAAC AC AT 

ATGGTCMNTCTTTATGTGGGCAGATAAATGCTATTTTTTAAATTrAAGAAAAA^ 

AACAGAAGTGAATTGANTAGCAAGTCACCCATATTTAAAGTTGGAATCATCTGAAAAGAT 

TATATATCTTGGTTAAAAACTAAAAACT 

Sequence 798 

CCCTTAGCGTGGTCGCGGCCGAGGTACTTTATCATCCATAATATCTTACTCTCTAATCCA 
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CATAATGGAGTTGTGCCAGACTCCAAACTGAACATAATATTTATATATACATATATATTT 

ATATATATACACACATACATAAACTATAAACCTGAATATCTTAGCATACAAAACTTATCT 

AACAACTAATAATTTAGGATTCCTTTGAGAGGGAGGGGTTAGAAGAAACTCTTCTTAGTC 

TTTGAAATAGTN CTGACAGTAAGCAATAAACAACATTATTAATTNCTGGGTTCTCATTTA . 

TTCAAGTGAAAGACAAATAAAATAAAAATTCACTTCTTGAGCCACTAGAAATTTTGAAAT 

CATAAAGG ATTA TTTGGGAAATGAGTAAGTCTTTAAGAACAATTTAGTGNTATCTAAAGT 

AAAGTGGATTTTGGTAAGATACAGAGAAAAANTCTATGCATAAATNAATTCTTGNTTCAT 

TGAGCMCTATMTCGCTMGGGGGGTCTTTNTMGNATCTCAGGAATGTTTTTAAAATA 

TT AATAGTG AAT AAATTAN ATTGAGG ATAGGGACC AATC 

Sequence 799 

AGGTACTCAAAAGGCATTTCCGCTTACAATTTGTAGAAACACAAAATGCGTTTTCCATAC 

AGCAGTGCCTATATAGTGACTGATTTTTAACTTTCAATGTCCATCTTTCAAAGGAAGTAA 

CACCAAGGTACTGGGTAGATATACGCAGGTGATCACAAATCTTTTATGTCTATTCTTGAT 

CCCI (Ml i CTTTACTTCCCAAATTTGCCACATCTTTCTTCTACAGCTAATAGGATGGGC 

TGCAATTAATTGGTAATGGTAAAGAGATTATGCTTACAAAAGAGGTGAATGTATCATATA 

AGGAGCATAGGTTGGAATTTTAAGAAACGTAACCAAACATATTGGAATTAAAA 

Sequence 800 

CCGGGCAGGTACTACCCAAGGTCCTCTGTGACTCGCCGCCCACTACCCAAGTGAATGAGT 

CTCCCCTAGAGCTTTGCTACTCAGAGGGGTCTGAGGACAACAGCATGGGCCAACACGTGC 

ACTCGAGCTGCCTGGAGATCTTGTTCAAAGGCAGATTCTGAATGAGTAGGTCTGGGTTGG 

AGCCTGAGAGTCTGTACCT 

Sequence 801 

CCGGGCAGGTAC I I I I I I I I I I I I I I I I I I I I I ! I f GTAAATAAAATTTTAATGGAACAC 

AGCCATGCCTGTTCATTTACATATTGCATAACCCCTACCCCTAGAATTGAGCAGTTGCAA 

CAGAGGCCATATGGCCTGACAAAGCCTAMGTGTTTACCTATTGGGTTCTTTACACAAAA 

TGTTTGCCCACCCCGG ATTATATC ATGG AC TCG ACTTG TTTTGGTTTCATATTCATAGTC 

TGAATATATTTTGGTAGCCTTAACAGTTCTACAGGGAGAGAATATACAAGTCAGGCTATT 

CTAGGTTTTCTGTAGTTTCACAGATTTGTCATTATAATCAGAT 

Sequence 802 

AGGTACCAGGACCTCTAACTCCCCCTGACACAGAGCAATTAGACTCCCATAACAATGGTA 

TCAATTATACCACTCCATTGGAGGGACTTCCTTTATGTGTCACCCAGGATACATTGCTCA 

ACTGCAGTTGCCTTGCAGTTTGATCCCAAGCATGGTTGAGTTACCATAAAAAAATTATGT 

ACCTGCCCG 

Sequence 803 

CCGGGCAGGTACCTGCTAAGTGCTGGACAGCCTTTCTCACATTTCCTCCCAGCAATCCTA 
GAGGCAGGCCTGGTTGCCT7TGTGAAGCTCAGAGTGGTTAAGTAACTTGCTCAAAATCAC 
AGAGCTACTAAGTGGTACCT 
Sequence 804 

CCGGGCAGGTCTTCCCAGGGATAGTTTTCCATTTGATTAAAGTTTTGTTCTTATGTTACT 

TTTTACTGTTGTTTTTGCAGTTTACCTAATGCTAATAGGGTCTCAGGAACTGTATTTGAT 

GTTAAAGTGTGGTTTTTCCAGAAGATGACAGATAATTGGTGGTCTCCCCTTTTCCTCAGC 

AACATAGTTTGTACAGCATACTGACTCAATTCTTAAGTCTGATTTGTGCAMTTTTTATC 

GTACTTGAGAGTTACAAAGCAAGTGAGAACTTGAGGGATCAAGATCCTGGAGAGAAGGAA 

ACCTTAAAAGGGTAMCCCAACATTTGGCTCTACTTTTCCCCTTGAGGTAT 

Sequence 805 

CCGGGCAGGTACAAACCCCAAGTGATTATAGAAAAATCAATGTGGCAGCTACACTAGAGA 

TGTCCAACCCCAAGGCTATGGGCCGTTGCTCCCTCTTTCCCCCCAATCCCAATCCCGCGT 

ACGCGGGGCCTCTTTTCCGTGGCGCCTCGGAGGCGTTCAGCTGCTTCAAGATGAAGCTGA 

ACATCTCCTTCCCAGCCACTGGCTGCCAGAAACTCATTGAAGTGGACGATGAACGCAAAC 

TTCGTACCT 

Sequence 806 

AGGTACCCCTAAGTGGGAACCTACCAGGACATTCAAAGCAAGAGCAGTAAGTTCTGAATG 
TTCTGGGACAACCTGGGTGATATGCATGGATATGGGCTGTGGAGGCTGAGCATTTTAATG 
ATAAC TTAGGGAAACG AGGCATGGCCATG GTGTAAAACTCTCAAATCCCAAGCCCTAATC 
CAACCTTAAAATCCGAGTCTTCTAAAGGGCTGTTTTAACCATGAAAGGACCATAAGAAAG 
GCAATTCACAGAAAATGAAGCCATGTGGCCAAGAAATATAAGAAAAACAGTAAAAGCCCT 
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TAATCTCAATAGCAATAGAGTGGATGCAAATGAATATAAT 
Sequence 807 

CTCTCCAGTCTTTGATTGTCCCGCAACAGTATTACAAACAAAAGGCATTAGGCAAAGCAT * 

GCTGAATTGATTGGAGTCCCTTGGTCAAAGGTATTATTGATTGACGGCAATCAGATCCAC 

TCCCTCAGAAAGGATTTGAGTAGGCCTNTTCTGTCCATCTGCAGAAGGTTCCCCAAAAGG 

GGCAGAGGGCCGGGGCCCTGGTGGGAGGGTGCCATGGGAGTTAGGGTGACCCTGAACCAG 
GAGTAGCAAG 

Sequence 808 

AGGTACTGAATTCTACCCTGGAAAAACAAAACCCAGGTGTCTCCTCAGCTTCAAAAACTC 

TCAGGGAA TGAAT CCCTGTGTCCTACACCCAAGTATGTGGMTTTAAGAACCTGCTGTGG 

ACCTACCTATTTTCTTAGAAATATGCAGCTGAATATAACCATTTTTGGATATTTGAGATC 

ATTATGTACCTGCCCG 

Sequence 809 

AGGTACCAAATAATGGAGCTAGMTTCCTATCAAAATAATGGAGCTAGAATTTCCATCAA 

CATATAAAGTCCATATGTGAGCCTCATATAAGGCAAACTGTAAAATCAGTCAAGGTTCTA 

A GTCTTTC CTCCAAGATCTGGAMGAGTGATTGAGCATTCGTTATTTTAAATTACGGACT 

All I I I I iCCATACMGGMGTTMCATCTAGAGCGATCATTCTCAAACTTTATTGTATA 

CCAGMTCATTTGGAGGATTTATTAAAACACAAAGTGCTGGGCCTTACTCCTGAGTTTCT 

AATTCTGTACACTCTGCCCCCATCCCGGGATGAGCT 

Sequence 810 

CCGGGCAGGTACAGGTATGGGGACCACAGGAACAGTTAAATTCATGGCATGGCTGGTCTA 

CCACACAGTCGGGGGAATTCTTTAAATAGAGCCTGTCACTCTCTTGGCCCATCAATGGGA 

TTTCCTTCTCGAAACTGCTGATTCGTTCAGGTACCTCGGTCGCTCTAGAACTAGTGGGAT 

CCCCCCGGGCTTGCAGGGAATTCGATATTCAAAGCTTATTCGGATACCCGTCGACCCTCG 

AGGGGGGGGGCCCCGGTNCCCCAGC I I I I IGGTTC 

Sequence 81 1 

CCGGGCAGGTACTTAAGAACTCTCCAGGTAAGGAGCATGGCTCTAATGGGAAATCCTTCA 

GGTCGGGGTGAAAGAGGAAAAAAAAATGGTTACCAAGGATGAACTGTAGCTCACTTAGGA 

ACTGAGCTTTCAGACTCCCATAAGAGCTAAATGATAAAAGAA l iTTTTTm I AAGAAAA 

AAGGAAAAGAAAGAAAAGAAGAGGAAGAAACCAGGAATTCAATAATGTCCTCAGAAAACA 

AAGCAGTGATTCCTTCCTCATTACTTTTGAGCAATGAGACCCCTACGCTCCCGCGTACCC 

TCGGCCGCTCTAGAACTAGTGGGATCCCCCGGGCT 

Sequence 812 

AGGTACTTTATTCTCTTATCCAGGACTGGATCAAATGATTTATGGCATTGTGCTGTTTTA 

ATGTTCTCATCCACAGGGTCGATTCCAATAACTTGAAGCCCCAAGCCGCCCTAGAGGTTC 

AGTTAACAGCCCACCACCACAGCCAACGTCAAGMTCTTCATCCCCAACAAAGGTTTTTN 

CTGGCTGGTGATTTAGGGMTTTGTTTTCAGAAGATTGTCCCTAATTAAAATGNCACCCC 

TCNAGGTCATTNCATNGGAAATNGAAGAGGNGCATATACTCCCTTGGTTCATNCCCACCA 

TTTGTGNAGCCANGGCCCAAGAAAGGTT 

Sequence 813 

CGCGGNGGCGGCCGCCCGGGCAGGNACTNTTGCCNGGGANGAAAAAACCCCGACAGCAGC 

AAACNGCAGAANCNNCAGACNGCAGGCTGCNGATGGNGAGAGGGAACNCCGCCCCANACC 

CACNGCCACNGAACCNGGCNGGGANACCAGNGGCCCNGGNGGANGCACCANAGANGAGGA 

GCCCGGGTNTTCTGGCCAGGGGGCNGCNCGCACCNCGGCCGCNCNAGAACNAGAGGACCC 

CCCGGGCNGCAGGAANNCGANANCAAGCNNANCGAAACCGNCGACCNCGAGGGGGGG 

Sequence 814 

GCATTGGAGCTCCA CTTTT GGTGGCGGCCCGCCCGGCAGGTACTCATCAGATGGAATGTT 

TTACCCCGCCGAGGTTTTAGTCATGATGTGCTGAGCTCTCTGCGTCTGACGTGACTGACT 

GGTAGCTGGGCGTTGGCGTGACCCTTCTTTCCCTCTAGGATCACATCATGGAGAATTCCA 

CCTATGGGAAAATTAAAATGATTGACATGCCCGTGGAAAGGATGCATCCCCATCTGTCCA 

AAGGCTATGCGTACCT 

Sequence 815 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATATCACAGGATTAAAAC 
TCCAGTTAAGCAACTGAGCTAATCATTGAAGTAAATTAAAAATACCAAGCTTCTTTAACC 
TATCAATGCTGTTTTAGAAGC ATCAT CCGAACAAATAGAGATTTAGTTATAAATTGNCTG 
GGCTACATTCTGTGATGAGAATTTTGCTTAGTACCACTAGAGGAAGAGAGACTAGAGGCC 
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CAAGAAGAAGGTGAAAGTAGAGCTATGTGCCCAAGAAAAACTTGTAGAGATGGAATAAGA 

AC TGAAA CAATAGTAGAAACACTTGCCCTGAAGAAAAAGGATGGAAAAATGAAATAGATT 

GAM 1 1 IAGCTGGTAGGGA AGAAA GTTAACATAGTCTTAGTCGGTTGTGTTTTCTCAAGG 

AACTGAGAAGCAGGATTAG1 1 I I IAGAAATTTGAANAAGGTAGTGAGAGGTGAAGCCCAG 

C TGGGTCG AGTGGGGACTTGGAAAACTTTTNTGTCGTACCTGCAGGCCTNCTACACCTAC 

CTTTTTTTTGGGC 

Sequence 816 

AGGGANCCNGGAGNCAAAGCAGCAGCCCCGGANGGNGCACNCCCCGGGGGAGACANGGGG 

GAGCCGCAGCCACCCTGCCTGGCNGGCACGGCACACNGGNNNGCAGACAGGCCCACGNAC 

CNGCCCGGGCGGNCGNCAAGAACAAGGGGACCCCCGGGCCGCAGGAANCGAAAAnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnNCNNNNGGNCCCNNAAGGGAG 

GGGAAAANGCGCGCCNGGCGGAAACANGGCAAAAGCNGGNCCCCGGGGGGAAAANGGNAA 

CCGCNCAACAAANCCACACAACAAACGAACCGGGAGCAAAAAGGGAAAACCCGGGGGGCC 

CAAAGAGGGAGCCAACCCACAANAAANGGGGGGGGGCCCACCGGCCGCNNCCAAGCGGGA 

AACCCGGCGGCCAGCCGNAAAAAAAGNAACGGCCAACCCCCCGGGGAGAGGGGGGGNNGG 

GGAAAGGGGCCCCCNNCCNNNNCCCCGNNCANAGACCCNNNGGGCCCGGGNCGNNCGGNG 

GGGGGGAGGCGGGAANANNCNAACCAAAAGGNGGGAAAACCGGGAANCCCAAAAAAAAGG 

GGGAAACCCCAGGAAAAAAAAAGGGAGCNAAAAGGCNNCAAAAGGCNCGAACCNNAAAAA 

GGCCCGGGGGNGGGGGGGNNNCAAAAGGNCCCNCCCCCGGGCNGGGAAAAAAAA 

Sequence 817 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGGGGGCCAGTTATTATA 

CTGCTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTACCAGTGGCCCCCCGCGTACA 

TGTGTGAAAACAATATTGTATACTACCATAGTGAGCCATGATTTTCTAAAAAAAAAAAAT 

AAATGTTTTGGGG 

Sequence 818 

GACTTGTAACTGGCATCTGAAGTGTGGGGAGTCCTGTGGGATTGAGCCCTTAACCAACCC 
TAAATACTTGTGCTAACTCCAGGTNGTTAGTATCAAMTTGAATTACATTGTTGAACATC 
CATTTGGTGTCTGGAGAATCANAGAATTGGTTGCT 
Sequence 819 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTGTAAA 

GAGGTGCAAGAAAACCAAAAACATAGGCATCCAAGGTAGAAGGCGTATCTCCAAAGAAAA 

CTGAGATGTTCCCAATCTGTTTGATAGAAGATTTAGGCACTCCTTGGCATCTCTGTATAT 

CTGTGCTTCCACTTCTCGGAGGTGGTAGAGGGGAGGCTGTCCTCTGGTCAGGAGAATCCT 

ATTCAGTGCTCCCTTAGACATTCTTCCAGGCAGGATCAAACTCAAAGGAAGAGGAATTTG 

TGAAGCAA 

Sequence 820 

CCGCGGTGGCGGCCGCCCGGGCAGGTACATTTAAGCGGAAGCATACGGGAATATGAGTGC 

AAAAGTGTGGCTGAGCCGCGTATGCCCTTGATTGGTTTTGGGAAGCCTGAGGGAGGCAGC 

CTTCCTGGCTATGAGCCATCGCCTGCCCAATCAGGCTAAGGGTGGTQACTGGGGTGGTGA 

AGGGGCAGCTCTGCTGAGCATGGTCTGCCTTATGGCCTGAATTGTCCTCAAGGGGTGTGG 

ACTGCAAATGGTGTTCACATGAACCGGAGACATCACTCTTTTAGGATTCTACTGGCAGCC 

CCTGAATTGGCTCAACGTTT 

Sequence 821 

CCGCGGTGGCGGCCGAGGTACTGTGGGATTTGTCTTTTATGAGTTACCACGTTCAGAGAC 

TAGTTCATCTTTGGTTGTATAGGATGTTAGGATCTCAGTTGGCATTCACAGTTAAATCAT 

GCACCTTCAGGAACTGGGAGCATTTTGATCCAGAGTCACAATCATTCCTTTCTTATCTTC 

ATCCCTATGGTATGTGTGTTCTGAAGTTTAACTGACAGATGGCAGCTGGTACCTGCAGGC 

CTCCTACACCTACCTCTCTCTGGGCTTCTATTTCGACCGCGATGA" r GTGGCTCTGGAAGG 

CGTGAGCCACTTCTTCCGCGAATTGGCCGAGGAGAAGCGCGAGGGCTACGAGCGTCTCCT 

GAAGATGCAAAACCAG 

Sequence 822 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCNCGAGGTCACTGACTTACNCCCTTCCCAC 
AGCTACAGATAAGGGCTCGCAAAGTTGGCCTCAGAGACACATCANGAACCAAGGTGGACC 
AGCAGGTGCCGAGCCTGTGTATCTGCTTGGAGGAGACGTTCCAATGTGCTGCTTGTTCAG 
AGATGGTGAGTTGCAAGAAACAGAAACCCACCACAATTTCTNAGGCAAAAAGGGAGTTAA 
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TTATAAGGACATAAGAGCACAAAGTTCCAGTGCAAGAGATACATCCAGGCTGCACAAGCT 

CCGGGAGTGGGGCCTGGCAAGCCAAAAGAAACCAAAGTTTGTCTTGCCTTCTGTTCCTCT 

TTCTGAAGCCACATAACCCTTTATTGACCNGNGTNTCTTTGCATCGCTTTTGTTTTCTTT 
TTATG 

Sequence 823 

NCGCCCGGGCAGGTACCACAAGAACTATGAGCTGGTTATCCACTTCATGTGGAATCATAA 

GCGTCCC AAAG TGACAATACATATAGATTGCCAGGCAGTGAAACAGTTAAGATGCCACCA 

TAGCTTTCTTTTCAACATCTTTCTAAATTACCTTACTATTTCTTTTGT^ 

CCT 

Sequence 824 

GCAAAGGGACAGAAAAGCNGAGATGAACAGNGCTGCAGCAAACACAAGCCGGGCAAGGGG 

GCNCCGAGCCNCGCANCCCCCGGCCGNNCNAGAACNAGGGnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCNATN 

GGNCCCCNAAAGGAGGGGNAAANGCGCCCCNGGCCGAACAAGGCAAAGCNGGGACCCGGG 

GGAAAANGGGAACCCCNCACAAAGCCCACAACAAACCAACCCGGAGCCAAAAAGGGAAAA 

CCCGGGGGGCCCAAAGNGGGNNGCAACCCACAAAAAANGGGGNGGGGCCCACCGGCCCNN 

NNNCAAGGGGGNAACCNGGCGGCCCANCGGNAAAAAAGNAANNGGCCACCCCCCGGGGAG 

AAGGCNGGANNGCNAAANGGGGCCCCCNNCCNNNACCNGGNAAAAAGAANCNNNGGCCCN 

GGNNNNNNGGANCNNrJNAGCCGGNNAANNCCCCNAAAGGGGGGGAAAACGGGGNNCNCAA 

NAAAANGGGGGAAACCCNGGAAAAAAAAGGGGGNNAAAAGGCCCNCAAAAGGCCCCGAAC 

CNAAAAAGGGCCCNNGNNNGGNNNNNNNAAAAGGNCCCCCCC 

Sequence 825 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAAGAGAAGCAGCTGA 

AGTTTCAGAACATGCTGTTATGCTTGCTAAGGATAGCAGAGAAGAGTTGACTCCAGATGC 

TAACATCAGAGAAGTTGGCCTTGAGGGGGCATTGTTGATCTTACTAGACAAGGAGACAGA 

TGAGAGATTATGCCATGATATCAAAGAGACTTTAAATTATATGCTTACATCTATGGCAGT 

GGAAAAACTCTCCCTGTGGTTAAAGCTTTGTAAAGACGTACCTCGGCCCGCTCTAGAACT 
AGTN 

Sequence 826 

GGGCNAATTGGAGGTCNCCGCGGTGGCGGCCGAGGTACGGGGGCCAGTTATTATACTGCT 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTACCAGTGGCCCTGGTGGATGCACCAT 

ANATGAGGAGCCTGGGAGCCTGGCCAGGTTTCTGCTGGTACCTGCCCGnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnTGTTCCCTTTTAGTGGAGGGTTAATTGCGCC 

GCTTTGGCCGTAATCATGGGTCATTAGGCTGTTTCCCTGTGGTGAAAATTGGTTATCCCG 

CTCACAATTCCACACCAACATACGAGCCCGGGGAGCATAAAAGTGTAAAAGCC 

Sequence 827 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAATAGCATGGGAA 

CCATTAACCAGCAGGCCATGGACCAACTGCAGTATCCTTTGTGACCCAGAACAGCACCAA 

GGTCAGAGGGCACCTGTATTCATAAACCAGCTGCCTGACTGTGAATCCTGATGAATCAAG 

CTCAAAAGGAGAAAACATAAAATNCATAAAGTACCTN 

Sequence 828 

TGGCGGCC CGAG GNCAACTTTAMCTGTATTGTATTCATGTTGCTAAACAATATTGGCCT 

TCTCGA TNAT TTTATTCATGTTGCTCCAAAGTTAAAACCCTGTAGAACTAAGTAGGTGAA 

GAGATATTTTGTATAAGTGCCCAGAAGAGAAAATTAATAATTAAATAGTGAATTGAGCAT 

CACTAGAATAAAATAAAATG AGTA GGCATTCCTAAGATGTGAAATGATCACCTAAGATAT 

ACATGCTCCAACCATATTGATTTTAGAACAAAACACAGCAGCCCATAACAGTTTGTGGCC 

TCTACTAACTGNCCTCTGCTGCCCATCTAGAGGTTATGTTTCTCTATTTrTAAAATAAAA 

TGTA 

Sequence 829 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTC 1 I 1 I I I I 1 H I M 11 1 1 1 I 1 1 I 
GGTTGAAAGATTCTAAAGTGTGTTTTGATCATGTTTT 

TTACTGAATCCAACCATT CCATC TGGAACTTTGTATTCTTCTGTCATTACAGATCTGCTA 
AGAAAACAGAAAGCACCATTTTCCTTCAATGAAAGATACTTTGTTTACAAATAAAAGATA 
ATCTATTAATAAAAATGGAAGTCTAATGTCTCCCAATGGCTTAAACTAAGGTTCTAGGCT 
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CAGACTCATGTCAAAAMTATGATCTTATTACTTGAGTAGTTAMTTMGAAAmAAGA 
TGACACATCGAGTGAAGMGGGATGATGMTGGAACAGTCTGAAGGCATT^ACTGTAAAG 
AAGTTGAAGTGGAGTTTAGAGAATGTCATCTTAACAGTTACTCTAATTCACCCCTGT . 
Sequence 830 

GNGGTGGCGGCCGAGGTAC H M I I 1 1 1 I 1 1 I I 1 1 1 11 I M 1 I I 1 I H 1 GTTTCTNGGNT 
TCTGAGTCCAGTCCCCTTNGCATAAACNCCCGGCGNGGCTNTCTCANCGTCCCCGCACCT 
TCATGCNCAGNAAATTCTTGGACMCtTTGTCTTTCTCT 
Sequence 831 

AGGGCGAATTGGAGCTCCCCGCGGTGGCCGAGCGGCCGCCCGGGCAGGTACGGGAAGGCG 

AAGAAMGAATAGAGMGATAGGGAMTTAGAAGATAAAMCATACTTTTAGAAGAAAAA 

AGATAMTTTAAACCTGAAAAGTAGGAAGCAGAAGAAAAAAGACAAGCTAGGAAACAAAA 

AGCTMGGGCAAMTGTACTGTTTGAGGTTTCAGGAGTTCACTGGGGGCCTTGGAATGTA 

TCTCCCGAAGATAAGGGGGAACTACTGTAAGCAAAATCGAAAGCTATACAACAATCAGAA 

ATGGGAAGAGAGCAAATCCAAGTATAAGTGAAGACATTCCTCTGAATTCCCAGCATTCAT 

CGAGCATCAAAAACGACCMTCTTTTAGTCCTTTTCTTGTACCTCGGCCGCTCTAGAACT 

AG 

Sequence 832 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGGGACATTCCAAATGGC 

TGCAAACTAATCAGGAATCGATCGGATTGTAAGGACATTGATTGGACGACATATACCTGT 

GTGCTAGGATTTCAAGTATTTGGTGTCTGGCCAGAAAGGATCTGATGGGACAGATATCAA 

TGCACTGGTGCGATCCCACAATAGAAAGGTGATAGCTGTTGCCGATGACTTTTGTAAAGT 

CCATCTGTTTCAGTATCCCTGCTCCAAAGCAAAGGCTCCCAGTCACAAGTACCTGCCCGG 

GCGGCCGCTCGACTGGCG I I I I I CCATAGGC 

Sequence 833 

CCGCGGTGGCGGCCGAGGTAC I I I 1 I I II I H I I H H I H I 1 1 U 1 1 1 1 CCTAGCCTTC 
GAGGCCCACCTTCTNTNTGAAGTCTTCCCTGGCCACCCTGGGTCACACTGTGCTGCCTAG 
G AACTCT AC AG C AG N G AATGCCTGC ATG ATTN ATTTGG N ACCTG CCCG 
Sequence 834 

NGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGGTACGCGGGACAAAATAAGAAGGC 

TTCCTGAAAGCACTGCTGCTTGGCATACTTCTTGTAGTAACCCTGTCACCGTCTGCTTTT. 

TGAAGCTAAACTAGATCCAAGAAATTGACATAAATAATGAGCAAGAAAATGATTATAAGA 

GGATATTGCCTCATTTTCCACATATATTCCTTATTCTGMGACTACTTTGAGGTCAACAT 

TCCAACAGACCTACGAGCAAAACATTCTGGGGAAATAAGTGAGAGAAAGGAAATTGAAGA 

ACTATCAGAAGCTTCAAGAAACACCATACCACTAGCAGTGGTGCTTCCCACTGAAATTCC 

ATGTGAGAATCCTGGTGAAATATTCATAATTTTGAGAGATGAAGTAATTGGTGATACTGT 

AGAGGTTGAATTTACATCAAGTAATAAGCGCATTANAACACGGCCAGCCCGTTGGAATAA 

GAAAGTCTGGTGCATGAAAGCTTTAGAGTTTNCTGCTGGTTCAAGTCCATGTCAATGTCT 

ACTGTGATGGAATCGTTAAAAGCTACAACCAAAATTAAGTACCTGCC 

Sequence 835 

GAGGTACAGGTGAATGCCANNCTAGTCATCTCCCATGAAGAANMTTAATNCCTTTGCAC 

AAAT CCATT CCTTCTGATCCAGAAGAAAGAAAGCAGTGAGTGATTTATTTTCTCCATTGT 

AA GATT nTGTTATTCAATAATTGCAATG AATGTGCTGAA 1 1 1 1 1 ATGGTGTTATTGGCT 

AGTTT TGAAAAAAAAMGGAAGAAATAAAGCATATTATTTTGAAGTTCTAATGACTCACG 

TTTTTCAAMTTAATTTTTCAGTT^ 

Sequence 836 

GGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTCAAGGCCCTAATCTCAATACCAT 

CACATTGGTGGTTAGGGCTGTAACATTGGGATATGGGGCAAGACACAAACATTCAATCCA 

AAGCAAGCATGTCCACACTGTTGGGACTCCAATCCCACTTTTGCATTGAGTCATTATTTA 

ACCACTGTACCT 

Sequence 837 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACMTTTTAATTTACAACTGTTG 

GAAATAAAAATCACTTAATTTTTTrCCAGTGCTTCTCCCTCATCTGGTTATTCAAGAAAA 

CAAAACAGTCTCTGAGATATTTAGCTTTTCAAACTGTAAATAGATGCTCTAGTGTTATTT 

ATTTTTTAATCCCACTTGTATTAT1TTACCTCTAGAGCATCTTGTATTAGGACATGTTAT 

ATTTATGCCAGTGGGAAATMGTTATGGCCAAGTTTTGCAAAAACAGGAAGCAGTGAGAT 

ACTTG I Mill I CTCCTCACTAAATATCAGTAATTGTCAGGAATGGTATTACCTATTTTC 



Page 112 of 635 



WO 01/51628 



PCT/US01/00798 



Table 1 



ATTTCCTCTTTTCAGCTTTAAGCTTTGNTGGATTGGGACACTAAAACTGATGTATACCTG 
AGGGAAAAAATAGAATGTGCTCATGGTTAGGGAAAATTATATTAI I III AA I III ITAAT*. 
TTGTTTTTTAACCAAGTAGGAATTTGGGTGTATGGATAAGANGCCAATCTGCTTCTGTAG 
GCTATAGAAGAAAACCAGTTGTATTTTATGGNCATAAT 
Sequence 838 

CGCCCGGCAGGTACTGGGGGTAAGCTGAGAACTATCACGTTCGTTTCTTCCTCGTCCTCC 
CCCTTNTCTGCCTTCTCCTTGAGAACGTCTTTGAAGTTGGGTCCACTGTTTAGGGTTGAC 
TGCCTGTACCTCGGCCGCTCTAGAACTAGTGN 
Sequence 839 

GNACCCAGCGGGGGGGCCATTTAGAGAGGGGNAAGGGCGCGCTAGNCGAACANGGCANAG 

CNGACCNGGNAAAGGNAGCNGGNCACAAANCCACACAACAAACGAGCCGGGAGCANAAAG 

NGGAAAGCCNGGGGAGCCAAANGAGNGAGCGAACNNACAANAANNGCGNNGCGCACACNG 

CCCGCGAnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnCGGANNGCGGAANGGGGCGGCACAAANCCGCANACACGCGNAACAGANNNGC 

NGGAGCACGGACGNAACGGCAGNGGCGAGCGGAACAGCNAACTCAAAAGGCGGGAAAAAC 

GGGNGAGCCACAGAAANNAGGGGGAAAAACGGCAGGAAAAGAAAAANGGGGAAGCAAAAN 

GGCCAGCAAAAAGGCCAAGGAAACCGAAAAA 

Sequence 840 

TNGGAGCTCCCCGCGGTGGCGGCCCGCCCGGGCAGGTACGGGGGCCAGTTATTATACTGC 

Tnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTCCAGTGGCCCTGGTGGATGCACANG 

ATGAGGAGCCTGGGAGCCTGGCGAGGTTTCTGTCGAGCGGCCGCCCGGGCAGGTACCTGA 

CTGTGATTTCAAACCCCAGGTGGTTACTGGAGCCCATACCTAGGAAAGGAGGCAAGGATG 

TATTCCAGGTAGACATCCCAGAGCACCTGATCCCTTTGGGGCATGAAGTGTGACAAGTGT 

GGGCTCCTGAAAGGAATGTTCCANAAAAACCAGCTAAATCATGACACCTTCAATTTGCCA 

TCATGACGCAGACCTGTATACATTAGGTTAAA 

Sequence 841 

AGGTCAACTTTAAACTGTATTGTATTCATGTTGCTAAACAATATTGGCCTTCTCGATGAT 

TTTATTCATGTTGCTCCAAAGTTAAAACCCTGTAGAACTAAGTAGGTGAAGAGATATTTT 

GTATAAGTGCCCAGAAGAGAAAATATAATAAATAAATAGTGAATTGAGCATCACTAGAAT 

AAAATAAAATGAGTAGGCATTCCTAAGATGTGAAATGATCACCTAAGATATACATGCTCC 

AACCATATTGATTTTAGAACAAAACACAGCAGCCCATAACAGTTTGTGGCCTCTACTAAC 

TGTCCTCTGCTGTCCCATCTAGAGGTTATGTTTCTCTAI I I I I AAAATAAAATGTAGTTA 

AATTAGCCTGACGGATGTTTCCTCTCTATCCCTTATCTCAGACACATACAAAACAGACAG 

ACAATAACAAAAAGACATCATCTTTATCTGTACCTGCCCG 

Sequence 842 

TACCAGTGGCCCTGGTGGATGCACCATAGATGAGGAGCCTGGGAGCCTGGCCAGGTTTCT 

GCTGGTACCTGCCCG 

Sequence 843 

CCGCGGTGGCGCGGCCGCCCGGGCAGGTCAGCAGCTTTGCATGTTGACAAATCCACTCTC 
TGCTGCAGATGCCTAGGGGAAGTTGCAGACTTAMTTTTCTTTTGTAAAATGGGGGAACA 
CAAACAGATCTTATGTCACACTGGTTACTCCAAAAAAAAAAAAAAAAAAAAAAAAGTACC 
T 

Sequence 844 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGTGGAGAGGACCATGTGAAGAGAGAA 

GCTGAGACTGAAAAGGATTTATGTATTAATATTGACAGAAGCCAAGGAACACCATCTGAA 

GTTCTGAC GGC AAC ATC AGAAGCTAAGAGAAAGG CATGGAAAAG ATTCTN ACCTAG AGC A 

TCCAGAGGAGAGGTTGGTCCTGCAGACACCTTGTTTTCTGACCTCTGACCTCCGCAACTG 

TGAGGGAAGAAATTTNTGTTGCTTAAAGACACACAGCTTGTGGTACCTnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTCGATNTCAAAGCTTATTNGATACCCGTCC 

ACCTCNAAGGGGGGGC 

Sequence 845 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGTCTACGAAGTCCCATGGCTTTTCCAG 
TTCTCTCTTGCCTTGAGGCTCAAGAACAGTTTGTTTTAACTGAGAGTCTGGGAGGAACTA 
CAGTTTTCCCCAGAGTAGTCTTGGCGGACTACTGGAAGTCACAGCCAAAGAAACTCTGTG 
ATTACTGCAAGTGCTGGATAGCAGACAATAGGCCTAGTGTTGAATTTCATGAAAGAGGAA 
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AGAATCATAAGGAAAATGTGGCAAAAAGGATCAGTGANATTAAACAAAAAAAGCNTGGAT 

AAGGCAAAGGAANAANAAAAGGCATCAAAAGGAGTTTGNTTNCAATGGAGGGCANTTTGC 

CNTNGAAAGNCTTTCCNAGNNGGNTTTTTTNAAAAANACNTTNGNTTTAAAM 

AANTTTTTNGGGNCCCCAGCCTTTTNANCCCCTTGTAANCCCNNGTCTTTTTT^ 

CNTNCCCTTGGNATTTTTNNAANTTCCCCNNGGGGGG 

Sequence 846 

ACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACATTTCCTAAGTATCACA 

TGACTTAACAGGTGCAAGCTGGCAGGCATCGGACTTGGTATCCGTAAGACCTGTTGCCCC 

TTTGCCTTCTGCCATGATTGTAAGTTTCCCGAGGCCTCCACAGAAGCTGAACAGATGCCA 

GTATCATGCTTCCTGTATAGCCTGTCGCCTTTGGGCAGCTGGACACCTCACCCACGCTTC 

AGGCTGGAGCCTTCCTCATCTTTATTAGGTGTCTTAGAGGCAAGTGCCGGAGGTAACATC 

CTCCTCTGCATTTCCTACACTGACAAGGAGAATGCTGTGGCTCTACGCAGTCATCCTTTT 

ACAACTTTTTTGCCAGGGAGCAAAAAGTCTTGNCACATNAAGTCGAATGAAGTACCTTGC 

CCCGGGGCGGGGCCGCTTNTANAAACTAAGTGGGGATTCCCCCCGGGGCTTGGCAGGGAA 

ATTTTCGAATTNTTNAAAAGCNTTTAATTNGATNACCCGGTTCNAACCTTCTAAGGGGGG 

GGGGGGC 

Sequence 847 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGCTGTGTAGCAAAC 
AGCCCTAMCCAAGTGACTTAAMCAACCAT 

TTGGGCTCAGTTGGGCGGGTTTTGCTGGTGATCACTCTAATGGCTGCTTTTCTCTCCAAG 
TAGCCTTTCCAGCAGAGTTGCTGGCCTTACCTGGTAGCTGATGGCTTAGGGCTCACAAGA 
GTACCT 
Sequence 848 

CCGCGGTGGCGGCCGAGGTACGCGGGGTCGGCAACTTTGGGAACCACCAGTAGGATGTGG 

TTAAGATTCAGTTCTTGCTGAGCTAAGGAAGCATTTCTCACTTCTTTTTAATTGTCTGGC 

TCACTTCTAGTCCCTAACTAAATGCTCACTCAAGAGTTTTAGCTTGAATGTCAAATGTCA 

AAAAATTAATTGGGTGATCTTTCTCCATTTCTAGGATAAGAAGAAAGAGAAGAAATGAAG 

TGACCATCCAGCCTTTCCCAATTAGACTTCCTCTCCTTCCACCCCTCATTTCCTTTTTGC 

ACACATTACAGGTGGTGTGTTCTGTGATAATGAAAAGCATCAGAAAAGCTTTTGTACCTG 

CCCG 

Sequence 849 

CGACTNCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCGTGTTTTAAGC 

TTAGTTCAGTCTCAAGTGTTTGCAGCCACATCTGAAGACCAATAAAGCAACTGCTGGTTT 

ATCCTTTGGGAGCTGACAGAATTTCTTCTCCCAAATACATACACAGTAAAATCATAAGCC 

TGGAATGAAGAAAAAAAAATCTTACGGGCAATGCAATGGCTGCAAAACTATAAGGATTAG 

AAATGTGMCCCACATTTTMTCCAAATTAGGGCAATTTAGAGG TGGTA GCCGTAAAGAA 

TAGCTTGCTGTAATATACGCCATGCCTGGATACANAAATTGGGCTTTTGGCCCTTGTCAA 

AAAATTAAAATTGTGCCTTTCNTGTATTTGATGGGTGGGGCATGT 

Sequence 850 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTGGCCTCTCTG 

GGATAGAAGTTATTCAGCAGGCACACAACAGAGGCAGTTCCAGATTTCAACTGCTCATCA 

GATGGCGGGAAGATGAAGACAGATGGTGCAGCCACAGTTCGTTTGATTTCCACCTTGGTC 

CCTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnCGCGCTTGGCCGTAATCATGGGTCATAAGCTGTTTCCTGTGTGAAAATTGTTT 

ATCCGCTCACAATTCCCACACCAACATACCGAAGCCCGGGGNAGC 

Sequence 851 

CTACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACCAGGCAA 

AGTTGCTGCCAACACTCTGACnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnCGTCATCACTATTTCTCCAGTGGTATCTGGGAGCCAGA 

GTAGCAGGAGGAAGAGAAGCTGCGCTGGGGCTTCCATGGTTCCGTCTGGGTCCTAACTGA 

GCAGTTCCTCCCCCGCGTACCTCGGCCGCTCTAGAACTAGTGGGATCCCCCGGGCCTGCA 

GGAnnnnnnnnnnnnnnnnnnnnnnnnnnCGTTCGACCTCGAAGGGGGGGGGCCCCGGTA 

CCCCAAGCTTTTGTTCCCTTTAGTGGAGGGGT 

Sequence 852 

ACGACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAGGCAAAGT 



Page 1 14 of 63 5 



WO 01/51628 PCT/US01/00798 

Table 1 



TGCTGCCAACACTCTGACnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 
nnnnnnnnnnnnnnnnnnnCGTCATCACTATTTCTCCAGTGGTATCTGGGAGCCAGAGTA 
GCAGGAGGAAGAGAAGCTGCGCTGGGGCTTCCATGGTTCCGTCTGGGTCCTAACTGA 
GTTCCTCCCGCGTACCTGCCCGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 
nnnnnn^nnnnnnnnCTTATCGATACCCGTCGACCTCGAGGGGGGGGCCCGGTTACCC 
AGCTTTTTGTTTCCCTTTAGTGAAGGGGTTAATTTGCC 



Sequence 853 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGGCTCACAAAGCCCAAAATATTTACTA 
TTTGCCTCTTTACAGATAGTTTGCTGACACCTGACATATAGGAATGGGGGACATTGCTTC 
TCACCACCCCCAGCTCTCTCACTGGATGGTCCATGTATACAAAGTAGAGCCCTTATTAAA 
TGCAAAGATGTTGGTCTAAACCCTGCTTGATAGGATGATTGAGTAAATTCAAGAAAGTGG 
TAATGAGGGCTGGGTTGCCGGTGGCTCACTCCTATAATCTCATTACCTTGGAGGCCAAAG 
GNTGGGCAGATCGCCTTAGATCAGGAGTTT 



Sequence 854 

GACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAACTT 

TAAACTGTATTGTATTCATGTTGCTAAACAATATTGGCCTTCTCGATGATTTTATTCATG 

TTGCTCCAAAGTTAAAACCCTGTAGAACTAAGTAGGTGAAGAGATATTTTGTATAAGTGC 

CACAGAAGAGAAAATATAATAAATTAAATAGTGAATTGAGCATCACTAGAATAAAATAAA 

ATGAGTAGGCATTCCTAAGATGTGAAATGATCACCTAAGATATACATGCTCCAACCATAT 

TGATTTTAGAACAAAACACAGCCAGCCCCATAACAGGTTTGTGGGCCTCTACTAACTGGT 

CCTCTGCTTGTCCCCATCTAAGAGGTTAATGGGTTTCTT 

Sequence 855 

ACTACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACGGGGGCCAGTTATT 

ATACTGCTnnnnnr.nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTACCAGTGGCCCTGGTGGA 

TGCACCATAGATGAGGAGCCTGGGAGCCTGGCCAGGTTTCTGCTGGTACCTGCCCGnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnCGTCCGACCTCGAGnnnnnnnnnnnnnnnnnnnnnnnnTGTTCCCTTTAGTGAAGGGG 

TTAAATTGGCGCCGCTTGGGCGGTAATCAATGGTCATTAGCCTGTTTT 
Sequence 856 

ACCGCGGNGGCGGCCGAf^GAAAGACAAAGGTGTCCAAGAATCTGCTGCGCATGAAGTTT 
ATGCAAAGGGGACTGGACTCAGAAACCAAGAAACAACTAGANGNAAGAAGGAAAAGTAAA 
ATTCA TTTAGG TGGAAGGAGGCACCTGNGGTGCTCTCNGGCNCGGCCC Ai rfTTTTTT CC 

ccnci rrm innaaaaangg ggngg ggncnaaaaaaanaaaaaatcccccccctttttt 

TTTAAAAA AANGGGGGGTTTCTTTT^ 

» i m i i MGCNCCC CCTNNCCCCCTTTCNTTNNTNCCCCTTATNNCCCCAATGTNANGGG 
GNTTTTTN mil I ATNCCCCCCCN T\ HUM I NGGGGGGNGNGCCNCCCCNCAAAAAA 
NAANTATTTGTTGGGGGGGGTTTAANAAAAAAAAAAAAAAMTATTTTT^ 
TNTTNNNCCCCCCCCCCGGGGGGGNGGGGGAANAAAAAAAAA' i I III I It I ( 1 I IA AAAA 
AAAAAAA 
Sequence 857 

GCGAGGTACACCCTGCCCCCATCCCGGGATGAGCTGACCAAGAACCAGGTCAGCCTGACC 
TGCCTGGTC AAAGGCTTCTATCCCAGCGACATCGCCGTGGAGTGG N NAGAGCAATGGGCA 
NCNCGGGAGTA ACAAATCTTA CAAGGAACCTCCCNAAAAAAANACCCCCCAAAAAAANCC 
CCCTTTACCCCCI MINIM I NTNATAANNNCCCCCGCCCCCCAAAAAAANTTTTTTTN 
GGGGG GGGGGNCCCCC CC > I M M I I I NGGGGGGGGGGGGGGGGNNNNAAAAAAAAAACN 
CCCCCCi rTTTTTTTl l l NNCCCCCCCCCCCCGGGGGGGGNGTTAAAAAAAAAAACCCCC 
CCCCG GGGGGGNGGGGGGG GGGGGTNGCCCCCN 1 M M 1 1 I M 1 1 I I CCCCCCCCCCCCC 

cccnn rrTTTTTTTTrrrm inctcc cccci rmrrn ngtggnttttttaannannt 

NNNNNNTANACCCCCCC I I I I M I I I I I 1 H NCCCCC I I I H I I I I I I 1 1 1 1 AAAAAAAA 
AA^CCCCCCCCAAAAAAAAAAAGGGGGGGGGNTTTTTTCCCCCCCCCCTATAAA^ 
TTTTNATTGGGGGGGGGGGGGTNTGTCTCCCCCCCCCCTN T M I M 1 M M I 
Sequence 858 

AACACTACTATAGGGCGAATTGGANCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCG 

GGGGACGCGCGTCTGTGGAGAAGCGGCTTGGTCGGGGGTGGTCTCGTGGGGTCCTGCCTG 

TTTAGTCGCTTTCAGGGTTCTATGAGCACCCTTTCACAGACCCGTTCACCAATGGGTAAG 
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TTGTNACACCCAAAAMTTTTTTTNTTTTTTTGG^ 

NNNTGGTNCCCCCNCNCCCCTTTTTTTGNGGGGGGG 1 M i ll ITNAAAANAAAAAAATNA 
AAAAAAAI J II III I GGGGGTNTTTTTAAAANTTNAAAAATTNNNANCATTGNAAAAAAT 
I I I I I I i I I AAAAAAAATTTT 
Sequence 859 

CCGCGGTGGCGGCCGAGGTACCAATAAGCATATTGCTTTGGCAATGCATNTCCAGAGCAG 
GTGACCCTGGCCGNCTGTCCTGGGGACACTGACACCNTAGGGTGGCTGTGTCAGTTCATA 
GGG AGNGTA CTCA CNAGC CCTGTTGGCACTANTAGNTGNAANGTATTGGGTTGGGGNTTA 
GNGTTTT^GNGGGTTTTAAA 

iTTTTTTTI IANAAAAAAGAGGGGGGNGNAAAAAAAATTTTTTTNNGGGGGGGGGGGGGG 
GGNTTTAAAAAAAAN rTTTTTTTTTTI MCCCCNCCCCCCCCCNAAAAAAAAGGNGGGNGG 
GGGGGGNGTTNCCCCCCCTTGCCCCCCCCNAAAAAAAAA1 I 1111111 I GGGGGGNGGGG 
GGGNCNCNTTTTTTTGNGGGGGNTTm 

CCCCCCCAAAAAAAAAAAAANCCCCCCCNGGGNNCCCCCCCCNTNNCCCNCNNCATNAAA 
AAAAAAANGGGGGl M M I I M I NAAAAAAAANAAGGGGGGGGG 
Sequence 860 

TCGCACTCATTTACCCGGAGACAGGGAGAGGCTCTTCTGCGTGTAGTGGTTGTGCAGACC 

CTCATGCATCACGGAGCATGAGAAGACGTTCCCCTGCTGCCACCCTGCTCTTGTTCCACC 

GGTGNAGCCTTGGCTGTAGTAAAAAAGGGTGGGGNTTTTTNAAAAAAAAAAAAGGGGGGN 

NAAAAAAAAAAAAAAAGGGGGTTTGGGGGGNTTTNAAAAAAMGGGGGTTTTCCCCCCCN 

TCNCCCCCCCCCGGGGGGNI I 1 1 I 1 I 1 I 1 1 I I NCCCCCCCGGGGTTNGGGGGGGGGTTTN 

ANAAAAAAANGGGGGGGTN 1 1 1 1 1 1 1 I I I I NCCCCCNCCCCCCCCCCAAAAAAAANAGGN 

GGNGGTTTTNCCCCCNNTAAAAAAAANTCCCCCCCCCGGGGGNTNTGGGGGGGTTTTNTN 

TTTTTTTAAAAAAAAAANNNNNTTNNGGGGGGGGNTTTTTCCCCCCCCCCCGGG 

Al I II I I n I I I I 1 1 NNGGGGGNTTNNNTGGGNNNNAAAA I I I II I I I I I I 1 1 I I ICCCC 

CCCCCCCI II I I II I INNNl I I I I I I I GGGGGGGGGGGGG I I I I I I I II I I I I I I IAAAA 

Sequence 861 

CCGGGCAGGTACGCGGGGGAACTGCTCAGTTAGGACCCANACGGAACCATGGAAGCCCCA 
GCGCAGCTTCTCTTCCTCCCGCTACTTCTGGGCTCCCAGAATACTCACCTGGNAGTAAAT 
ANGTGGATGTACGTCAANTTCTTCNCAAGNNCCCCNTACCCCCCNAAAAAATGNCCCCNG 
NNCCCCCTGTGCCCCCANI Mil IGGGGNGNAI I I II I I I INCCCCCI I I I I I I I IGGGG 
GGGGN I I I I n I I GGGGGGGGTNTN I II II 1 1 ATTCCCCCCCC I I I I I I M 1 1 I I INCTT 
AGAACCCCCCCAAAAAAANAANNNATN I II I I I I I I I I GGGGNNTTTATNGGNTNTTTTN 
CNAAAAATTTTAAAAAATNTTTTATAGGGGGGGGNGNGTGNNAAAAAAAAAGGGGGNGGG 
GGTCCCCCCCTTNCCCCCNAAAAAAAAA 
Sequence 862 

GCGAGGTACGCGGGCATCCAGCAGAGAATGGAAAGTCAAATNACCTGAATTGCTATGTGT 

CTGGGTTTCATCCATCCr^CATTGAAGhrTGACTTACTGAAGTAATGGGAGTAGCAGGAAG 

TTGTAAACAAGGTGGGNAGACATATGCAAGCATCTTGNGTrrTTNNNGNCCCCCNTTTTT 

TTTTTTTAAAMTTTTTTTNGGCCCCCCCAAAAAMNNGGGGGGGGGCCCCCCT^ 

AANACAAAAAAAAGGGGGTTTTNGGGGGGGGNTTNAAAAAAAACCCCC^ 

TTTTGGGGGGGGNGGGGGGGGGN I I I I I 1 1 1 1 I I I I I INCCCCCCCCI 1 1 I I I I IGTTTN 

TNNNN I I I I I IANGNCCCCCCCI I M 1 1 1 I IGAAAAAAAAAI I I 1 1 II I IGGGN CCCCCC 

CCM I II I INAAAAACCCCCCCI I I M I I I I I 11 I I I NGGGGGGGGCCC I I I II M M I I 

AAAAAAAAAAAAAACCCCCNTTTTTTNCCCCCCN I I I I I I I I I I I I NGGGGGGGGGGGTN 

CNCCCCCCATNGTGGGGNCCCCCCCGNGGCCCCCCCCCCNGGNAAAAAAATGGGGGGNTT 

TTTTTGTAAAAAAANGGCCCCCCCCCCCCGGGANANNANTGNGGGGGGGGNGTATTNATT 

TTNCTNTTTAACCCCCCCCCCNGGGGGGGG 

Sequence 863 

CCGGGCAGGTACTTTGGCCTCTCTGGGATAGAAGTTATTCAGCAGGCACACAACAGAGGC 
AGTTCCAGATTTCAACTGCTCATCAGATGGCCGGGAAGTATGAAGACAGNATNGGGTGCA 
GTCTCACCAGATTTCCGTTTTGN ATTCC 1 Mill I NCCCCNGNTCCCCCNAAAAAAAGGG 
GGNTNTNNCCCCCNA I I M I M M I M M M I GGGGGGNCNNNGGNNNNAA1 1 M M I M 
TATAACCCCCTTTNCCCCCAANNCCCTTNTATCCCCCI I I M M I NNGGGGN NGGGG GGG 
GGGGANTTCCCCCCCCNNCCCCCCCCNAAAAAGNTAAAAAATTTGAAAAANATTTTTAAA 
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AAANGGGGGGGGN I lllll l lll I NGGGGGGGA I nTTTTTI I ATN ANAAAAAAANCCCC 

cccccNCNCCcccccNm mrm iacccaccgggngnaananggggnnttnnntttt 

CCCCNCCTNNN NANCCCCC CCNGGGGGGNGGNGGTAGTNCCCCCNNAA I rTTTTTTTTTl 
TTGACCCCCCCM 11 U II I I 11 11 1 1 AAAAAAAAAAAGGNGNGGGNGNTGGAAAAAAAA 
ANGTNNNAAANACTTTNCCCCCCCCCNC I llllllll 1 NTTANAAAAAAAAAAAG 
Sequence 864 

CTCCACCGCGGTGGCGGCCGCCCGGGCAGGTACCAGCAGAAACCTGGCCAGGCTCCCAGG 
CTCCTCATCTATGGNGCATCCACCAGGGCCACTGGTATCCCAGCCAGGNTCAGTGGCATG 
TTGGGGTCTTGGGGACAGTAGGTTTCACCTCTTCACCGCATCAAGNCAAGGCCTNTTGGG 
GGTTCCCCCCNAAAAAAAAAGGGGGN ( I II I I II I II I CCCCCCCCTTTTTTTGGGGGGG 
GGAN AAAAAAA AAAAAANGGGGGNGGNGANAAAAAAAMTTW 

CCCCi Mil NGNG GGGGGGGNTTCCCCCCAAAAAAAAAANNGGGGCCCCGNTATNNGAN 
TNTTNT TTAl MM I AATAAAAAAAAAANCCCCCCCCC I llllllll I A AAAAAAANCCC 
CCCCCCl M M M IGGGGGGGGGGNTTNTTTTTTTATNTCCCCCTNAAAAAAANNNGGGG 
GGNN GCCCCCCCNCC NNNAAAAAAAAANGNGNGAGTGNATCTTATTTTATNTCNAANAAA 
AATNNT1 I i I I 1 I I I I I NTANAATNAAANAAAANANAAA I I 11 II I I I 1 1 I I ATAATAAN 
ANAAGAAATAAATTTTGCTCCCCCCCCCN I I I I M I 
Sequence 865 

AGGTACCAGCAGAAACCTGGCCAGGCTCCCAGGCTCCTCATCTATGGTGCATCCACCAGG 

GCCACTGGTATCCCAGCCAGGTTCAGTTGGCAGTGGGTCTTGGGGACAGNAGGTTCACTT 

CTGCACTCATCAGGCAGNCCTGNCAAAAGGGGNN I 111 1 I I I 1 CCCCCCCTTTTTTTGGG 

G GGGNTNNAA AAAAAAAAAAGGGGGGGNGGNAAAAAAANTTTTTT 

Cmiiiiii iGGG GGGGNNCCCCCCCNAAAAAAAAAAGGGGGGGGNNNTTTTTTNNNNT 

m M M M I I 11 I I I AAAAAAAA ANCCCCCCC C I II I I II I 11 I NAAAAAAAAACCCCCC 

CCNi M M M l I NGGGGGGGGT"! I I I i I I I I 1 I I CCCCCCCAAAAAAAAGGGGGGNNGGN 

NCCCC CCCCCAAAAAAAAAAGGGGGGGGN rTTTTTTTTI 1 A AAAAAAAAAAA I 1 1 11 II I 

MM l AAAAAAAGGNNAAAAAANNGG I llllllll I NTNGNAAAAAAAANNGGNAAAAAA 

ATNGGTNCCCCCCCCCCCCCTTTTTTTTTTTTT 

Sequence 866 

TCGAGGTACCAGCAGAAACCTGGCCAGGCTCCCAGGCTCCTCATCTATGGTGCATNCACC 
AGGGCCACTGGTATCCCAGCCAGGTTCAGTGGCAGTGGGTCNGGGACAGTAGTTTNACTC 
TGC ACTC ATC ANTC AAN GCTTTGTC N N ATGG N ANTNTTTTATTATNCCCCCC 1 11 11 1 11 
GGGGNGGT NNAAAAANTT AAAAAGGGGGGGGGTGAAAAAAAAATTTTTNTG I l l lllll l 
TCCCCC CCC1 1 MI1 I MTNGGGGGGGGTTCCCCCCCTCANAAAANAAGGNNGGAGNTNN 
M I M I NATATA TTTTNTCNNCT TTAAAAAAAAATATCCCCCC' l llllllll 1 AAAAAAN 
AAAATCCCCNC1 lllllllll I GGGGGGANNAAATTTTNNTNANTNCCCNCCAANAAAAT 
AANNGGGGGGGGGNCNTCCCCCCCNTAAANAAAAAAANANNNTNNTTG 1 I I 1 1 1 1 1 1 AAA 
AAAN AAAATA I I I I I I II I I I ICTCC 
Sequence 867 

CCGGGCAGGTACACTTTTGGTCAGGGGACCAAGCTGGAGATCAAACGAACTGTGGCTGCA 
CCATCTGTCTTCATCTTCCCGCCATCTGATGAGCAGTTGAAATNCTGGGAACATGTCCTC 
^^?Z11Z^Z5^TI^S ^ ^ NT AAAAAN AAAANTTTTTTTAAAAAN NAAAAATNCCCCC 
CA1 1 m I I NTTTTNTGGACCCCCTNTTTTnTAAAAAAA 1 111111111111 I CCCCCG 
GGGCCCCCTTTNCCCCCGGNAAAAA AGGG GNGNNTTTNAAAAAAAAMNGGNGTTTl^ 
AAAA AAGG GGGNNTTNGGGGGGGNGTTTTCCCCCCTTNTCCCCCCCCCAAAAAAAGGNAA 
AAAA TTTTAAAAAAG GGGGGGGNCC I IHI1II \ AAAAAAAAANACCCCCCCCTGATCCC 
CCANll I I I 1 II M ICCNCCTCGGAGAAAMNGGGNGGNNNTNTTTTTNTCCTTTTTTNC 
CCCCCCCGG GGGGGN GG NTTN N NCC CCC C C CTTTN I 111111111 1 1 CCNCCCCCCNTTT 
TTTTTTTTTTAAAAAAAAAAMGGGGGGGGGGGGGGNGGG AAAAAAANG N NTN AAAANTT 
TGGGCCCCCCCCCCNC I M I M M I M I NAAAAAAAAAA 
Sequence 868 

GGGAAGATGAAGACAGATGGTGCAGCCACAGTTCGTTTGATCTCCAGCTTGGTCCCCTGG 
CCAAAAGTGTACCTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnNCTGGCATGGAATTT 
CNNATATNCAAAGGCTTTATTCGAATAACNCG I I 1 I I I I I I ANCCNTGGGGGT7TTAAAA 
AAAAANCCCCCNGNNN CCCC CCCN M M I 1 1 11 I I ATNCCCCGGGGGNGTTTAAAAAAAA 
AAAANNATATGGGGGNTT1TGGGGTGNANTTNGGGGGTTAANTGGGGNTTTGGGNGTNTT 
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TGGGNGTTTTT TCCCNCCTTTTTNNCCCCCNT7TCNCCCCCGGGGGGGGGGGGGGGGGNT 
TTTTTTTrTTl T ATAAAAAATATCTCCCCNNNTGTCCCCCTCTNGTCTCCTNAAAAAAAA 
NAGGGGGGTNGTNNNCCCCCCCNTTTTTTNTTTATTTATNGAATTTATNGGATATTANNG 
GGGGGGGGGGNGGGGGNAG I II 11 I A TATAATA I 1111 I A NTANACCCTNTNTNTTTANT 
CCCCCCNATATACCCNCCGAAAAI Mill 1 1 NTN I Mill I ANGNGGNGNTTTGTTCTAN 
AAAAAAAAAAAAGNGGGGGNNTGTTTTTTTTTT 
Sequence 869 

AGGTACGCGGGATGGCACATGCAGCGCAAGTAGGTCTACAAGACGCTACTTCCCCTATCA 
TAGAAGAGCTTATCACC7TTCATGATCACGCCCTCATAATCATTTGNCCTGTATGCTTGG 
NTTCCTTAGNGCCCTGTTA TNGCTCCNG CCCCCG I 111 M I NTTTTTAAI M i ll 1 A NTT 
TTTNNCCCC CCTNCCCCCC M (Mill IAAAAAAAAAAAAAANNACCCCCCNNAAAAAAA 
NCCCCCCCCI I I I I n I I I I CCCCCCCAAAAAAAAAACCCCCCCCNTAAAAAAAAAAAAA 
AAMN GNCCCCCCC CNTAAAAMGGNAAAAAATTGNNAAAAAAATTNAATAANNTACCCC 

cccccn mrm iaaaaaa aaaaaaaaaaaannccccccccc i iiniiiu iaaaaa 

AAAAAAAAAAAAAATTTTTTTNTTAAAAAAAAANCCCCCCCCCN M I I 1 I I i I I 11 I AAA 
AAAAAGGGNA AAAAAAGGGNNC CCCCCCCCCCCAAAAAAAAAAAAI Mllllllllllll 
CCCCCCCCCCN ITTTTTTTTTn I CCCCCCCCAAAAAAAAAAAGANGGGGNGGNTNGGN 
ANAAAAAAAAAAACCCCCCCCCCCGGNGGGGGGGGGNGGGGCNCCCCCCCCC I 1 1111 
Sequence 870 

GAGCTCNCCGCGGTGGCGGCCGAGGTACACTTTTGGCCAGNGGACCAAGCTGGAGATCAA 

ACGAACTGTGGCTGCACCATCTGTCTTCATCTTCCCGCCATCTGATGAAGCAGTTGAAAT 

CTTGGMCTGCCCTCTGTTGTGTGCTCTGCTGAATAACTGTCTATTTNNCCCTNCCCCCC 

CCCCCNAAAAGGGGNNGNAAAAANGGGNNAAAAAGGGNGGGGGNNTCCCCCCNCCCCCAA 

AAAAAAAAAAAAAAAAAAGGGGGGG 1 I I 1 1 I I I 1 1 AAAAATTTCCCCNCNCCCCC Mil l 

TTTNGGGGGGGNGCCC NAAAAANCC CCCCCCCCCNNGGGGGGGGGGGGGTTTANAAAAAA 

AA ACCCCC CCCCCCCCC I I 1 1 I 1 1 1 1 GGG N NG N AN AACCC NTN NNCCCCCCCCCCNCCTN 

NNTTTTTTGGGGGGGTTTNNNTANTTNNNCCCCCCCCGGNGNNGGGGGGGGGNNGNAAAA 

NCCCCNCNCCCCCCCCCCGGGGGGGGGTTTTTCCCCCC' I 111111 11 1 I CCCCCCCCCTT 

I I I I I II II MA 

Sequence 871 

TACTTAGGGCAATTGGAGCTCNCCGCGGTGGCGGACTTTTGGCCAGGGGACCAAGCTGGA 

GATCAAACGAACTGTGGCTGCACCATCTGTCTTCATCTTCCCGCCATCTGATGAGCAGNT 

GAAATCTGGAACTAGCCCTCTGNNGNGNGCCCTGCTGAATAACCTTCTATCCCAGNAGAG 

GGGGGGGGNGC CCCCCCCCCC CCNAATTTNAAATTAAAAGGGGGGGGTTTNNNNAAAATC 

CCCCCNCCCCCC TTTTTTT TTTTNCTTG GGGGGG GGGGGGGGCCCCCNANNCCCCCCGGG 

GGGGGGNCCCCCTT TTTTNTTCCC CCNl lllll I NNAAAAAAAGGGGGGGGGAAAAAAAA 

AAAAAAANCCCCCCC MINIMI I NAAAAAAAANGGGGG I 1 1 1 I I 1 1 I I NGGGGGGGGG 

GGGGGGGGGGGAAAAAAAN M M I 1 M M I I M I NCCCNCNCCCCCCCCN II I M 11 NAT 

AN ATANCCCT TTNCCCNCCNAAAAANAAANGAGGGGATTTTCNTTNNTTTTTNCCCCCCC 

CCM 11 M I IGGGGGGGGGGNGGCCCCCCCCCNCNAAAAAAAAAANGGGTGGGGGGGGGG 

G 

Sequence 872 

CCGGGCAGGTACCTGGATCTTCCAAGCACAGCCACTTNTGTGAACATCCCTGACCTGCTT 

CCTGGCCGAAAATACATTGTAAATGTCTATCACNATATCTGAGGATGNGGGGAGGCANNA 

GTTTNGATrCNCTGTTCATAACTTTACCAAAGGAAATNGAAAAANNCCCCCTTAAAAAAA 

AAAAAAAAANCCCC CCCCNA AAAAAANGGGTNGTNTTTNCCCCCGGGGNNTNNNNTCCCC 

CCCAANCNCCCCCN1 111 I M 1 I GNGGGGGGNNAAAAAAAAANTTTTTTTNGGGGGGGGG 

GGTGTGCCCCCCC NTTCCTCCC CNTCNCCCAN NTAAACTAANA A I M 1 I M I NTTTTNNA 

AATAAATCCCCTCN M M 11 M 1 GNGGGGGGGGGAAAAAAAAAAAATCCCCGTGTTTNCC 

C NNNNNC CCNGGGGNGGGGGANAANAAAAAAAAAACCCCCCCTTTTnTTTNGGGGGGGG 

NTTTTTTT 

Sequence 873 

CTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGCTGCTCAGTTATGGACCCAGA 
GGGAACCATGGAAACCCCAGCGCAGCTTCTCTTCCTCCTGCTACTCTGGCTCCCAGATAC 
CACCCGGAGAAAGTATGTGTTGATCGCANTCTTTCAGGGCCNAAAAAGCCCANACCCCCG 
NTCCCCCTTTTTTNGGGGGGATA 1 MM NNCCCCCCN M i M I 1 I I M 1 I I M M NGGGG 
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GGGGGTTTTNTNTCCCCNNTTTANTNCCCCCCCCCCCCCAAAAANNNAAAAAGGGGCCCG 
GGGGGGGGGGGTCNANAAAAAAAAAAAAATAAAAAATNGGGGGGGGGNANAAAANGNGGG 
GTNTCCCCC CCCCCCCCC CNAAAAAAANNCCNGGTNCCCCNNNNCCCCCCT1 I M I M I 1 

ttnc ccccc ci rrrrm ictttancntncgccccccni mrm i gtggggggggggg 

GGTGTTTTTCTTTNNNCAAAATAAATGNAGGNGGGGGGGGGNTTTA 
Sequence 874 

TACTTAGGGCAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAGTCAGGTTGTTCATT 

TGAGCCAACAACAGATTTCTTGGTTATTGTGCTATTGCCACACCTGGGTTGGTGGTTTTA 

TAGCCATT TCCATC ATCAGTTTCCTGTCTCTGCTGGGGGTTATCTTAGTGCCTCTCACGA 

ATCGGGTGTTTTTCAMTTTCTCCTGAGTTTCCTTGTGGCACTGGCCGTTGGGACTTTGA 

GTGGTTCGAGCGGCCGCCCGGGCAGGTACTCATCACTCTGTCCATACGCGATCACAATAT 

CCTCTAGTTCTT CCAT CACAGTCTGCGCACATTTTGGTCATCAGCTGGGAGAGCACGGCT 

GTCATTTGGGGTTTTTGCAAAAGTTGTGCTTCTCAAGCAAACCGATGGG 

Sequence 875 

TACCAGTGGCCCTGGTGGATGCACCATAGATGAGGAGCCTGGGAGCCTGGCCAGGTTTCT 

GCTGGTACCTGCCCCGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnTAGTGGAGGGGTTAAATTGGCGCCGCTTTGGCCGTAAATCATGGGTCATAAGC 

CTGTTTCCTGTGTGAAAAATTGTTATTCCGC 

Sequence 876 

CCGCGGTGGCGGCCGCCCGGNCAGGTACATCAAATCGACTATGCCGAGTTGTGCAGCGTA 

NCAAGGAAAAAGGGAAAATCAGCTCCCTCGTGAAAGATGCTTCTGTTCCTCTGATTGATG 

TTACAAACCTCCCTACTCCTCGAAAATTCCTTGATACCTCTCACTATTNTACTGCTGGAA 

GCTCAAGTGTGAGGGAGATAAATCTGCNGGACATNAAGGAAGATTTAGAATTGGATCCAN 

AGGAAAACAGCACCCTGTTTATGGGTATCCTCATTAAGGGC 

Sequence 877 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCCGCTTCTTGGATG 

TGTTTGTCTCTCTTGGCTGCACTGGCCTGCTCTGCTGGAGACACATGGGCTTCAGAAGTT 

GGCCCAGTTCTGAGTAAAAGTTCTCCAAGACTGATAACAACCTGGGAGAAAGTTCCAGTT 

GGTACCTGCCCGnnnnnnnnnnnnnnnnnnnnnnnnhnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnGAGGGTTTAATTGCGCGCTTGGGCCGTAAATCATGGTATTAAGCTGTTTCCTG 

TGTGAAAATTGGTTATCCCGCTCACAAATTCCACACCAACATACGAAGCCGGGGAGCATT 

AAAAAGTGGTAAAAGCCTTGGGGGG 

Sequence 878 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTCTTTGCAGTATACAAGGA 

CTAACAGTTAATATTGACCCAATCTTATATACGTGGCTCATCTATCAGCCTCAGAAACGA 

ACAAGTAGACATATGCMCAGCAGCCTGTGGTAGCTGTTCCTCTTGTTATGCCAGTTTGT 

AGAAGGAAAGAGGATGAGGTGTCTATTGGAAGTGCCCCCTTGGCAAAGCAACAATCATAT 

CAGGCCTCTGAATATGCCAGCAGCCCTGTAAAAACAAAAACGGTAACAGGTTGAAGAAAG 

TTCTCCTGGATAATATCCTGAAGACTATTTTCCMCCTTGGTTCCATTTTTCCANGGTCA 

CTTTTCA 

Sequence 879 

GACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGCCAAAG 
GCTGGAAGAAAATTGAGCCAAGAGCTGGAAGCCAGAAGTTCCCCATGCCTGCCCTACACA 
TCCAGGCTCTGAG I I 1 1 1 AC MM I CTACTATATTGCCATGCAAGTAACTTTAACTTTGC 
TTCTCTGTTCTCTGTCACTGAGAAATACCATGTCTCCATGGCAATAGGAATGCATTTACC 
TTCACCAAACCTGCTCAAATCCTTGCTGAACAGGACGTAGCAAGGAGTTGACTTTCCAAT 
GGTGGGAGGTTCAGAGTACCTGCCCGGGCCGGCCGCTCGAGGGGGGGGCCCCGGGTACCC 
AGCl I I \ IGTTCCCTTTTAGTGGAGGGGTTTAATTGCGGCGCTTTGGCGTAAATCATGGG 
TC ATT AAG NCTGTTTTC 
Sequence 880 

GACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC1TAGTGAAATATT 
CTTGGTAATATCATTCAGGTGGACTTTGGCATCATAATCAAATAATTTGATTTCTACTAA 
CTGTrGAAGAACmGTTAGCCACTTrGCTTATTCATTACATmGGGTGATCTTAAGCG 
CACATTAATTACCCTGTCTGTTTAGCATGTCTTGCTGTTCATCTTGGMTTATTTGTTGT 
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TTAGGGACACTTATCTAGTTTCCAGCAGTCTGCTTTTAACCGGGTGGTGnnnnnnnnnnn 
nnnnnnnnnnnrinnnnnnnnnnnnnnnnnnnnnnnnnnnnnCTTATCGATACCCGTCGAC 
CTCGAGGGGGGGGCCCCGGTACCCCAGC 1 I 1 \ \ GTTCCCTTTTAGTGGAGGGGTTTAAAT 
TGCCGCCGCTT 
Sequence 881 

GACTACTTAGGGCAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGGGGGCCAGTTATT 

ATACTGCTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTACCAGTGGCCCTGGTGGA 

TGCACCATAGATGAGGAGCCTGGGAGCCTGGCCAGGCTTCTGCTGGTACCTGCCCGGGCG 

GCCGCTCTAGAACTAGTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnCGGTACCCAGCTTTTTGTTCCCTTTTAGTGGAGGGG 

TTAATTGGCGCGCTTTGGGCGTAATCATGGGTCATTAAGNCTGTTTCCCTGTGTGAAAAA 

TTGTTATCCGCT 

Sequence 882 

CGACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTGACAACAAGA 

GCATTTTAGAGTAATTAATTTAATAAAGTAAATTAGTATTGCTGCAAATAGTTAAATTAT 

ATTTATTTGAATTGATGGTCAAGAGATTTTCCA I t i I ( II I ACAGACTGTTCAGTGTTTG 

TCAAGCTTTCTGGCATAAATATGTACCTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnGCGTAATCATGGGTCATTAG 

CCTGTTTTCCTGTGTGGAAATTGTTATCCCGCTCACCAATTCCACACAAACATACCGAAG 

CCCGGGGAGCAT 

Sequence 883 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAACTTTAAACTGTATTGTATTC 

ATGTTGCTAAACAATATTGGCCTTCTCGATGATTTTATTCATGTTGCTCCAMGTTAAAA 

CCCTGTAGMCTMGTAGGTGAAGAGATATTTTGTATAAGTGCCACAGAAGAGAAAATAT 

AATAAMTTAAATAGTGAATTGAGCATCACTAGAATAAAATAAAATGAGTAGGCATTCCT 

AAGATGTGAAATGATCACCTAAGATATACATGCTCCAACCATATTGATTTTAGAACAAAA 

CACAGCAGCCCATAACAGTTTGTGGCCTCTACTAACTGTCCTCTGCTGTCCCATCTAGAG 

GTTATGTTTCTCTATTTTTAAAATAAAATGTAGTTAAATTAGCCTGACGGATGTTTrCCC 

CTCTATCCCTTAT 

Sequence 884 

TCGACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTCTG 

GAGTCAAMCGMTCAGTTTTATATAATCMTTATTTTAGACTATCCGAAGCATAATGCT 

TAAGTTTAAAAGAGGAGACAAAAATAGGAGGACAAGATACTTGAGAGGCTGTGGCACTAG 

GACTGCTTGAGAGCCCAGGAGTTGGAGGCCAGCCTGGACAACATAGCAAGACCTCATCTC 

TGGTCAGACATGATGACGGTGGCTTCACCCGGGGGTCTCCGCACAGCAGCGGCCTCGGGT 

AAGCAGAACCTCGCTCCGGGGTTTACAAATCCTTCTCCCTTCCCCACAGCACAACACCCG 

CGGCTCCCCGCGTACCTCGGCCGCTCTAGAACTAGGTGGGATCCCCCGGGCTGCAGGGAA 

TTTCGATTATCAAAGCTTTATCG 

Sequence 885 

ACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCCGAGCGGCCGCCCGGGCAGGTACA 

TAATGCAGGATCTGATGTCCTCCTCTGGTAAGCAGCATGCACCGCATAAACAGGTCCTTC 

ATGGCCTTGMGATGCACTGCTTTTAAMGCTGATTATCCTCTATTTCCCAGTGAATCAC 

TTGATTATCAGATCCTTCCAGAAAACTAATTCAAGTAGAAGGGGGAGCCATCCTGTTTAC 

AAATCCACTGTATGCAATTGACTCGGGCCGGTGTGACCATTCAAGTTGGTAACAACAACC 

CTTTTCAGGGGGTCATAGAGCACCACGGAGCAGGGACGTGCCAAAGGCCCAGAAAGTCCT 

CCCCCGCGTACCCTCGGCCGCTCTAGAACTAGGTG 

Sequence 886 

CTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGACATTTC 

TTTTCCAACACAAGTGAGGTGAACAGCCAGCCAATCCTGAGTATTTGAAAAAATGTATCT 

GCAGCAGGTTATGATCAGGGAGCTGTATTTTCAAGTAGTAACAGCACTGCGAAGAGCAAG 

CMTGAAAGGCCAGCATTCTTAATTGGAAMGAAAGAAGAGGAAATTTCCCTCTGCAAGG 

GGGCTAATTTGCCAACTGGTGATATTGTTCAGTTCAAGGGAATGAAGAATTCAGAATAAT 

TTTGGTAAATGGATTCCAATATGGGGAATAAGAATAAGCTGAACAGTTGACCTGCTTTGA 

AGAAACATTACTGTCCATTTGTCTAAAATAATCTATAACAACCCAAACCCAATCAAAATG 
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AATTCAACATTTATTTTCCCC 
Sequence 887 

ATTGG7TCTCCCCGCGGTGNCGGGCAGGTACAACTTTAAACTGTATTGTATTCATGTTGC 
TAAACMTNTTGGCCTTCTCGATGATTTTATTCATGTTGCTCCAAAGTTAAAACCCTGTA 
GAACTAAGTAGGTGMGAGATATTTTGTATAAGTGCCAGAGAAGAGAAAATATAATNAAT 
TTAAATAGNGGAATTGAGCATCACTAGAATAAAATAAAATGAGTAGGCATTCCTAAGATG 
TGAAATGATCACCTAAGATATACATGCTCCAACCATATTGATTTTAGAACAAAACACAGC 
AGCCCATAACAGTTTGTGGCCTCTACTAACTGTCCTCTGCTGTCCCATCTAGAGGGTTAT 
- GTTTCTCTATTTTTAAAATAAAAATGTAGTTAAATTAGCCTGACGGGATGTTTCCTCTCT 
AT 

Sequence 888 

CCTA TAGGGCTNTTTGGNGCANACCCCGGTGGCGGCCGCCCGGGCTGGTACTTCCTTCTT 
TTTTATGACTAAATNCTACTCTATTGTATGGACATGACACATTTTGTTTAAATTGTATAG 
GTATG CCATACTTTAAGTTTGTTCCATAAATCACTTCATTAACATTTGGGTTGCTTCCAC 
TTTTTGGGTAATTAATTAATTTAATTAATTTATTTATTT^ 

AAAAAAAAAAGAATGTGTATCTACCCGCAGTTGTCAGGCGCAGTATCCATATATGTCATT 
TAGGTCAAGTTTGTTGNTTTTCAACTCTTCTGTATTTTTACTGAC1 MM I GGTCTAAGT 
TGTTTAATCATTCACTGAGACAGGTGTGTTAAATCTCTCATTGGGGTGATGATTTATCTA 
GTTCCCATTTTAATT 
Sequence 889 

AGGTACCTGCAGGCCTCCTACACCTACCTCTCTCTGGGCTTCTATTTCGACCGCGATGAT 

GTGGCTCTGGAAGGCGTGAGCCACTTCTTCCGCGAATTGGCCGAGGAGAAGCGCGAGGGC 

TACGAGCGTNTCCTGAAGTATTGCAAAACCAAGCNGTGGGCGGCNCGNTCTTAGAAACTA 

GTTGGNATCCCCCGGGGCCTGCCANGGAAATTTCCGAATAATTCAAAGNCCTTTATTCNG 

GATTACCCGGTTCGNACTCTTNGTAAGGGGGNGGNGCNCCCCGGGNTTACCCCCAAGCNT 

TNTNTTTGGTTTTCCCCCCTTTNTAGGTTNGNAGGGGGGTTTTAAAATTNTGGCNGCCCG 

NCNTTTNGGGTCGGNTNAAATTNCAATTGGGGGATNCNANTAAGGCNTTGGTTTNNTCCC 

CTTGGGTGGGTTGNAAAAAATTTTGG M I 1 1 AATTCCCC AGCNTTTCCAANCCAAAATTT 

TTCCCAACCAANCCAANACCNATTTAACCCGGTAAGGNCNCCCGNGGGGAAAGNNCCAAT 

TTA 

Sequence 890 

GGCCGCCCGGGCAGGTCGCTCGTGATCTAGATAGTGAGCGGACGCGTGGGTCGACTCAAG 

ACTTTTAAAGATTTATCAAAATTTGGTGAGCCGAATCTCAGAAMTTGGTGAATT 

AGTTCCCAATAGGCCGCAGTAGTAATAAGTGGAGTGTTCGGGATATAGAAAAATTTTCAC 

GAATGGAATTAAAATTTCTAGGCGAATTCAGTGAGTTCCCAATGGGACATGATTGTATGA 

GTGAGCTTTTCAATATATAGCAACATTTrAGGTTCAAGAAACTTAAGTATCATGGTGAAT 

TCAGTAGGTTCCCAATAGGATTCGCATGTAATAAG 

Sequence 891 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACAAACATGTGCCACGTCACCACACAAAACCAA 

AGTCTGCTCAGAGAGGTGGGCTATGGTGTGCAGGCTGCAACCTTTCTCTGCAATTGTTAA 

GTCTTCAAAAATCTGAGTTCCTCACATAAAATTCTGTGCTGTGGCCAGAGCTCGTTTTAC 

CATTTTCTTAGATTGGATCACTTTTAGGATCAGCTTCGTTGTTCTTTGCGTAGACAAATG 

ACTCTCACAGCTTTCTCCAAGTGTNCCAGAAGCACTAACTTACTGAAAATAGAATCTCAT 

CAAAGCTTAACATATTCACTCTGAAAACAGCGGANCTGCTGGGTCGCTTAAGGAAAGCTG 

ANAACCTNAAACCTGTGGAAGGAAAACCAGTGACCACTTGGGGCCTTATAAAAGTTTGAT 

TGGCAGGTGANGAAGGGGATCTCAAGAGGAGAATCCCNAATTTCAAAAGACATGGGAGAT 

TTTGTCCCTAAATGTTTTATACTAGTGCTCTTG N N AAATGG AAAACCCT 

Sequence 892 

CCGGGCAGGTACAAACATCCACACCAGAAGAGCAAGACTTAGAAATGGCATCAGAGGGAG 

AGCAAAAGAGGCTTGAAGAATATGAAAATAACCAGCCACAGGGAGAGAATGGGTCATAAA 

ATCAACCCAACTGGCTATCAAGAGAATTATACCTTGCAGAATGGCACCTTTGGTATTAGC 

GTACCT 

Sequence 893 

AGGTACCCCAGCCACATGGCCTTCATCCTTATGACCTAGCAGGCAGAACAGGGACCAAGC 

AGCTTCTATTTTGTCAAACTCCTTTGGACAAATATTCAACATTCAACAACAAGCTTTGTA 

AACCTAACGCTAAACAAGTCATGGCAAGCAAACTGGATTTTCTTAAGAAATGAGGAAAAG 
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TGCAAGTGATCTCAGTACCTGCCCG 
Sequence 894 

AGGTACTTCTTTGCAGTATACAAGGACTAGCAGTTAATATTGACCCAATCTTATATACGT 

GGCTCATCTATCAGCCTCAGAAACGAACGAGTAGACATATGCAACAGCAGCCTGTGGTAG 

CTGTTCCTCTTGTTATGCCAGTTTGTAGANGGAAAGAGGATGAGGTGTCTATTGGAAGTG 

CCCCCTTGGCAAAGCAGCAATCATATCAGGCCTCTGAATATGCCAGCAGCCCTGTAAAAA 

CAAAAACGGTAACAGAATCCCGTCCATTGTCAGTTCCTGTTAAA3CCATGTTGAATATAT 

CTGAAAGCTGTAGAAGTCCTGAAGAAAGAATGAAGGAATTTATTGGA 

Sequence 895 

CCGGGCAGGTACTTACTTGATGTGACTCTCCTCTCATGCCTGGGCCCTGCTTACAGGTGT 
GATTGTGACACATAGCTTGGCCTAGCCCCTAGGTTATGTTACTCTCCTCTTATCCTTCAG 
TTATTTTCACAAGGGGCATTGTGACATATTGCTGGACTGGGAACCCAGGTGATGTGACTC 
TCCTCTACTGCCTGAAATACAACCAAAAAAAAAAAAAAAAAAAAAAAGGTCCCT 
Sequence 896 

NGNGGCGGCCGAGGNACNNNNGNCANTTGNTGGGGGGGAAGNAAAACCCCCCCCCCNCAA 

CANAGGCAGNNCCAGANNNCAACNNTTTANCAGANGGGGGGAAGACCCCCNACAA AAGG N 

GCAGCCACAGNNCGNNNGANCNCCAGCNNGGNCCCCNGACCAAAAGNGNACCNGTTTTNN 

CGGCCNTTTTTAGAACGNGGGGGGNCCCCCCCC 

Sequence 897 

CCGGGCAGGTACGCGGGTCTATATGTCAGAATACACATTTCCCACCTTGCCCAACAGTAG 

AAAAACATAAGAAGAGAAAAACATTAAAAAATGACAAGGAAGTTAATGGAAGTCAGCAAT 

GTGATGGTGTTTGGAGGTGGAGCCTTCAGAAGGTAATTAATGCCCTTGTAAGAAGAGGCC 

AGAGAGCTTGCGCACCTTCTTCCTGCCATGTGAGGAGCCAAGAAGCCGGCTGTCTGCAAC 

CTGCAAGAGGACCCTCACTAGAAGCTAGCCATACTGGCATCCTCATCTTGGCTTTCCAAC 

TTCCAGAACTGTGAGAAGTATATGTTTGTGGTTTAGTCAATGGTCTATGGTMTTTT 

Sequence 898 

AGGTACGCGGGGGACGCGCGTCTGTGGAGAAGCGGCTTGGTCGGGGGTGGTCTCGTGGGG 

TCCTGCCTGTTTAGTCGCTTTCAGGGTTCTTGAGCCCCTTCACGACCGTCACCATGGAAG 

TGTCACCATTGCAGCCTGTAAATGAAAATATGCAAGTCAACGAAAAAAAAAAAAAAAAAA 

AAAGTACCTGCCCG 

Sequence 899 

AGGTAC I I M I I I II U I II II I I I I I C AGGTCTCCAGGAAAGATAGTTAAGGCTATGCT 

AGTGTAAGTCTTTATGGCTTGCATGGAACCAGGCAGGCTACTGCTTTCTGCTG TGTG GCC 

TGGATCCTTTTGTTGTAGAATAAATAACTGCAATAGGAAAGAAACACATATATAATTTTA 

TTCTCCATTCACAGCCCTTTGGGTTATAGTCTTTTCCCATATAGCAGATACTATACTAAA 

AAAAATTGGGGTGGAGGAATCTTCACTGACATCACAAGGTGATTTTTGMTCCGACTTTT 

ACAGTCTGAAATTCAGACTTCCTGCCCTACTGCAATGGGAGTAAAAACCACCTGAACC 

Sequence 900 

AGGTACGCGGGGGTGGCGCCAGGGATTTGAACCGCGCTGACGAAGTTTGGTGATCCATCT 

TCCGAGTATCGCCGGGATTTCGAATCGCGATGATCATCCCCTCTCTAGAGGAGCTGGACT 

CCCTCAAGTACCTGCCCG 

Sequence 901 

AGGTACAACAGTTTTTTAAAAGATTTCACTGACATTTC 

TTTGGTGCCATTTTAGTAACTTAACAATTGCTTTAATAGACAAAACTCAATCATGCATGA 

TCAATCACAGGTGAGAAAATAGAGGCGTGCTTACTAATGGCAGCAAAAGGCTTCTGTGGG 

GGTGTGCGGGATGGAACAGCTAGCTAGCTAGAAAGGGGCTTCTTTATAGAGGGACTCCAG 

TGTCAGGCAGGGGCACTGAAATCTAGGGGTCACCAAGGGCCACAGCACAGGGGACTGAGG 

GAGAATCCAGTGGCTGGAACTCTCCTCACC 

Sequence 902 

ATGACATGGNTCAGCTTCGGTTTAAAAAAGGTCAGTGTCTATCTGGAAATTTCT ACGT GA 
GAGGTGATAGGAACCAGAGTTTACTTCTTCACACAAGAGGMCTGGACACGCATTTTCAC 
CACTTGCTGGAC 
Sequence 903 

AGGTACGGCCACAAGAGGGTAAGAAATTATCGCATGGTCTAATACCAAATTTTCCCCCAA 
ATAGAACCTACCAAGAGATCGAGCAATCAAAGCGTTATCTGTCAAAGCAATTCGCTTCTT 
ATTGATTTTGCCATAACCACGCTTGTAGATTAGTTCATTTACTGACTTCAGATTGGGGTA 
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CCTGCCCG 
Sequence 904 

GCCGAGGNACGNNACNCCTCGTTCAGGGGANAAAAACCCANGCGNCCCCACGCCCNAACA 

NGNNCCANAAGCTGTCCNCCANGCNCCCAGCCNGCCNCACNGNACAACGANNNCNCAGCC 

CNGGANNGNNCNNCCACAGANCNGCNCANCAACNCNGGCNCCNGACCCNCNCTCCTTTGT 

GACCCNAGCCCANNCCCACCCCCNGCGAAAGNGACCACAGCNNCCACCANGGCAAGGGGN 

NAG3AAGGNCCAAAGGGCNCNNCAGNGGGNACGNNCAGCNGGAGNGGCANCGAANNGNGN 

GCCNNCNCAGNANNGGGCAAGGNAAAGAGC 

Sequence 905 

CCGGGCAGGTACTCCCTCCAGACATCCGTATATTGGCCTGGGCCCCTGTAGAACCAAGCT 

TCAGTGCTAGGTTCAGCTGCCTTGAGCGGACTTACCGCTATTTTTTCCCTCGTGCTGATT 

TAGATATTGTAACCATGGATTATGCAGCTCAGAAGTATTGTTGGCACCCATGATTTCAGG 

AACTTGTGTAAAATGGATGTAGCCAACGGTGTGATTAATTTTCAGAGGACTATTCTTATC 

TTGCTTCAAAGTACCTCGGCCGGCTNTAGAAACTAAGTGGGATCCCCCGGGGCTGCAGGN 

ATTTCGATATCAAGCTTTATCNGATACCCGTCCNACCCTCGAGGGGGGGG 

Sequence 906 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTGGAGGAGTGAGTCCCTATGCTGACCCCAATA 

CTTG CAGAGGTGAGAGAATGCTCTTTGGTTGTGCTACAAGTGCCCAAGGCCCAACAGTCC 

TTTTCTCTACAGCTTCTCCTCTCCTTGCAGGTGATTCTGGCGGCCCCTTGATAGTTCACA 

AGAGAAGTCGTTTCATTCAAGTTGGTGTAATCAGCTGGGGAGTAGTGGATGTCTGCAAAA 

ACC AG AAGCGGC AAAAGCAGGTACCT 

Sequence 907 

CCGGGCAGGTACAAAGTGAGGAGGCAAGACAGGTGCACAGAGCATAGCTTTGTCCCATCT 
CAGGAACCCTGGGTTCCACCCCAGCTCCTGATCCCAAGAGATACGTTTC CC GGG ACTCC A 
AGGGAGAGCTGAAACACTGGTCAAGCTCAGAGCCCTGAAGCTCTTTCCCACTCCCCGCGT 
ACCTN 
Sequence 908 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTAATGAGGCTGTATTA 

AGGAGAGTAACAAGTTCTAATTCTTGACCCATCAAATTCTTAAGGTGAAGCTGAGGACCA 

GGAAGAGAAAGAAGATGCTGAGAAAGAAAACATTGAAAAAGATGAAGATGATGTAGATCA 

GGAACTTGCGAACATAGACCCTACGTGGATAGAATCACCTAAAACCAATGGCCATATTGA 

GAATGGCCCATTCTCACTGGAGCAGCAACTGGACGATGA 

Sequence 909 

CGGGCAGGTACGCGGGGGAGTCCCCTCGCCAGATTCCCTCCGTCGCCGCCAAGATGATTG 

TGCGGGGCGCCCTCCGCCACGCAGCCGGCCACCGCCGAGACCCAGCACATCGCCGACCAG 

GTGAGGTCCCAGCTTGAAGAGAAAGAAAACAAGAAGTTCCCTGTGTTTAAGGCCGTGTCA 

TTCAAGAGCCAGGTGGTCGCGGGGGACAAACTACTTTCATCAAGGTGCACGTCGGCGAAC 

GAGGACTTCGTACCCTCGGCCGCTCTAGAACTAGTGGGATCCCCCNGGGCTTGCAAGGAA 

TTCGATATTCAAGCTTTATTCGATACCCGTCGACCTCGAGGGGGGGGCCCGGTAC 

Sequence 910 

CCGCGGTGGCCGAGCGGCCGCCCGGGCAGGTACAGAGGTTACCCAGTGTTGCATTACTTT 

TAAATGATCTTAAGAAGCATACAGCTGATGAAAATCCAGACAAAAGCACTTTAGAAAAAG 

CTATTGGATCAC TGAAG GAAGTAATGACGCATATTAATGAGGATAAGAGAAAAACAGAAG 

CTCAAAAGCAAATTTTTGATGTTGTTTATGAAGTAGATGGATGCCCAGCTAATCTT7TAT 

CTTCTCACCGAAGCTTAGTACCT 

Sequence 911 

AGGTACCAACTGGCCTCATCCTATATTCACTTTCGGCCCTGGGACCAAAGTGGATATCAA 
ACNGAACTGGNGGCTTGCACCATCTGTCTTCATCTTCCCGCCATCTGATGAGCAGTNTGA 
AAATCTGGGAACTGGTCACTNGTTGTGTGCCCTTGCTGG 
Sequence 912 

AGGTACCACCCATGTGNA GGAGA CTGCAAGNGAAGCTGTTATTCAAAGTAAGAACAGTCA 
GCCTTGTGCTTGAGTCCTGTTTTATGCTTGCGTGTTTCATACCAAAACACAAAGGCAAGT 
CTTCATATCAGCACTTTAGTCTTTGATATCCAAGTTAGCTNGACTACTNGTACCTNGNCC 
GGGCGGCCGCTNTAGAN NTAGTGGATN 
Sequence 913 

CCGGGCAGGTACTATAGCTGTAAGGAGAAGCTGAGAAATGATACCCAGGAGCAGCAGGCT 
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TTACNGTCTTCAGCCTAAMCCTAAAAAAAAAAAAAAAAAAATTTAAAACAGCTATTAAA 

CTGAAAGCATCTGTAACAAAAAAAAAAAAAAAAAAAAAAGTACCT 

Sequence 914 

CGTAATACCGACTCACTATAGGGCGAATTTCGAGCGGCCGCCCGGGCAGGTACAGGTCAG 

AGTCTTCTTTTCTTTTCTTTTTGAGATGGAGTCTTGCTCTGTTGCCAGACTGGAGTGCAG 

TGGTGCGATCTGGGCTCACTGCAATCTCCACCTCCCGGGTTCAAGCGATTCTCCTGCCTC 

AGCCTCCCGAGTAACTGGGACTACAGGTGCGCGCCACCAAGCCCAGCTCATTTTTGTATT 

TTTAGTAGAGATGGGGTTTCACGATGTTGGCTAGGATGGTCTCGATCTCTGGTCAGAGTC 

TTTTCTGTAMTATCCTTGGTAMGAAGCAATTTTAGACTGTAGCTGTTGCAMTGCTTT 

AAGGAAGAAGCAAAACAACTGTCAGT 

Sequence 915 

AGGTACCATCGATCCTAGTGGGACGCGATCCAAAAATATGCCTATTAAAGATAATTGCTT 

TGGTTATGTTTAATGGGAAAGNCTATCTGTTTGGCTTAAAAAGGGGACAGATGTTCTGCC 

ATCACAAATTGACCMCANAATTNTTGTTTCTCCTGNNACTCCAGTAAGAAAAGACNCGT 

TTACAGACAGTGGGGTTCAAGGNNCAGNCACAGAA 

Sequence 916 

ACCGCGGTGGCGGCCGAGGTACAAAGTGTCTTTNTGTCCTGTGTTTNANCCNTTTAACAT 

ACAGMCCTAATTTTACTGGCATTTTAATGTTAATTCTCCACTCGAAGGTGAACATGGGA 

TGTTTGTAACACATATGTTTTGNTTATCAAGCACACATATGACCCCCTTTTCATGAATAT 

TCATAGGTTCCC 

Sequence 917 

ACCGNGGTGGCGGCCGAGGTACACTTTGCCTGCTCCAGCCCTGGAATCAGCCATTGCTGC 

TGTGCTATCCTTGACAGAACTATAACCTGAATTTAACAAGGAGGAAATATCAGGCAAACA 

CAMTGGAGTGACATTCTANTAMCAACTTGATCAGCACTCTTTAAAAATGAAAGACAAA 

AGAATGACTTGAGGAATGAGTTTC 

Sequence 918 

CGGCCGCCCGGGCAGGTACGCGGGGCAATTTGGAGAAGATAGAAGTTTGAAGTGGAAAAC 
TGGAAGACAGAAGTACGGGAAGGCGAAGAAAAGAATAGAGAAGATAGGGAAATTAGAAGA 
TAAAAACATACTTTTAGMGAAAAAAGATAAATTTAAACCCTGAAAAGTAGGAAGCAGAA 
GAAAAAAAATNATTTTTTNAANNNNAAAAGTACCCTCGGCCCGCTCTAGAACTAGTG 
Sequence 919 

CCGGGCAGGTACTTCATGAAAACTACCATAACGTAAATTCCCCTTCTCTCCCCTGGATGG 

AAGTATCAACTTGGAAGTTACTAGGAAAAGGTAAGGGAACTGACTGAGGACACTAGGCTT 

ACCTTTTAATTTCGAAGAGTAACTGGATTATTTCAGGTTGCTACGATTTCTGCAAACGGA 

GACAAAGAAATTGGCAATATCATCTCTGATGCAATGAAAAAAGTTGGAAGAAAGGGTGTC 

ATCACAGTAMGGCAAGTGTGTTTGTATTTTTA AAGA TAATTTTGAGTTATCTTATGATC 

AAAAGTTTGAGTTATCTGATGATCAAAACTGAATTTT 

Sequence 920 

AGGTACGCGGGGCCGTAACTTTTCTATCCGTNCGCGTCAGCGCCTTGCCACCCTCATCTC 
CAATATGNCGTGGTCCGACCCCCAGTGGCACTAACGTTGGGANCCTTCAGGGCGCTCTTC 
CCAAG 
Sequence 921 

AGGTACTTTTTTCC I rTTTTTTTTTTTTI 1 AAGTGCGTTCATTCTCACTGCTGTTATTGT 

TTTCTGACAGCATGTCTGAACCAGCTAAGTCAGCTCCTGCTCCGAAGAAGGGTTCCAAGA 

AGGCTGTGACCAAGGCGCAGAAGAAGGATGGCAAGAAGCGCAAGCGCAGTCGTAAGGAGA 

GCTACTCCGTGTATGTGTACCTGCCCG 

Sequence 922 

CCGGGCAGGTACCCAGTTCCAGGCCTTGACTCTTGGATGGCATTTCTGGACTTGCCCTGG 

GCCAGAAGGAAGCTCACTGCCCTGAATGGAGAGTTCCAGGCCTGGCAGCATTCACCACAA 

ACTGACTAAAGAGGCCCTGGGCCTGAAGTGAACATCAGTGGTAGTCTGGCAGTATTCCTT 

ATGGACCTGTGGTGGTGGTGGCCATAAGGGTGAGGCTCCTCTGCCAGTGGAAGGGGGAGG 

GAAGAGTAGGAAGGACTGCATCTTGTNAGTTTAAGGTTCCAGCTCAGATGCAGTAAANTA 

G AGCATGN CC G G N N GATTTCTAATATTTTTTACTTNN AGTCCCTGGCTCCTGGAAA 

Sequence 923 

GGAGCATAAAAGTGTAAAAGCCTGGGGTGCCTAAATGAGTGAGCTTAACTCACATTTAAT 
TGCGTTGCGCTCACTTGNCCGTTTTTTCAGTTCGGGAAACCCTTGTTCGTGCCAGCTTG 
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Sequence 924 

AGGTACATATCACAGGATTAAAACTCCAGTTAAGCAACTGAGCTAATCATTGAAGTAAAT 

TAAAAATACCAAGCTTCTTTAACCTATCAATGCTGTTTTAGAAGCATCATCCGAACAAAT 

AGAGATTTAGTTATAAATTGCTGGGCTACATTCTGTGATGAGMTTTTGCTTAGTACCAC 

TAGAGGAAGAGAGACTAGAGGCCCAAGAAGANGGTGAAAGTAGAGCTATGTGCCCAAGAA 

AAACTTGTAGAGATGGAATAAGAAC TGAAA CAATAGTAGAAACACTTGCCCTGAAGAAAA 

AGGATGGAAAAA TGAA ATAGATTGATTTTTAGCTGGTAGGGAAGAMGTTAACATAGTCT 

TAGTCGGTTGTGTTTTCTCAAGGMCTGAGAAGCAGGATTAGTTTTAGAAAATTTGAAGA 

AGGTAGTGAGAGGNTGAAGCCAGCTGGGTCGAGTGAGGNCTTGGANAACTTTTTTNTCGN 

ACCTGCANGCCCTTCTTANACCCTACC' I MUM I NGG 

. Sequence 925 

CTATAGGGCGAATT GGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTNTNTTTTTT 

TTTTTTTTTTTTTTTN N AAGAAC AAMTGTTTATTTTTTGGATATAGACATTTAAAACAT 

TCATNTCCAAGAACAG CTTCAAT CAGGTATACAATAAGAAATAGACTTTAAATCCTAATC 

AGAAAAGGN AACA GAATTTTTTTGAATGCTGCTGTANACACTAGTGTTAGAAAAAAAAGT 

TTATAAATGGTTTTGCACCAAAAAAATATGCAAAGAAACTTGAAA 

Sequence 926 

GGCGGCCGCCCGGGCAGGTACTACGAGTGAGTGAGGCTGGGAGGAACACCAACCTAAGCC 

AGGGTAATGAGGGGGGACTCTTTACCCAGGACCCTGCCCACTGGCCTTCCTCTCTTCCAA 

ACACAGGTTCCGGCATACCCAGGTGTGCAAGGCCTCAGCACTGAAGCATGGNGGGGATCT 

GGCACAAGACCCAGCCTGGACAGAGATCTTTGGTGTTCTCTCTGNGGCCACCATCAAGTT 

TTGAGATGCTGAGCACAGCCCCACAGAGTCAGCTCTTCCTGGCTCTGGCTGACAGCAGTA 

TCTCCACGAAGGGCACAAAGAGNGGCACCTTTGTCATGTATAATTGTGC 

Sequence 927 

CCGGGCAGGTACTACGAGTGAGTGAGGCTGGGAGGAACACCAACCTAAGCCAGGGTAATG 

AGGGGGGACTCTTTACCCAGGACCCTGCCCACTGGCCTTCCTCTCTTCCAAACACAGGTT 

CCGGCATACCCAGGTGTGCAAGGCTCAGCACTGAAGCATGGTGGGGATCTGGCACAAGAC 

CCAGCCTGGACAGAGATCTTTGGTGTTCTNTCTGTGGCCACCATCAAGTTTGAGATGCTG 

AGCACAGCCCCACAGAGTCAGCTCTTCCTGGCTCTGGCTGACAGCAGTATCTCCACGAAG 

GGCACAAAGAGTGGCACCTTTGTCATGTATAATTGTGCCCGTCTTGCCACACTCTTTGAG 

AGTTACAAGTGTAGTATGGAACAAGGTCTGTCCT 

S equence 928 

atttttttttttttttttt tn igg cngcttttcctgattttttaactactcatacaata 

gtctggt aact ggtnttctttttaagcagggaactccnggattcaaatncctgataatta 

aaggatcttttgatattttggcagttnctctcactaaaagmgttncattcancaga™ 

naatgattcatgagaaattcttggntagtaaatatttaatccagattttataattgccta 

aatctctttataggttatttcttgcaatatttcaagatcctgagtcagccatgcttatac 

aagcaaacttttattta 

Sequence 929 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTACCTCCTGGCAGGAGGCCAACCAACACAAAA 

CTAGTGCAATAAACAAAACTACAACTAAGGATCCTCACAAGAGCCCATTTCACTCCCCTG 

CCACCTCCACAACAGCAGGTGCTTGTATCCATGGCTGAGAGACCTGAAGACTTTCACATT 

ACAGGACTCTATGCAGACTCCCCCAGTACCT 

Sequence 930 

CCGCGGTGGCGGCCGAGGTACTTCTATGTCATCATGGGTCTCAAAGGTGTTGTCAAAATT 

GAAMGATACTCAMCTTGTCACTGGAGATGCTGCAATCTATGACTGTTTTTCTGCTTGT 

ATTGATGAT TATGAA TGG CAGCTGA ATGGCANAGTTCAGAGCCGGCGGNCCCTGGGTTTT 

GCTGCTTNATTTTTGTNAATTTTTTTGTACCCTGCCGGGGNCGGCTTTTTNNAA 

NGGNNCCCCCCCNNGGGGNNNGGGMGNANTTTTTATTTNTNTATTTTTTTNNTCCCCCC 

CCCCCCTNGGGGGGGGGGGGGGNGGNCCCCCCCCCT A1 1 I U M 1 I t 1 

Sequence 931 

GTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAGATTCT 

GATTCACTGATAATCGACTCATCTGTTACTAATACTTCCCCATCTGGCTTTCTGCGCAGC 

ACCTTTCTCTTCATCACTGGCTTTCGGAGTCTTTGGGGAGGGCCTCAGAGACTGTTTCTG 

AAGGGACATTACTTTCTACATGTGGGTACCT 

Sequence 932 
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TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCA GGTAC TGTGCAATGCCT 

CAGGTTGCACCTGACTTATATGCTGAACTACAGAAGGCACATTTAGTTTTATTCAAGGGT 

GATTTGMTTACAGGAAGTTGACAGGTGACAGAAAAATGGGAGTTTTTCTGTTCCATTTC 

ATCAGGCTCTGAATGGCTTCCATCCTGCACCACTCTGTACCT 

Sequence 933 

GTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCCGGGGCTGGCTCC 

TCCAGTGTC7TCTCAGTCCTGAGCAACAGTGCAGAGGTGAAACGGGAGCGCCTGGAAGAT 

GTGGTGGGAGGCTGTTGCTATCGGGTCAACAACAGCTTNGGACCATGAAGTACCTGCCCG 

Sequence 934 

AGCTTTACCGCGGTGGCGGCCGCCCGGGCAGGTNCTTTCAAACAACGCGGTAGGCCTTCC 

CTTTNGGGTCTGCCATGACAACGATACCCCAGGTGGCAATGCTTGAAGCCGATGACCATC 

TGCGTAATGGNCCATTGGACTNTGGGGGGTATTCCATCTTTAAATGGGCCTTTNCGTGCC 

ACCCATTACCTTNGGCCCGCTTTAAAAACTAAGTTGGGATCCCCCCGGGCTTTCAANGGA 

AATTTTNGAATATTAAAAGCTTTATTNCGMTT 

GGGGGG 

Sequence 935 

TAGGGCGAATTGTAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGTGTATCATCGC 
AGTCTTGC \ I I 1 I IGAGTAATGGATTCCTAGATTCTATGAGGATACCACAACCACTTTTA 
AAGAGGTTTCTAAGGCCAGGTGCANTGCTTACGCCTGGGAGACCAANGTGGGAGGATCAC 
TTGAGCTCAGGAGTTTGAGACCAACTTGTACCT 
Sequence 936 

NCACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCACAAACACGAT 

CGCAGATCCAGGTTTCTCAMCTGGAGCATCTGCTTAATTTTCCCATAAAATCAGTCTTA 

TTCTTTCTGACAGCTCTGAGACTCCTCCGGCCACGACTAGGTGCTGTCCTGGAGGAAACG 

GTGGAGGA 

Sequence 937 

AGGTACGGGAGAGACTCATAACTCTCCAGGCCCAGTGCTGGGGGAACCCCTCATGGGAGC 

AGCTGTAATGAGGCCTGGAAACTGGACACTGCTGAGCCCTGACATCCAGACCTCAATGCC 

CTGACTCAGCCCATGGCAGCGACCCCTTCAAACAGGCTCCATGTGGGCCTGTAATCAGGA 

AAGACTAGAGCATCCAGGAGTGGAGATTTGATTCTGAGATGCTGGGAGTTGTTCCTGGTT 

CTTGCTCCTGTCTGGCTGGTGCTGACAGCCAAAAGCGGGGAGTTGTGTGAGCTTTGTCCT 

TCAGCACTAGAGGCTTCACTCAACCCATGAGAAGGGGATGCCCCAGGCTTGCCCATCACA 

AGCACTGGCCAAGCTGACTTCCCGCTGCTGAGGGAAAGA 

Sequence 938 

AATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACGCCTCTGGATTCACTTTTGATAATT 
ATGTCATGAATnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTCGGCAATCA 
GTGGNAGTGGCGGTGGCACACACTACGCATACTTC 
Sequence 939 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGGGGCCTGA 

GMTTTACATTTCTAGCCAGCTTCTAAGTGATGCTGATGCTGCTAGTTTGGAGACCACAC 

CTTMGAACCACTGCTGTTGGTCTTGTCTGCACAGTAGATGCACCAAGGCAAATGTTTAG 

AAGTACCT 

Sequence 940 

CGCCCGGGCAGGTACTGCTGACTGCCCTGGCTGGTCAGTGGGGCAAAGCCAGGCATGAAG 

AAGTGCAGGTAGGGGAATGGGATGATGTTCATGGTCAGCTTCCACAGGTCAGCATTCAGC 

TGGCCTGGGAATCGCAGGCAGGTGGCAACCCATTTATGGTGGCAGATACTAGGAGGTTTA 

GGTCACTCTAGGTGGGCATGGGCAGCTTTAGGGTTCTGAAGCAGATGTCATAGAAGGCTT 

CATMTCAATGCAAAAAGTCTCATCTGCATTTGCTATGAGACTTGGAGGGTGACATTGTG 

GGGCTCTAGCATGGTGCCTGACACCTTGGGCAAGMCAAAAATGCrJNAATNNGTTTTNNA 

MNCCTATTTAAAATGGGTCTGAATAACTGGTGATGAATTTACTTGTATAATCAGAATTC 

MGATTAAAMTTN AGACAACTCTTTTGTTTNC AACAGAANGGAAAAN NAN NATTAANTN 

TTNNANNNNGTCCCTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCTNGAANGAATTC 

NATATCAAGCTTATCGATNCCCGTCGACCCTCGANGGGGGGGCCCGGTNCCCAACTTTT 

Sequence 941 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGCCCGGGCAGGTACCTGAGACAATAGC 
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ACCAAATTCAACAGAGAGAAAGAATGAGTGAGAGAGCACTTTACACCAAGGCTCTGCACA 
TAATTGGTGCAATTTGAAATTGAATGGCTCAGAAGACTGCTCTGTGAGGAGCAGATTG.GA 
GAGGATANACTCATCTTGCCGCGTACCT 
Sequence 942 

CGCCCGGGCAGGTACTCCCCTCCCCAAATAGAAACCTCAAAGACTGATCCATTTCCCCTA 

GGGCCTGGGCCAGGAGTAGCTCACTGCTCACTGCTGAGGAGAAAGGCACAAGATATAATG 

TCATAAGAGCAGGAGAGTGGCTCACCTACAGAGTTCCCTATAGGGGAAAGAAGGCAGGAA 

ATAGGCGCAGGGTCTGGTCCTGTCCCTGCACCACCCTGAGCAGCTAGTCTTGGGAAGGGA 

TTACAGGCCCT GGGC CATAGGCTGCTCGCCATTCTGCTTTCCTATCCTGTTTCTCTCCCT 

GTGCTGCTCCCTTTTAGCCAGGGCTGAGAAATGTTCAGCACCTGAGGCAAAAACTGCCAT 

AATACCT 

Sequence 943 

TGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCATGGTACCACGTGCATGAAACCCCTGTCA 

CATATCCCCTTGCTTGCTCAMTCAATCACGACCCTTTCATGTGAMTTTTAGTGTTGNG 

AGCTCTTAAAAGCGACAGAAATTGTGCACTCGGGGAGCTCGGATTTTAAGGCAGTAGCTT 

GCCAATGCTCCCAGCTGAATAAAGCCCTTTCTTCTACAGNTCCGNGTCTGAGAGGTTTTG 

TCTGNGGCTTGTCCTGCTACAGTCCTTGGTTCCCTGACCATGGAAGCGAGGTTGACTGAC 

AAGCCCTGTGGAGCGTCCCTGNANAGGACTNCGGCCTGCNTGAATGACNCAATCCAAAGA 

GCGCTTNCCGGNTAGGAAATGGNCCTGGTGGAATGCC 

Sequence 944 

C CGCGGTGG CGGCCGAGGTAC 1 rTTTTTTTTTTTTTTTTTTI I GACCATCCACATTATCT 

GTTTTATTTTGGCAAAATGAAAGCATCTTAAAAAAAAAAGGCAATCACAAAATACAGGGC 

TTTCTTAGAAAGCAACATGCATCATATCTTGAACAAGTTACCCGAGCGGTTTCTCCGTCC 

TACAAACAGACATGGATCAATCTCTCCTTCTGGGAGCTTCTCTGTCTTTGCCCAGGTCGC 

TCCCTGAGAGGTGAGGTCTGAACCCACTGGGAAGCAGGACGATGTTCAAGGCTTGTACCT 

GCCCG 

Sequence 945 

CTACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACAAGTTTACACCGTAA 

GAGGCAACATGGTCAGCCACAATGTCTTCACCTCCACAAGGGCTCATNACGGTGGTCAGG 

GCGAGGGCCCCCAGCATCAGAGCTTTGTTTAGGATCATNCTCTTTCCAAGGCAGCCTCAN 

CAGTTGCTGTTNTGAGCTGNAGAGCAATTGNNCCCGCGTACCTGCCCG 

Sequence 946 

AGGGCNAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACCTGCCTGGAGCCTGGTAGACT 
TGCTGGGTGGCTAGATTCAGAAGAGAGAGAGCAATCACTACAGCTCAGCTCTCAGGAAGC 
CACATCCATAGGAAAAGGGAGAGAGTACCTGCCCG 
Sequence 947 

TTAGG GCGAATTGGAGCTCC CCGCG GTGGCGGCCGAGGTAC ) rTTTTTTTTTTTTTTTTI 
nil irrrrNNGAGTTACAGTAATTTTACTTTANCAATCANACATGTATTCTGTGTCTCAC 
G GGCC ACCTGCTGAATGA^4AGGACTCCAGTTGAAAGGTCAAGAACATAAAACCACAAAAG 
CTTTTTGAGTGGGTCTTNAACTTANTTAAAATAAAAATAAAGGACCTNCCGCGTACCTGC 
CCG 

Sequence 948 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGAATTCATGTGGAG 

GTCAGAGTGGAAGCAGGTGTGAGAGGGTCCAGCAGAAGGAAACATGGCTGCCAAAGTGTT 

TGAGTCCATTGGCAAGTTTGGCCTGGCCTTATCTGTTGCAGGAGGCGTGGTNAACTCTGC 

CTTATATAATGTGGATGCTGGGCACANATCTGTCATNTTTNACCTATTCCGTGGAGTGCA 

AAGACNTTGTGGTAGAGGAAGGGACTCNTTTTCTCATCCCGTGGGTACCTGCCCGGGCGG 

CCGCTCTANAACTAGGTGGGATCCCCCGGGCCTGCANGAnnnnnnnnnnnnnnnnnnnnn 

nnnnnCGTCGACCTTCGAGGGGGGGGCCCCGGTNCCCANCTTTTTGTTCCCT 

Sequence 949 

GCTCCCCGCGGTGGCGGCCGAGGTACTACTCAGCTGATCACAAGCTGCTTGATGGGAACC 

TACTAGATGGACAGGCTGAGGTGTTTGGCAGTGATGATGACCACATTCAGTTTGTGCAGA 

AAAAGCCACCACGTGAGAATGGCCATAAGCAGATAAGTAGCAGTTCAACTGGATGTCTCT 

CTTCTCCAAATGCTACAGTACCTGCCCGGGCGGCCGCTCGATATTATTCTTCAATTCAGC 

TTGTTAAACCTCCTTCAGGATTCTAAAACCTTTTAGACTCTTAAATTGCAGCCTTCCATG 

TCCCTTGTCCTGCCTCCAGCACACTCTTCAGTAAACAAAAGTCAACAGCACCAGGGCA 
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Sequence 950 

ACCCTACTATAGGGCGATTTTGGAGCTCCACCCGCGGTGGCGGCCCGAGGTACCAACAGA 
AAACCTGGCCAGGCTCCCAGGCTCCTCATCAATGATNNATCCGACAGGGCCACTGGCAnn 
nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn . 
nnnnnnnnnnnnTTGCAGTTTATTACTGTCAGCAGCGTAGCAACTCGTACCTGCCCG 
Sequence 951 

GCGAATTGGAGCTCNCCGCGGTGGGCGGCCCGAGGTACTTTGGCCTCTCTGGGATAGAAG 

TTATTCAGCGGGCACACAACAGAGGCAGTTTTCAGATTTCAACTGCTCATCAGATGGCGG 

GAAGATGAAGACAGATGGTGCAGCCACAGTTCGTTTGATCTCCAGCTTGGCCCCTGGCCA 

AAAGTGTAAAGATTTGTAGAGCTTGCATGCAGnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnGTCAGGGA 

CCCCGGAGGCCCGATTAGAACCCAAATAGATCAGGAGCTGTGGAGACTGCCC 

Sequence 952 

ACAACTTTGGATTGTCAATGAAAGCTTGNTTGTGGATACTNATCAAACNNTTTTTNGTCT 
ATTTANTCCTTGNTGAAGCTAAAAAGGAAAGAGAGAGTGAGAGGCAAAGAAAGACAAAGG 
AGGAAAGACCTGCCCGGCGGCGTCA 
Sequence 953 

CCGGGCAGGTACAGATGATGACTGCAAAATGAAGACCTACTTTCAACTNCTTTTTCCCCC 

CTCTAGMGAATCAAATNGAATCTTTTAC^GCCTCTTGCAAAAAAAAGAAAAAAAGAAA 

AAAGTTCATTTATTCATTCTGTTTTTATAGCAAAACTGAATGTCAAAAGTACGTGCCAAG 

CATCCTCGTGCGACCGCGAGAGCCCGGGGAGCGGGGGCTTGC 

Sequence 954 

AGNATTTGGTATCTGCGCTCTGGTGAAAGCCAGGTACC7TCGGAAAAAAGAAGNGGGGAG 

GCTCTNGMCCCGCAAAAAAAACACCGCTGGGAGCGGGGGGTTTTTTGGTTGGAAGCAGC 

ANMTACNCNCANAAAAAAAGGACCTAAGAAAAACCCTTGAACTTTTNTACGGGGGCNGA 

CCCTCANNGGGAACAAAAACNCACGNTAAGGGAANTTNGGCATGAANAATANCAAAAAAG 

GANCNTCACCCAAAACCTTTAAAA 

Sequence 955 

CCGCGGTGGCGGCCCGAGGTCAACGGCCAGCCAATCACCAAAATGGCCTGTGGGGCTGAA 

TTCAGTATGATAATGGACTGCAAAGGAAACCTCTATTCCTTTGGGTGCCCTGAATATGGT 

CAGCTGGGACACAACTCAGATGGGAAGTTCACCGCCGGGCACAGCGAAGAGTACCTGCCC 

GGGCGGC 

Sequence 956 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTTTAGATGCAAAGTTTTAGATGTTGGTATTTG 

AATGGGATACAAAATTGGGAACTTGGAATTTGGGTTGAGACATCTTTCCTATGCAGTTGG 

GAACTTGGAATCTGGGTTGAGACATCTTTCCTATCCTTGCAAAAAAAAAAAAAAAAAAAA 

AAGTACCT 

Sequence 957 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTCGCGGGGAGGAATTAACATA 

C ACTM CTTTGAGGAAGATGTGCTCATATACATTATTTTTATAATTCAGTCATATGAGGC 

ATTTTATTCACATCCTACATGAGGAGATTGAGAACAAAATATGTAATGTAGATTCAAGTT 

GACAGTGAGTGAGGCCAGAGTTCATACCCCAGTCTCGTGGTCTTTCTGCAGCCTCATGAT 

AGAGCAAGGATATTGTCACCTGAGAAACCAACATTGAACCTCCTTCTCTGAAGCCCAGTG 

AGTGCCCTTCCTCTAGACTCTGGCCACAGCTGGGCCTCCCTCCATCAGCCCACATAACTC 

ATTGCAGTGTGCTTATCTGTTCACTACTCCTTGTCTTCAAAGGAAACTCCAGAATTGTAA 

TAGCTGTATCCTCTTCATCTCTGAACCTCCAGGACTTCAAAAGAAGTGACTGGACGTAGT 

AGGTGCCTGGGAAAGGTTTTTGAATGAATG 

Sequence 958 

GTGTGCCACCGCCACTACCACTGATTGCCGAGACCCACTCCAGCCCCTTCCTGGAGTCTG 
GCGGACCCAATTCATGACATAATTATCAAAAGTGAATCCAGAGGCTGTACCT 
Sequence 959 

CCGCGGTGGCCCGAGCGGCCGCCGGGCAGGTACATTTAACAATTGGACATACAGCACTGT 

AAACAATTCAAAGTCCATTGAGCTTACCATGAAAGAACTGAAACGTATGTAGGAACTGTG 

TCCTGCTGCAGACAAGCAGCTGCTCTGGCTCTGGTGCGGTGCGGTCANGAGGTGAGGTGG 

GCAGGTGTGGAAGGGAGAATTCCTGAGGCCAGTCACAATTATTGAAGCTAAATTAATTTC 

TTCCTGACACTTTAGCATTAATTCTAAGTAGTTCTAAGTTCTAAACCATTAACTGTATAG 
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TTAAI MM ITAAAT CAAA GAAGAAAAAGGCTTTCACACAGATGAATGTATAGGAAAACA 
CAA GTAATG ACCAATTTTCMGAGCCTTATTGTAAAAAAAGAMGAAAGAAAAAAGGACA 
ATTATTTTTGGAGTTACTGTCTCTAATTCTTAAAAAGTAAGTACCT 
Sequence 960 

CCGGGCAGGTACTCATCAGATGGAATGTTTTACCCTGCCGAGGTTNTAGTCATGATGTGC 
TGAGCTCTCTGNGTCTGACGTGACTGACTGGTANCTGGGCGTTGGCNNGACCCTCCTTTN 
NC TNTAN N AN CACATN ATGN AG AATTTN NC ACCTATGGG AAAATTAAANTGATNG ACATG 
CCCNNNGAAAGGATGCATNCCCATCTGNNCAAAGGCTATGCGTACCTCGGCCGCTNTAGA 
NCTANT 

Sequence 961 

AGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCTGGTTTTAACTTC 

ATGTGTCTGAAAGCAGCACTGAAGAGGTTAGAAAAGACCATCTTGATCTGAGGATGCCAC 

CCCTCTCCCATCTCCCAGCAGCAGCCACCGTGGCATGGAGAGAATCTGTGTGCTAGGGAG 

AGGGACAGCACAGTGATTGTGAGATACTGCTTTGAACTCAGTGCTGCCCTGTCACAGCTG 

AAAGCAAAACTGGGCTGAACTCAGCCAGCACCCATCCACACAGGGAGCATTTAGATGAGC 

TCTAGCCAGAGAGGAATCGCCCATCCCAGCAGTCTGAATATGAGTTCCAGCAAGCCCCGT 

CACTGTGGGCTAAATTGGTCCAAGACCTTAAATAAACTTGAAAGGGCAGTCTAGGCCATA 

AAGACT 

Sequence 962 

NCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCTGCCAGTTTCTTCT 
CCACTGCACTCTTATATTCTACCAAAAATTTCTCCATAGCACCAAATCCCGCCATTTCCG 
AGCTACGATCACCCGCGGCTGAGGAAGGACGAATCCCCCGCGTACCTGCCCG 
Sequence 963 

CCGGGCAGGTACTCCGCCATTTTACGTGAGAGACTTGAGCATTCTTGGATTTGGTATCCT 
CAGGAGTCCGGGAACCAGTCCTCCATGGATATCAAGGGATGACTGTTTGCCCGTGTTTTT 
AGCCTTTGTATCTTTGCTCAAGAAATCCCTAAAACCTGGAGTACCT 
Sequence 964 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTC t I II M M ICC I I I I I ll I 
AAGAGTATAAGGTTTACACAATCATTCTCATAATGTGACGCAAGCCAGCMGGCCAAAAA 
TGCTGGAGAAAATAACGGGATCTCTTCCTTGTAAACTTGTACCTGCCCG 
Sequence 965 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGTCCTTGTT 

TTCCCTCTGATANGGGTTTGGATCTGTGTCCCCACTAAATCTCATACTGAATTGTGATCT 

CCAGTGTTGAAAGTGGGGCCTGCTGGGAGGTGATGGGGGATCATGGGGGTGGAGTTCTCA 

TGAATGGTTACCACCATCCCCCCTTCATACTCTCTCTTGCTCCAGCTCTGGCCATGCGAT 

GTGCCTGCTCTCCCTTCCCCCTCCACCATGATTCTAANTTTCCTGAGGCCTCCCCAGAAG 

CTGAGCAGATGCCAGCCATGCTTCCTGTACCTN 

Sequence 966 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTAGCAATCGGGTTGCCAGC 

AAAGCACTGGATGCAAGCCTTGCCTTCCAGAAGCTTACCAGTCGGGTTGCCAGCAAAGCA 

GTGGATGCAAGACTTGCCCTCCAGGAGCTTACCATCACAACGAAGAAGACAAATAAATGC 

ATAATATACAGACGACATAAATCCATACTGCACTCCAGCCTGGGTGACAGAGCAAGACTC 

CATCTCAAAAAAAAAAAGAAAAAAGAAAAGAAAAAAAAGAAAGAAAAAGAAAAAACAAAA 

AACCANAAGAGCAAGGGAAGGCTCACAGGGCTGAATCTTGAAGAATGGGGGTTCTCAGGG 

TTACAGAGAAGGGGGAGGACATTCTGGACAGAAATGGAAATGTGTGAAGATATTTGTGTA 

CCTGCCCGGGCGGCCGCTCTAGAACTAGGTGGATCCC 

Sequence 967 

AATTGGAGCTCCCCGCGGTGGCGGNCGAGGTACCAATAAGCATATTGCTTTGGCAATGCA 

TCTCCAGAGCANGTGACCCTGGCCGTCTGTCCTGGGGACACTGAC 

Sequence 968 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCCTAATGAAACCCGTCTTTGG 

CCTGTGGCTGAGACGCCATCTGTAGGCGGTGAAATCTTTCCAGCCTATGTGACGAGGGTC 

ATGCCACAGGGTGCGCACCTGAAGGAAAAGGAAAAGGAAAGGTTGAAACACAGCTCCTTC 

AGGGCTTNTTCTTACCTGGCCAGGGGTGTTTCCTGTGTGCCACAGGGCGTTCCGCAGGTG 

CTCGCCAGGCCCTGTGGTGGAGTTTACAACTTTCACAGAAAGGCCCGAGTATCCCTGAGC 

CCTCGTGGGGTTGGTGTCCCAGTAGGACTGGGTGACTTGCTTCACATCACAACATAAAAG 
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CGGCTGCTGGACTGGTAGCCAAAAGACAAATNCAGCATAGTCATCGTCCCTTTTCGGTGT 

TGATGAAGAAGGTGCCACTTGAAGTCCACAGCATTAAACTCATCATAACCTGGAAGAAGA 

ACCCAGAGCCAGGTTAAAACCTGCAGCCTTCAGAAAGGTTCTTCCGTCATGTTTCCTAGA 

CATTCCTCAACACCCACAGGGGATAGGTACCCTGNCCCGGGCCGGCCGCTTNTAGAACTA 

AGTGGAATCCCCCCCGGGCTGCAGGAAATTCNATATTAAGGCTTTATTGATACCCGTCNA 

CCTTNAAGGGGGGGGCCCGGGACCCCAACTTTTTG 

Sequence 969 

TGGAGCTCCCCCGCGGTGGCGGCCCGCCCGGGCAGNTACFi I I I 1 1 I M I INTTCI Nil 
TAAGATTATTNTTNTAAAAGGGGAGATAGGTAGGAGTAGCGTGGNAAAGGTGATGAGTGT 
GGGGAGGAATG 
Sequence 970 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGCCCGGCAGGTACCTCATCCTGTCTC 

CAGGCCACTGAATAACACCCAAAAGAGCAAGCAGCCCCTGCCATCGGCACCTGAAAATAA 

TCCTGAGGAGGAACTGGCATCAAMCAAAAAAAAAAAAAAAAAAAAGTCCTCGGCCGCTC 

TAGAACTAG 

Sequence 971 

ACCNAAAGCATTCTACCACAGTTGTATTTGACTCCCACTTGTAATAACTCCTTTTAAAAA 

TTCCATGTTTAACCATATGACCCTGCTTGCTTACTCATATTCTCCCTCCCTCTCCCCTTC 

CTTTCTCTTCTTCCAGAAGTCATTTGCTGTTGAAATATTTTGTAGGGATTGCTTATTATA 

TTATTTTAGCTGATGAACCTCAGGACAACCGTCTACACACACACACATACATACACGCAC 

ACAAAATCTCAGCTGTTGAAGAGTGGGCTTGGAATCAGACTTCTGTGTCCAGTAAAAAAC 

TCCTGCACTGAAGTCATTGTGACTTGAGTAGTTACAGACTGATTCCAGTGAACTTGAATC 

TAATTTCTTTTGATCTAATGAATGTGTCTGCTTACCTTATTTCCTTTTAATTGA 

CCAAGTAGTTGCTAA I IT IT i GACAACTNTAAAATGAGTTNCATTCACTTCTTTTACTTA 

ATGGTTTAAGGTATAGACCTGCCCGGGCGGGCGCTCTAAGAACTAGGTGGATNCCCCGGG 

GCTGGCAGGGAATTCGAAATCAAGCTTAATCGATTNCCGNCCAACCTCGANGGGGGGGG 

Sequence 972 ____________ 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACl I I II I I I I II I I ICTCC 

ATGGTTCTTTNGGTATCTCTCCTTCCTTTCCCTGTCAGCCACAGTTCTTTATATTAAATT 

CTTTTGATGGCGAGAATCCTGGAGTAGGAGTGAGAACTGGACTGTAGATGCACCTCTCCC 

ACTAACCCTGTTNGGGGCAAATACGTCTGTCTCGTTGGGAGATGAACTTTACAAGTTCAA 

TTACTAAATCTGAAATATCTGGATTAGAATATTTCTAGG I II I I I I I CTGCTCTTGACAT 

TCTATGATAACTCCATGACAAATGTTCTTTTTCCAAATAATTAAAACTTAAAACATTTCC 

GTTTAAACCTTCAAAGACAAAGACAACCTTNTTCTTTTTGCATTTGGGGTCATAAA^ 

CTAAACTAATAAGGTTAGATGTTTTAGGGGAGAAATCTATTTGCCTACAATAACCACCCC 

CCCTCCAAAAAGCC 

Sequence 973 

CGAGGTACCTATGGGCCAATGAATCCCACCATTTTCAAATAAATCNTTNNTCANACTCCA 

CTCAGNTGGAAAGTTTTGCTGAAAATATTTCTGGATAAAAAACAAAAAACCTCATGTTGG 

AAATCAACTGTCCTTCCCATTTGGAACATTAAGAGGAGTCCCCAGGGAACACTGGTAATA 

GGTGAAAATGGTTCCTGGAGTCACCAGTGTTGTACAGAGCCGCCAGGCGACTCAATTCAC 

GTGCTCCTTTTCTGGGGATGAAAAGGTGCTCACCAGATCTTTCTATGCAACTGAGAGCTC 

CTAAATCAAGTCAAAGGGGACCATCGTACCTGCCCGGGCGGCCCGCTCTAAGAACTAGTG 

GATC 

Sequence 974 

CCGCGGTGGCGGCCGAGGTACTCAGAAGTGTCTCTAGGAGGCTGGGCCAGTTTCCTCTCT 

TTTCTGCAGGCGTCTCCAGAGCACCCTCTCACCAGCTGTATCTACTCACAATCGTCTGGC 

ATTTGGAATCTGGTTGAGTTTGGGTCCTCAATACCCAGAAAATAGAGCCTCCAGGACCCG 

CCCCTAAGCAGGAATTTTTCAGATCTCCCTTCTGGGTCCTTTGGTCCCTAAGTCTCTGGC 

TTTGGCATTCCTGGTGGGAATCCTTGCGGAGAGCCATCCTGGTACCTGCCCG 

Sequence 975 

CGAATTGGAGCTCCCCGCGGTGGCGGCCCGCCCGGCCNGGTACANTTNAATACATTANTG 

TAGTAAGNTATTAANTGGTGCCCCTATGATCTNCGAGAGGTAATACACTATCACGTGTTC 

CAAATTTTNACAGGAAAAGAATCATAGANTCCTATANCTGAAGGGGGCTNTACCGGGNTC 

TACAAANGCCTGCCAGGTGCTNGGATNTCCTNCATCACATNCANCCATGANAAGTTACTT 

GTGTCATGGTACCT 
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Sequence 976 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGCAGAAGTGTCTCTCAGTCAGGACAC 

AGCNTGGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCTGCnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTTCTGCATCCAGTTTGCAAAGnnnnn 

nnnnnnnnnnnnnnn nnnnn nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnCAACCTGAAGATTTTGCAACTTACTACTGTCAACAGAGTTACAGTATCCCCCTCACTG 

TCGGCGGAGGGA 

Sequence 977 

ACCCACTATAGGGCGAATTTTNAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTCTG 

GGTGGGGGCTGGGGGGGGTNCCGCCGACCGTTTTTCTCTTCCGGCCAGGTGCTTTTCTGT 

CAATTTCTATGGAATGCAAMGGAGGTTTTTGTTTTATTTTG 1 lilll 1 GTAAAGCTTAA 

GAAAAAMTCACATCTTATACTTGAGCCTCCATACTTAAAAAAAGAAAAGAAAAGAAATC 

AATAAAAAGAAACTGGGGCGCAGTT 

Sequence 978 

GGCC GCGGGGGCTGCGTAACCATGTTTNCATGAAACAAGCCAGCCANTTTGAGGNGTGCT 

TTTTGAGTGTAAGGCCATTATCATGGACTTTTTCCCCAGCTGCAGGGTTATCTACAACTC 

CGGGTGGAAGTCAAAGATTGGAAGAGGATGCTCTGATCCTGGAACTGCATTGCTTGGTGG 

TCAGGAGGGGTTCTTAGTAAGCAAACTCAGTTTAATCCCAGAGTCGGAAGTGGGTGTCAA 

CAAGCCAAAGCAACATCCTTTCAGCTCTGCTGCTTTTTTGTCTTGGTGGAGGAATAGCCA 

ATGGMGAAGATGGAGAGTCCGGACCAGAAAAAAAAAAAAAAAAAAAGTACCTGCCCGG 

Sequence 979 

GGTGGCGGCCGCTCTATAACTANGNCCACACCCTGNTATTCCTGCAGGTNGCCTTTTGCA 
GCTNCAGGC N NTATGGAAC AAATGAAG GTCN ATGTCGACTCTAATAGAAGTAATNGTNG A 
N NG GTGTT CTTCAC ATCCACTTCTGTTN NTG ATTGAGTTAGGCATCTCTTTCATGGTAAA 
ACCCTTTTCATTAAACACAAAGAAAGC 1 rTTTTTTTTTTTTTTTl 
Sequence 980 

GAATNGGAGCTTTTCGCNGTGGCGGCCCGAGGTACAATAACTNGCCTGAANTTCTATGGC 

AACAGGCATTTANTNAGCGGAGCGGTTTTATGGNCTCATCTGTATCTGGGATGCAAAGAA 

ATGGGMTGCCTGANGTCAATTAAAGCTCACAMGGACAGGTGCCTTCCTTTCTATTCAC 

CCTTGGCAAGTNGGCCCTGTCAAGTTGGTACCTGCCCG 

Sequence 981 

TAAAAATCACGTCACTAT ATAGG CACTGCTGTATGGAAAACGCATTTTGTGTTTCTACAA 

ATTGTMGCGGA AATG CCTTTTGAGTACGCGGGATTTGGATGATNGACATAAGGTTTTTA 

GCATGTNCCTCCTTTTCTTCACCCNCCCCTTTTTTCTTCTATTMTCANGAGAAACTTCA 

AAGTTAATGGGATGGCCGGATCTCACAGGCTGANAACTCGTTCACCTCCAAGCATTTCAT 

GAAAAAGCTGCTTCTTATTAACCATACAAACTCTCACCATGATGTGAAGAGTTTCACAAA 

TCCCTCAAAATAAAA 

Sequence 982 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACGCCTTGCCAGACCAGA 
GTCAGTTGTATCATCCTGGGGCAAGTATGAGGAGAAGCTCACGATTACCAGGCACCTCAT 
TGTGAACATGCTTTCTGCAACGCCTGCATCACCCAGTGGTTCTCTCAGCAACAGACATGT 
CCAGTGGACCGTAGTGTTGTGACGGTCGCCCATNTGCGCCCAGTACCTGCCCG 
Sequence 983 

AGGGCG AATTGG AGCT CCCC GCGGTGGCGGCCCGAGGNC l rTTTTTTTTTTTTTTTTTT G 

TCTGAAGTTTTTTCCCATTTTATATCGTCAATATCATCACTCATTTGAAAACTAGG 

TCCAAGTTGATGTCCTGACCCAAGGCACCCCAGGTTTCCAAGGATTCGTAATCTCTTTGG 

AGCTGACCTATAAAAAGCAAAAGAGTTACTGAGAGTCACTTTCAGGGAAAGCTGGGTTGG 

GACTTCTTTGACTCTTATGCTTACCTTTGGAAGAAACATTGAAAATCAGAAGGAAGCTGG 

CTACAAATTTACATGGAAAGCTCATCTTAGGGGTTGGGTTCCTCTTTTGATTCTCCCCGC 

GTACCTGCCCGGGCGGCCGCTCTAGAACTANTGGGATCCCCCCGGGCTGNNAGGAATTCG 

ATATCAAAGCTTATCG 

Sequence 984 

TCGAATTGGAGCTCCCCGCGGTGGGCGGCCGAGGNCAGAGCCGCCAGGCGACTCAATTCA 
CGCGCTCCTTTTCTGGGGATGAAAAGGTNCTCACCANATCTTTCTATGCAACTGAGAGCT 
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CCTAAATCAAGTCAAAGGGGACCATCGTACCTGCCCGGGCGGCGTCTAGAACTAGTGGAT 
CN 

Sequence 985 

CGCGGTGGCGGCCCGAGGTAC1 I I I M I I I 11 1 1 1 I I I I i I 1 1 GCGCGCCTGCTGCCTTC 

CTTGGATGTGGTAGCCGNTTCTCAGGCTCCCTCTCCGGAATCGAACCCTGATTCCCCGTC 

ACCCGTGGTCACCATGGTAGGCACGGCGACTACCATCAAAGTTGATAGGGCAGACCGTTC 

GAATGGGTCGTCGCCGCCCCCGCGTACCTGCCCG 

Sequence 986 

ATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCATGGTACGCGGGGGAGACCAAGGGCTA 

ANGCTGGGAGGTGAGTCTGTCACCTTGAGCCGGGCGAGCGCTGTGGGCCAAGCAGGGGTT 

GCAGGGCAGTAGGAGTGCAGACTGAAAAAATGCAGACCGCCGGGGCATTATTCATTTCTC 

CAGCTCTGATCCGCTGTTGTACCT 

Sequence 987 

TATTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCATGGTACAAGCACCA 

GCACCAAGGGATAATCAATGGGTGGCCACCACGCATTCACAAGTATGCCTAGGTTTGAAG 

CTGTGGAACCTTTTAGCTCCAAGTTCTCTGCATGGGTATTGGGGCAACCTAAACTCCTAA 

TCTAGGAGAGTGAGTACCT 

Sequence 988 

GGCGGCCGCCCGGGCTTGTACCTACTTANTTCCAGCTGATGNTTGACCAAAGAATTTGCA 

GGTGGATGGTTTGGTATCACTGTAAATAAAAAGAGGGCCTGAAAAAAAAAAAAAAAAAAA 

AAAAAAGGAGTACCCAAGATCCCCAAATAMTTAGTAAAAGNGCCATTAAGAATTATCAG 

ACATAGMTGTGTTTCAMTACAAATTNGTTAAATCCAAAAGATACAGTAATTTTATTAC 

ATAAMTTTCTCCTGCGNGCTAAAAATGCTMCTTACAAGACATAATATTTGTAGTAGAG 

Sequence 989 

GNGGGNCTGGGACAGAGTTCACTCTCACCATCAGCAGCCNGCAGNCTGAAGAANNTGCAA 

GAATANNACNGGGCAGCAGCATAATAACTGGCCCCCGNACCTCGGCCCGCTCTAnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnANNCGANAACAAGCNNATCGATACCCGGNNACCCAG 

AAGGGGGGGGCCC 

Sequence 990 

TAGNGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATGCATGACGCTGGAGGAGA 
TGGNCTTCGAGAGGGGAATCTGGTCGGCAGCCCTGAATGGAGACCTGGGCCGAGTGAAGC 
ATTTAATCCAGAAGGCCGAGGACCCAAGTCAGCCCGACT 
Sequence 991 

CCGCGGTGGCGGCCGAGGTAC HI I I I II 1 1 II I I II I I I II I III I ATCATGAANAACA 

CAAMTATTACCCAAAGATGAAGATTTAACAAAATTAATAATTTTTACTGTTTCACCAAG 

GACTTTATAGTGAAACCGAC M M I I 1 1 I III I ACACTGCAAGTATGTGGCAGTCAANAA 

CNCAATGGC TACT AACTAGCACCGTTTCATGCCACTAATGCTGATTCACACTACAGTGCA 

AGTAATTTAATTTTACCCATGGNGCTNTTGCNCCATCAGCACAAATGTCAACACAGTGGA 

AN AGCCAAACAACATTTTA A I 111 NIG 

Sequence 992 

CTACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGGAATTTCCATCGGTTTGC 

TGAGAAGCACAACTTTGCAAAACCCAATGACAGCCGTGCTCTCCAGCTGATGACCAAATG 

TGCGCAGACTGTGATGGAAGAACTAGAAGGATATTGTGATCGCGTATGGACAGAGTGATG 

AGTACCTGCCCGGGC 

Sequence 993 

CCCGGGCANGTACATCAAGC I Mill IACATCAAACCCCAAGGCTCTCATTGCCACCTTT 

AA7TCATG ATAA TCTATTGCTTCATCTTTGTCTGTATCAAATAGTTCAAAAGCATCTTTA 

ATTTCTTGTTTTGTCCTCAGACAGTTCTCTGANGNTTCCTCTTTGTTTTGTCCACTAG^ 

GCTCA CTTCTCAGAGCTAAACTCATTATCTCTTCGCACAGAGACGTTCCTCTCAAGAACG • 

ATTTTAACCCCCTACCCAAGGCAGCAAGACGCCCACAGCCGTTCAACAGACACGAACGAC 

CTCAGCGGCCTCCCCCGCGTACCTCGGCCGCTCTAGAACTAAGTGGATCCCCGGGCTGGA 

AGGAATTCGATTTCAAAGCTTATCGATAC 

Sequence 994 

GCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCGGCAGGTACTTTTCTAGAAAAGCCTACA 
TGTATATACTTAGTTACAGCTGCACTTCCCCATTACTTATTTTTAGGAAGGTTATAGAGA 
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TGGAATAGATGCTGGCAAAGCAGTTACTCTTCAACAGGGCTTCAATCAAGGTTATAAGAA 

AGGNANANGTCANTNTANACTATGGACNACTCCGAGGMCATTGAGGTAATTTTTAAAGT 

CTAAATGCTGAATCATTTTAACCTCAATACTACTGGAGGATGTTTCTGTATAAATAAAGT 

GTTTAAACTGAAATGCTTTTCCTGGTGCTAAATACACTAAAGCGTGTCGCAGATCATAGA 

ATTATATTGCC TTCAA AAGGTCAAAATCNTNATNAGNCTCATCTACTTTAATGTGTGAAC 

TACAATATTGTCTTTTCGTGCAAAGAAATGGTAAAGAAGATGTATACTTCTGCTACCTGA 

ACAATTATCTATCTCATTGAAAGGTCTTCAGATTTTGA 

Sequence 995 

GMTTGGAGCTTTCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGTTmAGCTTTTT 

AAATCTGCTTTGGTATACTCC ATAN N NTTTGTGCCATTGG ATTATTCTGTTCCTATAG AA 

ATCCCCACTATAAAATGTAAACCAGACAAACTTCCATTATTCAAACGGCAGTATGAAACC 

ACATATTTGTTGGCTCAAAAACTGCAGATCCTTGCTGTTACGGTCACACCCAGTTTCATC 

TGTTACCTGACAAATTAAGAAGGGAAAGGCTTTTGAGACAAAACTGTGCTGATCAGATAG 

MGTTGTTTTTAGAGCTMTGCTACTGCAAGCCTTTTTGCTTGGACTGGAGCACAAGCTA 

TGTATCMGGATTCTGGAGTGAAACCAGATGTTACTCGACCTTTTGTCTCCCAGGCTGTG 

ATCACAGATGGAAAATGCTTTTCCTTTTTCTGCTACCAGCTAAATACTTTGGCACTGACT 

ACACAAGCTGATCAAAATAACCCTCGT 

Sequence 996 

CCCGCGGTGGCGGCCCGCCCGGGCAGGTTCACCGTGTCAGCATTTGTTGAATTNGCACTT 

ATTGTTNAATTTAGCTCTGGAACMTGCAGGGAATTTGAAGTTTCTTGTAAATAACCACA 

ATTAGGAAAAAACCATACAGCTCAAGGAAAATCCACTAGTATANCCAAGATACCCTAAGT 

TCTTCAAGAGACACAGANGGGAGAATTATGCCAAAGGTAACTATCACCACCAGAACGCGG 

CCATCCACGTACCTCGGN 

Sequence 997 

CCGCGGTGGCGGCCGAGGTACGCGGGGGACGTTAGGTGTCCGCCGGAGGTGTCGTTGGTG 
TGTTGCGCGACTGGCCTTGAGGGAGAGCTGGGGCCTGCTCCCGGAGAGATACCGGCTATG 
TCGATCGAAATCGAATCTTCGGATGTGATCCGCCTTATTATGCAGTACCTGCCCG 
Sequence 998 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTNCCAACAGAAACCTGGCCAGGCTC 

CCAGGCTCCTCANCTATGATGCATNCAACAGGGCCACTGGCATNCCAGCC 

Sequence 999 

CCGCGGTGGCGGCCGAGGTACTCACGCCTGGTCGGGTGGCAGCTCGCGGCGCAGCTCGTC 

CATGGTAATGTAGTTCTTGTCCCCAGCCAGGATCTTGAAGGAAGCCATGACTTGGTCTGC 

TGTATCTGTGTCGGCTGTCTCGCGGGACATGAAGTCAATGAAGGCCTGGAATGTCACTAC 

CCCCAGGCGGTTGGGGCCCACGTGGCATCGATCCTCCCTGCCCGCGAAGTGACAGTTTAC 

AAMTTATTTTCTGCAAAAAAAAAAAAAAAAAAAAGTACCTGCCCGGGCGGCCGCTNTAG 

AACTA 

Sequence 1000 

CGGCCGAGTGTGGCTCTTCTGGTGTTTCAGCTTGGGGAGAGAGGAGTGGCCTTCCTCTTG 

CAGTTGAGGCCGGCGCCGAGCCGGACTTCAGCCGGATCTCGTGGCGGAGCCCATCTTGCT 

CCCTCTCCAGGCCTTTATCCGCTCCCTAGGATTCCCGGGCCCTGTAGGTGGGAGTTGGGA 

GACGACAGTACCT 

Sequence 1001 

AGGTNCCCCTGCGCCAATCATCACAGCCTGGTNTGTATNAAGTGTGTGCCAAACTGCCTG 

CGCTAACNACCAAATGCAAATGTCATACNTCTNTGTCACTAGCANCATTAAGTTNCTNTA 

GAGGACACGTTGTGAGCCACACCTNAAAAGACAAGCCTTCATCAATCTTTTCCTGGAAGA 

TAGCTTGACCTAGCAAATTCATTATGTTCAAAATAANAGNCTATCTTTTCTTTNGTACNC 

TGGCCCCGGNGCNGGGCNCCGCTTCTATNGAANCTAAGTTGNGTATTCCCCCCGGNGCCT 

NNCCAGAGAAATTTTCTTATTATTNCAAAAGNCTTTNATACCNATTTNCCCCGGTCCGNA 

CCCCTTTTGGAANGGGGGGGGGGTGCCCCCCGGGGGTTAACCCC 

Sequence 1002 

• CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACi I M I I I I I II I M I 
TTGTTTTAAAGTAGTATTTTATTTCTCTGGGCTGTGCACACTTAAGGCATTAGATCCACA 
GATGGGCTCATCCGCTGGGATGCTACTTGTGGTGATCATCTCCTAAGCCCCACGTTTTAG 
CCTTGTACCTGCCCG 
Sequence 1003 
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CCGCGGTGGCGGCCGAGGTAC I II M 111 IN 1 1 11 I I M N 11 1 A CCAAGTCTT ACAGT 
GATTATTTTACGTGTTTCCATGTATCTCACTTTC 

GAAAATCTACGTTGTACGTGGGATACAGGTCACGGGCAGAGCTCCTGGCCTCAATGATGC 

CTCCTGATCTATCGCTGGGCCTGGACGACCAACACTGGGATGATGACNAGCAGAATGGTC 

ATGAAGATGCTCAAAATCAGGGCCCANATGTTCAGGCACTTGGCGGTGGAGGCATAGGCC 

TGGGCCCCG 

Sequence 1004 

TACTATAGGGCGAATTNGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATTCTCTG 

AAGAGGAAGCAGCTGGTTGGACAGGATTTCTTGAAGAGCCAGGTGCTAAGGGCATCAGGT 

CGACATCCATAGTAACCATGTGCCATMCATCTACACATTTCCACTTGTTTTACAGACAA 

GGTAACAGGCAGAAGGAAAATCCAGAGTCCTGCAGTAAGCAGATGACAAAACTTCAATAT 

GCTTGGGCACCACTTAGGTGACCCCAGGGAGATTTAGTGTGGCCTTAGGAAAGCAAAAGA 

GCACTTTTTATTGGAAATATGAGCTTGTCACTGGGAAAGATTTGTAAAATTGATCAAGAA 

CTTGATTTATAATTATGCCTCAAAAAAAAAGTTCTCATTTAGTAGTGGAGCAATCTAGAA 

AACATACCI II I I IGTT 

Sequence 1005 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAACAAACCC 

CTCAAGAAAGGGGCAAAAAGGCCCCGAGTCACTTCAGGTGGTGTGTCAGAGTCTCCCAGT 

GGATTTTCTAAGCACATTCAATCCAATTTGGACTTCTCTCCAGTAAACAGTGCTTCTAGT 

GAAGAAAATGTGAAGTACCTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnCGCGCTTGGCGGTAATCATGGGTCATAAGCCTGTTT 

CCTGTGTGGAAAATTGTTATTCCGCTCACAATTTCC 

Sequence 1006 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCCGGCCGCCCGGGCAGGTACTTTTTTTTT 

i m 1 1 n in 1 1 1 n n i n 1 1 actttg aaattactttaatttaaaaatan aaaac atn 

TTGAAAGGAAAAAAAAACCCACAAAACATACAGGCAGATT TGTAT TCTGTGAGTTTCTNA 
CACCCTCACACTTGTTCACCAATNTNTAANATGAAAMCTNTTTTCCAAAATGTATTCAG 
CC NCCAACAATGG NGTTATTAATAGGAATATGGATCAATTTC ACAAAN AANCC AATG AAT 
CCCATTNTA 
Sequence 1 007 

GGGCGAATTGGAGCTCCCCGCGGTGGCCGGCCGCCCGGGCAGGTACCAAAGACTTATGAA 

TAGATTTATGMTTGGAGTAGAAAACATAGACATTTCTACCTTGAGGATATAGAAGGGAA 

CTTAGGAAGTGAGAAGTCAGATTACCCTAGTCCTCTAGGGTGCTTAAAGCTAACTAGTCC 

CGTGAATGAGAATCTAACCTATGTGAAAATCTCCAGCCTGTACCTnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnGC I I I 1 1 GTTTCCCTTTAGTGAGGGGTTAATTGCCGCCGC 

TTGGGCCGTTAATTCATTGGGTCATAAGC 

Sequence 1008 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAATTTGATCAAGCTATTAAAATGCTATAAAAA 

TATTACCAGTTTGGTGGTCTTCTAATGAGAAATCTAAAGCACACCTCCCCACCCACTCGG 

ATATCTGTTTCTGATGATCACGTGACATCTAGCATATACGTCTGTAGGAAAAGGGAAAAA 

GCAAACCCATTTCACAAATGGCTAAAGTCAATCCCGCGTACCTnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTCGATACCCGTCGACCTCGA 

GGGGGGGGGCCCGGGTACCCAGCCTTTTTGTTTCCCTTTAGTGNA 

Sequence 1009 

CCGCGGTGGCGGCCGCCCGGGCAGGTACATCTTCTTCCGGCTGATAATTGCATTATGGAT 

GAACACAAACGAGAAATTGCGGMGCTMGCAMTTGTTAGAAAGCTTACGATGGTTGTG 

TTGTCTTCTGAAAAAATGGATGAGCGATAAAAAAAAAGAAAAAAAAA/XAAMGTACCTn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnNGAGCGGTTAA 

TTGCGCCGCTTGGCCGTAATCATTGGTCATAAGNCTGTTTCCT 

Sequence 1010 

CCGCGGTGGCGGCCGAGGTACCATGATGATGACCTGGCAGATTTGGTCTTTCCCTCCAGT 

GCGACAGCTGATACTTCAATATTTGCAGGACAAAATGATCCCTTGAAAGACAGTTACGGT 

ATGTCTCCCTGCAACACAGCTGTTGTACCTGCCCGnnnnnnnnnnnnnnnnnnnnnnnnn 
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nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnGTTAATTGCGCGCTTGGCCGTAATCAT 

GGTCATAAGGCTGTTTCCTGTGTGAAMTTGGTTATTCCCGCTCACCAAATTCCACACCA 

AACANTACCGAAGCCC 

Sequence 1011 

CTNCTTAGGGCNAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGTAGGAT 

AAGGAGGAAGGCAGTAAAGCTGCAGTATGTCCTTGAAGCGTGTCAAAGTGGTATGGTAAG 

GAAMGGAGAGTTTTATCTCACAMGCCATAAACACTAAACAACTTMTTTTCCATTCCA 

GAAAATCAGCAGTCATCMGACAGCACTTGGTAAAACATTTAAGATTTGTGATAATAACA 

AGTACCTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnaCGGTTACCCAGCCTTTTGTTCCCCTTT 

TAGTGGAGNGGTTTAATTGCCGCCGCCTTGGGCGTAAATCATGG 

Sequence 1012 

CCGGGCAGGTACGCGGGGTGTGAGACTTCGGCGGACCATTAGGAATGAGATCCGTGAGAT 
CCTTCCATCTTCTTGAAGTCGCCTTTAGGGTGGCTTGCGAGGTTANAGGATTTGGTNGGT 
TTGGGTGGNGCC TGTTTCACC GNGGNAAAAAAGGGGTTGNCCCCCCCGGGGGGGGNAAAA 
AAAAAACCCCCCC I t I I I I I I I I GGGGGGN I I t I I I I I I I I CCCCCCNGGGGGNGNTAAA 
AAA NGGG GGNNN TTTNNAAAA AAAAA 1 1 HI I I I ( I NCCCCCCCGGNGNNGGTTNTCTCC 
CCCTTTTCC CCCC I I I 1 1 I 1 1 I NNAAAAAAANAANGNG I ITTTTTn I AAAAAAAAAAAN 

rrrnrm i ggggg gggggggntttttta t mum iaacccccccccn i him inggg 

GGGGAGTTTTTTTNNGGGGGGNNNGNNAAAAAAAAAAAAAAAAAAAACCCCCCCNAAAAN 
AAAA AANNCCCCCCCCCCCCAAAAAATNTATAAAAAANANATAAAAAA l rTTTTTTTTTT 
TTTTN N AAA ANTN N NTAAAAAAAAATNN N AATN AAAN ANTTN AAAAAAAAN N NNTTNTTT 
CTTTNATCATTTTATTANGGGGGGGGGGGGGGTAATAAAAAAAAAAAAAA I M M M M I 
TT 

Sequence 1013 

CCGCGGTGGCGGCCGAGGTACACCAAGGCGACCACAGCAGCTGCAACCTCAGCAATGAAG 

ATGAGGAGGAGGATGAAGAAGAACGTCACGAGGGCACACTTGCTCTCAGTCTTAGCACCA 

TAGCAGCCCAGGAAACCAAGAGCAAAGACCACAACGCCGGCTGCGATGAGGAAGTAGCCC 

ACGTTGACAAACTGCATGGCACTGGACGACAGTGGCCCGAAGATCTTCAGAAAGGATGCC 

CCATCGATTGACACCCAGATGCCCACTTGCCAACAGGGCTTGCACCACACAGGAAAGATG 

AAGCCAAATTGAAGAAGGATCATCATTGGGTCTTTAATGAAAGCCTGGAAGNCACNTGCC 

ATGGGTGGGCTCCCTGTTTCAGGG 

Sequence 1014 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGGGGAAGGTCAGCGCCGTAAT 

GGCGTTCTTGGCGTCGGGACCCTACCTGACCCATCAGCAAAAGGTGTTGCGGCTTTATAA 

GCGGGCGCTACGCCACCTCGAGTCGTGGTGCGTCCAGAGAGACAAATACCGATACTTTGC 

TTGTTTGATGAGAGCCCGGTTTGAAGAACATAAGAATGAAAAGGATATGGCGGAGGCCAC 

CCAGCTGCTGAAGGAGGCCGAGGAAGAATTCTGGTACCTCGGCNnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnNAATTCGATATCAAGCCTTATCGATACCCGTCCGACCTCGAA 

GGGGGGGGGCCCCGGTACCCCAGCTTTTTGTTCCCTTTTAG 

Sequence 1015 

TTCGAATTCAAGAAGGGCAGCCAGCAGAGGATGAACACAAAACCTTAGGTCACTGGGTGA 

CAATCAGNTG ACACAANAAAAGTAGGCACAGCACNTGAC N N GNTGTATTTGACNN CACTT 

TGCAAGAAACACTGACTTGANNTCANCTTTNAAANGTNTAGCAGTTNCCTATAATNANAT 

GGCCANTACCCTNTTTCCATC CCCTC TNCTTAGGANCANTATTCCCNTTTCCACNCANTAN 

AGGTAAGGANCAATTCCTNATTTTTNTTAAAAAGGGGNCAAANNAACCTTGGGNTCCNCA 

ANANANAANCN GGTTN TCTCGNTTGNTCNANTTTCTTAGTTNCCAAACGTTNCTTTCCNG 

ATNGGTTATGNTTTTTNTAAACCNAMNAMNANMNTTTCCGGTNGGGGTTTCTTTCTT 

TNTTAATTN 

Sequence 1016 

TGGAGCTCCCCGCGGNGGCGGCCGCCCGGGCAGGTACTTTTTTTTTTACAAGAGTGCAGA 

AGAGAGCGAAAAACTCGTGAAAGCTGAAAGAAAATTCATTGAAGATAGGGTTAAAAAAAT 

AATACGAACTGAAAAGGAAAGTCTGTGGCNGATTCACATAAAGGATTTGTTGTTATTAAT 

CAAAAGGGAATTGACCCCTTTTCCTTAGATGCTCTTTCAAAAGAAGGCATAGTTGCTCTG 

CGCANAGCTAAAAGGAGAAATATGGAGAGGCTGACTCTTGCTTGNGGTGGGGTAGCCCTG 
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AATTCTTTTGACGACCCTAAGGTCCTGGACTGCCTTGGGGACATTGCCAGGGACTTTGTA 

TATTGAGGTATACCATTTGGGGGAGGAAAGAAGAAAGTTTTA 

Sequence 1017 

ACGACTNCTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACGCG 

GGCAAAACAGTTTTCCTAATCTCAGCAGTATCCAGTGAGTGAAGAACACTTGACTGACTC 

TTGGGCCACCTCTGTTACTTACTGTACTATGGAAGCTCCTGGTGAATGTTTACAA7TATG 

GGATGTAGTATTTCTATTTGTACTTTAAGTCAAATGCTTATATGAAATATGTGACAACAA 

ATAGAGAAGACTGGCTCTGTTAGTAATTATGCAGTATGTACCTnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnNCTTATTCGATACCCGTNCGACCT 

TCGAGGGGGGNGGCNCCGGCTACCCAGCTTTTTGTGTCCCTTTTAG 

Sequence 1018 

NTGAGGGGTTAATTTGCCGCGGCTTGGGCGGTAAATTCATTGGGGGGTTTTTTTTCCCCC 
CCAAAAAAAI I \ I I I I I I AAAAMGGGGGGGGGNCCCCNTTTTTTGGGGGGGNATTTNNA 
rTTTTTNNTTTTI I ANA AACCCCNNACCCCCCCC II I 1 1 M GGGGGGGGGN I I I I I I I I G 
GGGGGGGNN rTTTTTTTl ( GGGGGGGGGGNNAAAAAAAANGAAAAAAAGGGGAAAAAANT 
rTTTTTTTTTTTTTTTI I GGGGGGGGGNGGNNTNTTTTNN I MINI I AAAAAAAAAAAT 
Mill INTGNCCCCCCNCCCCCCCCCCGGGGGGGGGGGCCCCCCCI I H M t I I 11 1 M I 
CCCCCCCCCAAAAAAAAAAANCCCCCCCCCCAAAAAAAANNNAAAAAANNNNNNTTTT^ 
TNNNTTTTTTTANNNNNNNNCCCCANNNCCCCCCCCNAAAAAAAAAAAAACCCCCCCCCC 
AAAAAAAAAAAAAANNCCCCCCCCCCCAAAAAAAAAAAGGNGAANAANGGNGGCNCCCCC 
CCCC 

Sequence 1019 

TGAGGATCCTGTGCGCGATCGCAATGGACAGCTACTAATGGTCACAACGACCTCTTTCTC 

TAATCCTTACTTTATACATCAACCTTCAGGTGGCTCCCCAAAATGTTATACCAAATCTGT 

AAAATGCAAATAAAACAAAAGACAAATCTCCATCACGTACAACATGAACTGAACAGCCAT 

TTAAAAATTCAATGGGTGAGAACTTCTTGGACCTCATTGGCTAGATTCCTTTCTTAAAAT 

CCATACACCCGNAATTTMCAGGGAAGGTTCCCCATTCCTTGGAAGGAGCAAACTTTCAG 

NATGGTACCAAGACCTTGGAGNTGAAAATTATATTTT 

Sequence 1020 

NATACCAGGCGTTTCCCCCTGNNAAGGCNCCCNTCGTGGCGGCTTCTTCCATTTGGGGGG 
TTTTTTN I I 111 i I i NCCCCCNCCCCCCNGGGGGGTNGAAAAAAAAANCCCCNGGCCCCC 
CTNCCCNTTTTTTTTNNGGGGGGGGGNNNCCCCCCCNGACCCCCCCGGGGGGGGGNNNAC 
CCCCCN rTTTTTTTTTTTTTTTTTI I AAAAAAAAAAACCCCCCCCCNCCCCCCGGGGGGG 
GGGGGGGGGGGGTTTAAAAAAAAAI 1 1 1 I t I I I I AAAAAAAATACCCCCCCATCCCCCCT 
I 1 1 I 1 i I f NNGGGGGGGNN I I I I 1 M 1 1 I I I ICCCCCCCCCCCCCCCCGGGGNGGGGNGA 
TTTNTTTNTNTCCCCCCCCNNNATTT^ 

CCCCCCCI rTTTTTTTTTl INTTNNNNAANNNGNCCCCCCCNNTTCCNCNNNATTTTTTN 

TTTTTNATTATANTANCCCCCCCCCNCCNATAT^ 

AAAAAAAAAAAAAAA 

Sequence 1021 

CCGCGGTGGCGGCCGAGGTACTTACAGGGGACCGCCAGGGGCCTCGAGAATCGGTATCCT 
GAGTCCTCTTGAAGAGCAGTAGAGGTTGTTTCATTAAGTGGCAAACACACTTAGTTGCTT 
AATTTGAAAACNGGGGGGACACNAATACNGGCANCTCGCGNNGGGGGGANGGGGGNTTTT 
AAAAAATTTCCCCCCGGTTTTTCTNAAN AAAAAAAAA 

ATTTATTTTTTTTNAATNTNGG NNCCCCNC N ACCCCC NAAAAAATNTTTTTTTTNTATTT 
TGGGGGGGGGGGNCCCCCCCTTTTTNTAAANAATANAAAAMCNCNCCCCCCNCCTTT^ 
I I I i rNCCCCNI illlill 1 1 ANAAAAAANNGGATTGGGNAAAAAAATTTNTTTTGA7TT 
TTTTTNNCCCCCC I WWW I AACCCCCTNNGGGGGGGNGGNGGGGGNGNTTTTNNCCTC 
NTTGNGCCCCCCCCNNAAAAAANAANAACCCNC I rTTTTTTTTl I GGGGGGGGGNGNNGG 
NGGGGNGNANTANAAAAAAANGGGGGGGGGGGN1 HUM I NTNNTTCCCCCNCCTTTTT 
TTTTTNGGGGGGGGGGGAATAAAAAAAAANAAAAAAAAAA 
Sequence 1022 

ACTTAGGGCAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAGGACCTNTAACTCCCC 
CTGACACAGAGCAATTAGACTCCCATAACAATGGTATCAATTATACCACTCCATTGGAGG 
GACTTCCTTTATGTGTCACCCAGGATACATTGCTCAACTGCAAGTTGGCCTTTG GGGG GC 
CCCCAAAAAAGGGGGGGN rTTTTTTTTTTTTTTTTI I GGGGGGGGAAAAAAAANTTTTTN 
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T NCCCCCCC CCCCCCCCCCCCTAAAAAAAAAAAAAAAGGGGGGGNNCCCCCCCCNAAAAA 
Am ii i i i i t NGGGGGGGGGGGG GGGGG I HIIM11I1IH I GGGGGGGNNAAAAAAAA 
AGGGGG GG1 I M 1 M t M M 1 1 I I ANAAAAAAACCGCCCCCCCCCCCCCCAAAAAAATTT 
Mil 1 I 1 AAAAANNAANAA ANTNTN TNNAAAANNNGNAAAAAAGGGGGAAAAAANTNAAA 
AAAN CCCCCNAAA NNNTN I 1 I I 1 1 I I 11 I 1 H 1 I I 1 I N AAAAAAAAAA A 1 1 1 1 M 11 U G 
GGGGGGGGGGGG I I I I I I 1 1 M i AAAAAAAAAAAAAACCCCCCC NTTCCCCC CCNTTTTT 
TTTTTTGGNGGGGGGGGGGNNCCCC 
Sequence 1023 

CCGGNCAGGNACTTGACCTTNATCTNCTAACCATTCTTTTATGTCATCAAGAGTTGCATC 
ANNCGGGAAGCCTTTAATATNAACGGATCTGNNNTNNTACATCATTCNTATACTCANCAG 
TT TCACCTTTCATAG TTTAGTGGAGl II I I I M I ATTTTTTGGGGNANAAAACNCCCCCN 
NCTTTTTTTTTTTT1 NTNNAGGGGGGGGGGGGGGNTTTTTTTAAAAAAGGGGGTGNTGNA 
AAAAA ANNGNCCCCCCCGNAAACCCCCCTTTTTTTNNGGNTTmGNCC 
TTTTTNGGNGGGGGGGGGAAAAAAAAAGNTTTCCCTATTTTTNGGGGTTNTGTNTT^ 
AANAANAAAAAAAGGGGGGGGGGG I VTTT1 J 1 N7TTTATNNANTNTCCCCCCGTNTTTTT 
GNAAAAAAAAATNNTTTNTTT NGGCC CCCCCTN I I HIM I NNNNCCNGNGCCCCCCCCN 
AAAAAA AAAAAANCCCCCCCC 1 1111111(11 TNGGGGGGGGGGGGNTTTAAAAANAAAT 
1 1 1 I 1 1 NNT AGNTTGNTNTA GNTAAATAAAAA AGAGG GNCCCNNNTTGCCCCCCCNANAA 
AAAAAAAAAI I I I I II I I I I NGGGGGGGGGNNTTTTTAAAAAAAAAAANNGGGGGGNGTT 
TTTTTTTTTTTTTT 
Sequence 1024 

CCGGGCAGGTACTGACCTTGGAAGCTAACCCCTGAGTATGATGCAACTCCACTCTAATGT 
AMTTAAAATGCCATGATCTTAAAAATGCCATAATATNTGTCAGGTATMTTTTMT^ 
C CCAGGNT TN AGGTTT C CCAATC I I llUll NTCCCCAAAAAAAAAACCCCCCCNAAAAA 
AAI I I I I I I NNTTTTTTNN I II I I I I AAAAA AAAGGGG GGGGGGCCCCCCCAAAAAAAAA 
GGGGGGN I I III 111 l M GGGGGGGGNGl WWW I GNGGGGGGGGN I rTTTTTTl (GGG 
GGGGGGG I H I I I I 1 1 1 I ICCCCCCCl 1 1 I I I I \ NNGGGGGGGGN I I I 1 1 1 I I I I GGGGG 
NNN GGGGGGGNATN TNTTCCCCNTNTTCC CCCCCCGGGG GGGGGG I I I I I I I I I NNCCCC 
CCC TTTTTl mm INCCNCCCCCCCCCCCI nTTTTTTI I NGGGGGGNNNNNNNNNNN 
NNNI rTTTTTTl I NGGGGGGGGNGTGTTCCCCCCCCNGNGTCCCCCCCNAAAAAAAAAAA 
AAN GGGGNGGG GTGGNCNCCCCCCCCCNAAAAAAAAAAAAAA I M I I H I I NNN Ml 1 1 l 
TTTN I I I I I l I I NNNNNNCCCCCCCCCN AAAAAAAAAAA 
Sequence 1025 

CCGGGCAGGTACAGCACTGGAACTTCTTGAGCAGGAGCATACCCAGGGCTTCATAATCAC 
CTTCTGTTCAGCACTAGATGATATTCTTGGGGGTGGAGTTGCCCTTAATGANAAACCAAC 
CAGTAATATTTT GGTGGNTGTCACTCA NNTTGGG GGGGTTA l Mill I G GGGGGGGNTTT 
I I II I I II I I I IGGGGGGGGGGGGGGNTTTTTTAAAAATTTGTAAAATNTTAAAAATNTN 
AAAAA ATTTA AAA AANNTC NCCCCCCNNAAAAAANNTNNCCCCCCCAAAAAAAAAANAAA 

aaaan rrrrrrTT Tm itttt ttnnaaaaaaaai i rrm igggggggggggnnttttt 

TTTTT AAAAAAAAAI I l l 1 1 1 1 1 I GGGGGGGGGGGGCNCCCCCCCNAAAAAAAAAGNGGG 
I I I I I I I 1 1 I I I I NGGGGNTTTGGGGGNATTTTCCCCCCCCC I I I 1 1 1 1 1 1 1 NNTTCCCC 
CCNNN AAAAAAAAAAAA I I I M I 1 1 1 I I NGGGGGGGGGNNNTCCCCCCCCCCCCCCNTTT 
TTTTTNAAAAAl rTTTTTI I GGGGGGGGGGGGGGNTN I I I I I I M I 1 1 AAAAAAAAAAAA 
NNGNGGGGGGGGGI rTTTTTTTTTTTTTTTI I NCCCCCTATTTTCCCCCCCCTTTTNNCC 
CCTAANAAAAAAAAAAAAGG 
Sequence 1026 

TGGTCACTGGA ACCTGGC CTGGGGATACCCATNNTGGGCTCCTTAAGTTNGGGGGGGGGN 
NTNTTAAAAAAN TTTTTTTGG GGGGGGTTCCCCCCCCNAAAAAAAAACCCCCCCCNNCCC 
CCAAAAAAAAAA 1 111111111 1 AAAAAAAANNGNGGGGGTNNNAAAAAAANTTTTTTTN 
GGGGGGGGGGGNAAAAAAAAAANCCTTTGNGGGGGGGNGTTAAAAAAAAAAGGGGGGGNN 
NGTCCCCCCNTTrTCCCCCCCNTTTTTTTNTTNGNNGTAN 1 1 I I 1 II 1 I GGGGNTTTATT 
AAAAAAAAAAANGGGGGNNGNTANCCCCCCTCTCTCCCCCCC I 111111111 1 GGGGTGN 
ATNTANATNTNNTTCTCNCTNCCCNTTTCNCCCCCCCCANAAAAAAAAAAAAGTGNGGGA 
NNNNGGNN NCAAN NNCTTTTTNTTCCG 

CCCNTN 1 I I I 1 I I \ NGGGGGGGGGGGGGGTTCCCCCCCC 1 1II1III1UI11 I GTNNGG 
NTTNAATTAATCNTTATAl 1 I I I I II I II II 
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Sequence 1027 

CGAGCGGCCGCCCGGGCAGGTACAGAGATAAGATACAAAGATAAAAATGTTGGTATCAGT 
TGATTTTGAAGCAGGTAATTACTGTGCCACCTCACAGTGCTCAAAAATAGTTCAGNNCCT 
GGAATAGGGTTATNTCA GNCCCCATGCAAA ATTTTTTAAAAAAAAAAAAAAGGGGGGGTN 
CCCCCCCNCAAAAA AAAI M I l I I l I I I I I I TGGGGGNNNGGGGNNNNNGGGGGNNTTTT 

nrrrrm ttttttnggggggggngnccccccct i m i m i i r i i cccccccc mini 

TTTTAAAAAA AAAAAAAAAA ANCCCCCCCCCl M HI I I M II .' I IN INGNNCCCCCNN 
CCCCCCCNN I I I I I I I I I I NTNCCCCCAAAAAAAAAAACCCCCCCCAAAAAAAAANTTTT 
I I I I I I I I ICCCCNNCCCCCCCCl rTTTTTTTTTTI I ACTNTNTNNNCCCCCCCNNNCNC 
CCCNNNNNNCCCNTN NGGNC CCCCNTTAANNCCCCl I77TTTTTTTTTTTT1 INANCNNN 
NNCCCCCCCCCCNNNTTTTTTGGI Mill r I ! NNAAAAAAAAAAAACCCCCCCCC 
Sequence 1028 

ccgggcaggtacttatgtgaaaggtaaaaaagatctcatagaagtagagagtggctcatg 
cctctaatctcggngctttgggagtctaaggtgggaggatcgctttgncgggcctgngta 
gt7tttggtnactcaagnctctgnagg7tnncccccnaaaaaaanaaaaaatattccccc 
caaaaaaaaal i i i i i i m i i i i i i i gggntngtngcccccccn m ii i i hi i cccccc 
cnnaaaaaatattanntntgntaaaatttttraaaagngnngaaaa i h ii i ii 

Sequence 1029 

CTCCACCGCGGTGGCGGCCGCCCGGGCAGGTACCACACCGAAAGTAAAGAAAGATAAACA 
GAACATCTTTGAATTTTTTGAGAGACAGACTTTGTTACTTAGGCCTGGAGGCAAATGGTA 
TGATCTGGTAGGTACCAGGAGGTGAACANCTGGAACCAGGATTCACCAAAAAAAANGGGG 
GGGGGGGCCCCCAAAAMNANCCCCCGGGGGGGGGNAAAAAAGGGGGNNGTTAAAAATTN 
AAATTTAAAANGGCCCCCCCCNAAAAAAA I I II I I M I I NNTTTTTTAAAAAAAAAAAGG 
GGGNGNNNNI 1 1 I I INI 1 1 II 1 1 I AAAANCCCCCC I I I I I I I I II I I CCCCCCN 1 1 Mil 
TTTTCCCCCAAAAAAACCCCCGGCCCCCCI M I T I I I I I I ATNTNCCNNNNCCCCCCNNN 
NCCCCNTTNNCCCCNTTTNCCCCAAAG 

I I I I I I I I ICCCCCCCCNI I I I I II I I INTTTGNNNCNNNNANNCCCCCCCCCl I Mill 
TTNNI Willi INNNANCCCCCCCCCCGGGGGGGGGGTGUi mTTTTTl (CCCCCCCC 

Sequence 1030 

TAATTCAGGGGGGATAAGCGCAGGNAAAGNAACCATGNTGGAGNCTANTAAATTGAAAAA 

NNAAAAAAGGGGTTTGGGGGGNTTTTTCCCCCCTTTCCCCCTAAAAAANGGGNGGNNGCC 

CCCCTNTAAMTTTAAAATANNNTAAAAAATTTAAAAATGGGGGNNNTGGGGGGGGGNTT 

TNCCCCNTTTTTNCCCCCCAAAAAAAAANGGGGCGGGGGGGGGGGGNGNNAAAAAAAAAA 

AAAAAAAAATNCCCCCCCCCTCCCCCCCCCGGGGGGGGGNTTTTTTTTTTTTNANAAANT 

GT^WW^TTTTNAAAAAANTNAAAAAAATTTNAAAAAGGGGGGGGNGGGGGGGGGGNTTT 

TTTNNCCCCTTNTTTTCCCCCCCCCGGGGGGGNGNNTATANTCCCNCNNGGGGGGGGGGG 

GNGANNI I I 1 1 1 I I I AN AN I Mill I NGGGGGGGGGGGGGNATTCCCCCCCCCNl I 1 1 I I 

I I 1 1 1 I I I IGGGGGNATTTGGGGAGNTN I I I I I I CCCCCCCCCCCGGG 

Sequence 1031 

CCGCGGTGGCGGCCGAGGTACCACCTGCTCCTCATCTTAGGAGTCTCCTTTTCAAATAAT 
TAGGCTCTGTTCCCATTTTAAMCTCTGATATTGGCCTTCACCTGTGACTGGACACCTTT 
ACNTANGAGGCGCCATGTTTCANCTATAANCAANTATAGAAAAGAAAAAAI III I I I I IN 
NCCCCCNTTTTTTNAAAAAAAGNAAAAAGNGAAAAAN I i I I I I I 1 1 NTN AAAAAGGGAAA 
AAAGGGGAAAAAA I 1 1 I I II 1 1 I 1 1 I I I I I I I I GGGGGGTTTNGGGGN I 1 1 I I I IAAAAA 
AAANAAAAAAAMNGGGGGTTTTTGGGGGGGGNNTTAAAAAAAAAAAAA^ 
TTTTNAAAAAAANAAAAAAAAANCCCCCCCCCCNAAAAAAANNNNAAAANAGGNCCCCCC 
CCGGCCCCCCCCNAAAAAAAAAACCCCCCCGGGGGGGGTNNGTAAAAAAAAANNAAAAAA 
ANNNTNTTNTNGGGGGGGGGGGNTTTTNAAAAAAANTTrN AAAAAANNTNNTAA AAAA^ 
NGGGGGGGGGGGGGTGGGGAAAAAAAAANNNAAAAANN I II I I I I I II I I II I AAAAAAA 
AAAGGGGGGGGGGGGGGGNAAAAAAAAAAAAAGGGGGGGGGGGGGN 1111111111 NNTT 
I 1 1 I 1 I GGGGGGGGGGGGGGGGGGGGG 
Sequence 1032 

AGGTACNCGGGGNCTTAANAACGAACGGNTTGGGCGCGGACTGGTATCCGGGGACTGTGA 
CTTGCAGGGTCCGCCATGGAGCCAGANCAGATGCNTGGAGGGGTACAAACGTCAAGGTTT 
TGTCAAGTAAAAATTCCTTACNTCTGAAAAANGNGGNG I nTTTTTl I NAAAAAAATTNC 
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CCCNNAANCCCCN rTTTTTTTI I TG GGGGG N NTTTCCCCNTN N ATCCCCCCTTTC NCCCC 
CTATGNGTGNAATNGGGGNTATTTTTTTTTNTTTCNCCCCCeNGGGGGGGGGGGGGGGNG 
NGNT NTTTCNTATTN NCCCCCCCGGGGGGGGCCCCNNCNCC J rTTTTTTTTn I CTCCCC 
CCCNT TTTl I I I I I t AAA NAACMGGGNGNTTNGTTAAAAAAAAAAAAAAAAAANCCCCC 
CCCCN i m m m 1 M 1 l AATAAAA AAAGGGG1 1 1 i I 1 I 1 1 I 11 1 II NGGGGGGGGGGGGG 
GGAG GNTTAAN AAAAAAAAAANA 1 i 11 1 1 11 I NTNTTANCCNTTTNTTCCCCCCCCNCTA 
TATTATTTTTTTNTTTNTG AANTAN G NTCCNCC N N N N NTGTTTTTTATTTNG AGG NATTA 
TTATTTGGTNTATTTANCCNCCCCCCCCNCNTTTTTTTTTTTNN 
Sequence 1033 

TGAACTCTGTCCCAGGACCCACTGCCACTGGAACNCTGGGCTGGNGATACCCAGGTGGNC 
CCCTGGTTGGNGGGG TNTAAAA AMTTTTTTTGGGGGGGNGNCCCCNNAAAAANNCCCCC 
NTCCCCCCAAAAAAAN I'll I IN NTAAAAAAAAGGGGGGGGGGAAAAAAAAA H mm 
TGGGGGGGGGNNAAAAA AAAMNNNT TTTTNGGGGGGTTTTTA^ 

TNCCCCCCCGCCCCCCCC I I MINI I GGGGGGTNNNGGGGNTTNGGGGGTNTTNTAAAA 
AAAAANGGGGNGNTTNGCCCCCCCTNNCCCCCC I M I M I M I GGGGGATNNNNNNATTN 
TCCC CCCCNTN CCC CCCCCA AAAAAAAAANGGGGGNNNNNGGGGGGNNNNNNNNTTTTTT 
NNNNi Mill iNN TNTTTTTTNNN NNNCCCCCCCCCC l nTTTTTI 1 GGGGGGGGGGGGA 
NCCCCCCCCCCC IN 1 1 II 1 I 11 NTGGGGTNNNNNGGNNNNNNN I M 1 I I 1 1 1 11 1 I AAA 
AAAAAAAANCCCCCC 
Sequence 1034 

ATGAANNCTGGNC CTGGGGG ATACCANNNGGGGNCCNTCCCCCCCCCCC l I I I I I H NGG 
GGNGGGGGGGGN II Mill IGGGGGGGGGGGGGGGGTAhWW\ANTTTTTTTNGGGAAAA 
ACCCCCCCCNAAAAAAAAANNCCCCCCTNCCCCCCCCAAAAAAAAA A I rTTTTTTTTTTA 
AAAMGGGGGGGGCNCAAAAAAAAAATTTTmGGGGGGGGNGNNAAAAAAAAAAANNNC 
NCCCNGG NGGTTI'TTTAAAAAAAAAANNGGGGNGGNGGNNCCCCCCNNNNCCCCNTrnT 
m i i i l i iGGNAAAAAAAAAAAMTAGGGGGGTNTTTNAAAAAAAAAAAANNGTNTNTTT 
TTTNNNCCCCCNNNTAACCCCCN I I 11 1 I II II I 1 1 II I GGGGGGGNTTNNAAAAAANTT 
ANNNNNN CCCCN TNNNTNNCCCCCCCAAAAAAAAAAAANAGGGNGNNNNGGNNGANNANA 
NNNNNNNTTTTTNNCCCCTNCNNTTN I I I 1 I I I 1 1 NTTTTCCCCCCCCC 1 rTTTTTTTTT 

Sequence 1035 

TCGANGT ACATG GGGGTCCGTGCGGGCAN AACC CAGGGCATGAAGATCCAAANGGGCCTG 
GTTCAGC Mill NTCCAGGGCCATGGCAGCTTTTCATTGGCCTTNTTGAAGGTTTTTTAN 
CCCC CAAC CTACATTC TTTCCAANCT TTGTGTCATTTACCCC I I II N M I I I f I \ GGGG 
GGGG TTTTNAAAAA AAA I lllltll I NGGGNGNNAA'i nTTTTTI I AANNCCCNTTTNCC 
CCCCN i M M M I IGGNGTGNTGTGGNNNTNTTTAAAAAAAAANNAAAAAAAAANGNGGT 
NTNNGTTTAAAAAANGGGGGGNNGTTTAAAAAAAAAANGGGGGGGGGTCTNCCCNCCCNG 
GTGGGGGNTNCCCNAAN GNGNATANTG GGNTTG I I I I I I I I I'CTAATATNTCCCCCCNGN 
GGANGGNGGANNCCCT A \ mTTTTTTTTTi I NCCCC I I rTTTTTTI IA AAAAAAANAGG 
NTGGTTNTTNAAAAAANA TAAAAAAAAAAA TCCCCCCCCCCC' I rTTTTTTTI 1 AATAAAA 
AAAAAATGGGGGNTNCTN 1 rTTTTTTTTT I NGGGANGTTTTTATNGTA I MINI I A AAA 
AAAAAA 
Sequence 1036 

CCGCGGTGGCGGCCGAGGTACCCCCATGCTGTGGCACGGCTTCAGCTGTGATTGGGTTAT 
TATACCCCTGCATTGACAGACATCTAGGAGAACCACATAAATTTAAAAGAGAGTGGTCCA 
GTT GTAATGCC GGTGGTGTAGNCAGTCT7TTGCCTTTTGGGGGGGGGGGGG I I I M I M A 
AAAA l MM! I NTAAAAAGGAAAAAGGGAAAAATN I MUM I NCCCCCCAAAAAAATTT 
TTTTNGGGGGGG GGGCCCCCC CCCCCCCCCAAAAAAAAGGGGNN n MINI INGGGGGG 
NGCCCCCCCCCN 1IH1HU I AAAAGNNAAAAAAANNAAAAAAGGGGGGNN (TTTTTTT 
TTGGGGGGGGGGGGGGGGGNNAAAAAAAAA I I I I I M M I M I M NNNNTTTTNNNCCCC 
CCCCGGGGGGGGGGAAAAAAAAAAAN1 TTTTTTTTTI NAAAAAAAAAAAAAAAAAACCCC 
CCCCCCCNAAAAAAAAAAANAAAAAANAGNCCCCCCCCCCAAAAAAAAA A I rTTTTTTT N 
AAAAAAAAAAAACNCCCCCCCCCCAAAAAAAAAAANGGGGGGGGGGGNNNNTTTTTNNNN 
N NTTTTTI FTNGGGGGGG 
Sequence 1037 

AGGTACAAAAATTAGCTGGGCATGGTGGCGCACAACTGTAGTTCCAGCTACTCAGGAGGA 
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TCAGGCAGGAGAATCACTTGAACCCAGAAGGTGGAGGTTGCAGTTGACCCGTAGATCATT 
GCCCANCTGNCATTTNCCAGTCNCTGGGGGGGGGNNCCCCCGGGGGNGGNAAAAANNCCC 
CCNAAAAANGGGGNGNNAA AAAAAGG GGN i 11111 I 1 1 NGGGGGGTNAAAAAAGGGGGGN 
TTNTNAAAAAA ANCCCCCCC I I 1 1 I 1 1 I I CCCCCCC1 1111111 I GGGGGGGG I I I 1 1 I I 
TTNNCCCCCCCI I I I I I II II I I NCCCCCCNCCCCCCCCCCNAAAAATTTNAAAAANNNN 
NAAAAAANTTTAAAAANTNTAAAAANNTTAAAAAAAATTNAAAAAATTTTNAAAAANNTN 
AA AAA,\NN NNAAAMTTNTNAAAAAANNTTTAAAAAAANNTNNNAAAAAANTNTNAAAAA 
AATTTTTTNAA^^ 

AAAANNN TI 11 1 I 11 1 1 rNNNTTTTNTNNNNNGNAAAAAANNNNNAAAAAAAAANNNTNN 
NAAAAAATTTTTTAAAAAANNGGGGGGGGGGGGG1 I I I I I I I II I 
Sequence 1038 • 

NCACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCCNAGCGGCCGCCCGGGCAGGTA 
CTCAGGGCTTGTGGTTGTCTGCCTTGTGTTTAGACATACCCAGGAATCTCATTACTTTGC 
ATTGAAGACTG I M I IA l I H IGGAATGTTCTGAATGTTTCCAAACTGCTGAACTGTGCC 
TGAAANGGGGNTTTT7NAAAAAA I I I I I I I I I I I I NGGGGGGGAAAAAAAAAAAAAAAAA 
AAI I I I INNCCCCCI I I I II I I I I I I I I NNNCCCCCCI I II I I I I M M M I II I I INCC 
CCCTTTTTTGGGGGGGGGGGGGGGGGNAAAAAAAAAAAAAAAACCCCCCC I I I I I I I I II 
TTTTTTGGGGGGGGGNCCCCCCCAAAAAAAAACCCCCCCN nTTTTTi NCCCCCCCAAAA 
AANl l I I I 1 1 1 I I NNAAAAAAAAAAAAAAAAAAAAACCCCCCGTNCCCCCCCCAAAAAAA 
AAAAACCCCCCCCC I I I I I I I I I NGGGGGGGGGGGN I 1 I I 1 1 I I I I 1 1 1 I 1 1 M M ICCC 

cccccccci 1 1 1 n 1 1 1 1 1 1 1 1 n 1 \ iaaaaaaaaaaaaaccccccccctt 

Sequence 1039 

CGACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGAGA 

CCGGGAGGTGTGCTCTTTTTCTGGGAGCATGTGGCAGAACCATATGGAAGCTGGGCCTTC 

ATGTGGCAGCAAGTTTTCGAGCCCACCTGGAAACACATTGGGGATGGCTGCTGCCTCACC 

AGAGAGACCTGGAAGGATCTTGAGAACGCCCAGTTCTCCGAAATCCAAATGGAACGACAG 

CCCCCTCCCTTGAAGTGGCTACCTGTTGGGCCCCACATCATGGGAAAGGCTGTCAAATAA 

TCTTTCCCAAGCTCCAAGGCACTCATTTGCTCCTTCCCCAGCCTCCAATTAGAAACAAGC 

CATCCACCAGCCTATCT 

Sequence 1040 

CACTACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACGCGGGGGAGCCT 

GCGGAGTTCGAGACCATGCTGCTGTTCTGCCCCGGCTGCGGGAACGGGCTGATCGTGGAG 

GAGGGACAACGCTGCCACCGCTTCGCCTGCAACACGTGCCCCTACGTGCACAACATCACC 

CGCAAGGTAACAAATCGGAAGTACCTGCCCGnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnGAGGGTTAATTGCGCCGCTTGGGCGTAATCATGGG 

TCATAGGnnnnnnnnnnnnnnnnnnnnnnnnnnnCGCTCACAATTCCACACCAACATACC 

GAAGCCCGGGGAGCATTAAAAGTGG 

Sequence 1041 

GACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTCAGTGCAAAAG 

TTAGTTGAACTGGTTCATTCATCTCTATGGTAACAGCTTCCTCCTCTTTATCGACATTAC 

TTGTCTGTGACAATTTAATGTTTCCATTTCCAAGTTCTCCACTTGCAGAAAATTTCACTC 

CGTCTTTTGCACAGGAAATTACAACAGCATCTCCAATATGGCTGAGATCTCGGCATATAC 

GTGCAAATTCACCAGAAGGCATCTTTACTACACAGCTGTACCTGCCCGnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCGTCG 

ACCTCGAGGGGGGGGCCCCGGGTACCCMNCTTTTTGTTCCCTTTAGTGGAGGG 

Sequence 1042 

TGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTATAGCTGTAAGGAGAAGCTGA 

NAAATGATACCCAGGAGCAGCAGGCTTTACGTCTTCAGCCTAAAACCCAAAAAAAAAAAA 

AAAAAAAAAAANGTACTGGACCGGGTTGCTGAAAANACTCACACCCNAATACCCGCGTAC 

CTGCCCG 

Sequence 1043 

CCGCGGTGGCGGCCGCACGAGGATGCTTGGCACGTACAGGATCTTGCCATCACCTGTTCG 
GTCAAACAGCTGGAAGGCCTCCTTGAACTCTGCGGTCTGGTCTTCGGTGAAGTCACACAT 
CTTGACTGCTTAGCTCTCCGGGACCTTTTCCTGCAGTAATGGCCCCCGCGTACCTnnnnn 
nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 
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nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 
nnnnnnnnnnnnnnnnnnnnnnnnNnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 
nnnnnnnnnnnnnnCGAGCCCGGGGAGCATTAAAAGTGGTAAAAGCCTGGGGGTGCCT 
Sequence 1044 

GNGGCAGCAGTCACTGTGGCNTAAATTTCTAAACTTACATANCNNATCNTGTAATAGCAA 
AC CTTAA AAAGG GC AAGGN AN AGAG G ACTTAAAAATGAAN AAN AAAAG NG ATAGGGAGAA 
TGAAGAAi Mill I AAAAGCTGTTTTAGATCAGGGTAAGACCGNAGGCTTAAACAAAAAA 
AAAAAATAAAAANGTNCCT 
Sequence 1045 

GACTN CTTAG GGCAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTGCTTTTGCCGCT 

TCTGGTTTTTGCAGACATCCACTACTCCCCAGCTGATTACACCAACTTGAATGAAACGAC 

TTCTCTTGTGAACTATGAAGGGGCCGCCAGAATCACCTCTGCAAGTATTGGGGTCAGCAT 

AGGGACTCACTCCTTCCAGTACCTGCCCGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCGTCGACCTCGAGGGGGGGGCCCCG 

GGTACCCAAGCTTTTTGTTCCCTTTTAAGTGAGGGTTAAATTGCGCCGCTTGGCGTAATC 

ATGGGTCATAAGCTGTTTCC 

Sequence 1046 

CACCCTTACGACTCCTATAGGGCGTTTTGGAGCTCCCCNCGGTGGCGGCCGAGGTACGCT 

GGATAGCCTCCAGGCCAGAAAGAGAGAGTAGCGCGAGCACAGCTAAGGCCACGGAGCGAG 

ACATCTCGGCCCGAATGCTGTCAGCTTCAGGAATGCCCCCCGCGTACGCGGGGCCCGCGG 

AGTATACACCATGAGCAMGCTTCCCCCTCCCGAAGTTGAAAAAATTTATGGACAAGAAG 

TTATCATTGAAATTAAATGGTGGCAGACATGTCCAAGGAATATTGCGGGGATTTGATCCC 

TTTATGAACCTTGTGATAGATGAATGTGTGGAGATGGCGACTAGTGGACAACAGAACAAT 

ATTGGAATGGGTGGTAATACCGAGGAAATAGTATCATTCATGTTAGAAGCCTTGGAAACG 

AGTATAAATAATGGGCTGTTCAAGCAGAAGA 

Sequence 1047 

GGCCGAGGTACATTCCCATNCATTGCCTTCTTNTTTNACCNCTTTTNTTTGTAAANCTNT 

G NMG ATGC AAAGAGGTN GGATCAAGTTN AAATGACTGN GCTGTCCCTTTCAC ATC AAAG 

AACTACTGACAACGAAGGCCGCGCCTGCCTTTCCCAANNCGGCCAANCTATCTAGGCTGG 

GCAGGGGAAGGAAAGAACTTGCATGTTGGNGAAGGAAGAAGTGGGGNGGAAGAAGTGGGG 

NGGGACGACAGTGAAATCTANAGTAAAACCAAGCTGGCCCAAGGTTGTCCNGCAGGCTGC 

AATGCAGNTTN AATCAGA GNGCCATTTTCTTTTNTGGNNCAAANGACATTAATTATTTGG 

GAMGCACAAATTTTTTTNTTAAAATATGGNCAAAATAAA 

TCCAAAAAAAAAA 

Sequence 1048 

CCGCGGTGGCGGCCGAGGTACTGTGGGCTGGAAATCCCAAATCTAGATTCCAGCAGAGTT 
GGTTCTTTCTGAGGTCTGCAAGGAAGGGCTCTGTTCCATGCCTCTCTCCTTGGCTTGTAG 
GAGGCATCTTGTCCCTATGACTCTTCACATTGTCTTTATGTACCTGCCCnnnnnnnn n nn 
nnnnnnnnnnnnGATCCCCCGGGCTGCANGAATTCNNnnnnnnnnnnnnnnnnnnnnnnn 
nnnnnnnnnnnnnnnGCCCGGTACCCAGNTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 
nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnNCTGTGTGAAATTGTTATCNCGCTCACA 
ATTTCCACACAACATTACNGAGCCCGGGAAGCATAAAGTTGTTAAAGCCTGGGGT 
Sequence 1049 

AGGTGTCGACCCACNCGTCCGGCTACTTACTGGCTTGCTTTTGTTTGCCCTGCCTTGCTC 

AGCTTGCTTT TTTAT AGAACCCAAGACTACCAGCCTAGGGCTGGCACCATCCACAATGGG 

CCCTCCCACCTTTTGATCACTAATTTGAANAAAATGCCNTTACAGCNTGGATTCTCATGA 

AGGCATTTCCTCAAGGGGAGGCTCNCTTTTTGAAAGAAC 

Sequence 1050 

AGGTACGTTTTCAGAGATGTGTTGACTGAAGTAACCTTCAGCGGGCCTGAGTAAACCCAC 

AGCCAGTATCC AGCC CTGCCTGGGAGTCCCTGAACTGGGAAANTATNNTGTGATTTCTTT 

CACCTGGTGGCTTTTAGNTTATTTAAGGCCGTTNTTGGGAGGAAGGGCNCATTGGCCTNA 

NNCCATCTTTAAMGAMGTrTCTTTTCTCCCCTTTTMCNCCANCAATTANAAGGGTAT 

TTTCCTNNTGGTAGNNCCANGNAACCNCANCNCCCCCTTTCTTTTGGNAAGAAACNTTTT 

NCAACCCNTTCCCAA l MUM) I I NGGAANCCAAAACCANCCTCTAACNCCGGGGNGNN 

CNAA NGTAANNCAAAAATTC TGGGGG TTNTTAATTTGGGTTTITAAAGGGGGTTC 

TTTTCCANTNGGNGGNAAGATTTTTTGGGGGNGGGNNAAAAGGTAANANTTTTANGGAAA 
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A 

Sequence 1051 

ACAC TACTTAGGG CGAAT TGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCCAACATATAT 

TTTTACGAGGGTTATTTTGATCAGCTTGTGTAGTCAGTGCCAMGTATTTAGCTGGTAGC 

AGAAAAAGGAAAAGTATTCTCCATCTGTGATCACAGCCTGGGAGACAAAAGGTCGAGTAA 

CATCTGCTTCACTCCCCGCGTACCTGCCC 

Sequence 1052 

CGGGCAGGTGGCCGCCCGGGCAGGTACTGCTAGTGCTATCGGTTGAATAAATAGGCTGAT 
AGTTTCGATGATGATTAAGTATGGGAATTAGTGAGATGGGGGTTCCTTTCGGTNANGAAG 
TGGGCTAA 
Sequence 1053 

CGGGCAGGTACAGATACTTGMGCAATTCTGCAAAGACCCTGAGTTTTCACAGGGCTGGT 

TTTGGCTATCGTAAAAAAAAAMCAMNACACGCACACACACACCTATAGTTTTCCTGCT 

TGCCATAGGATCTCTCTTGGGAGATGGACAACCCTNAAAGGCACTGATTTGGTTGACATT 

GGTAGCTCTGGTCCTTACTCAGGCAGCCAGCTCATCCAAGGCCCGGTCCAAGGGATCCAC 

TGACCAGTAGTCATTATCCCAGCTCGAGGAACCAGCCCACAAAGGAAGGAGGCCTNAAAA 

GCCTTTGCCTTAACCGACAGTTGATGGGGGGTCC 

Sequence 1054 

GGCCGCCCGGGCAGGTCGCTCGTGATCTAGATAGTGAGCGGACGCGTGGGTCGACTCAAG 

ACTTTTAMGATTFATCAAAATTTGGTGAGCCGAATCTCAGAAMTTGGTGMTTCGGTT 

AGTTCCCAATAGGCCGCAGTAGTMTAAGTGGAGTGTTCGGGATATANAAAAATTTTCAC 

GAATGGAATNAAAATTTCTAGGCGAATTCAGTGAGTTCCCAATGGGACATGATTGTATGA 

GTGAGCTTTTTCAATATATAGCAACATTTTTAGGTTCANAAAACTTAAAG 

Sequence 1055 

CGGCAGGTCTTGAGTCGACCCACGCGTCCGCAAAATTCTAGCCAGAATAGCCAGACAAAA 
AAAGATCTAGGGGATACAGAAAGGCAAAAGGAAGTCAAAATATCACTATTTACATATGAT 
ATGATAGTATATTTCAGTGATCCCAAAATTTCCAGNGGAGAACTCCTAAACCTGATAAAC 
AC CTTCAG C AAAATGGCTGTGTAAAAAATTAACTCAAATAAATC AGTAGTCTTC ATCTAC 
AGAAAAGACAAACNAGCCAAGGAAAGGAGATTAGCGAAAAGACACCCTTCATAATAG 
Sequence 1056 

AGGTACCATGTTTCACGTTAAAATGCCAAAATGTGGTGCCTTCAAAGAAACAGTCAATGA 

AACAGAAAGCAAAAATAAGCAGAAAATTAGAAACGTTATTTGGCATCATGAAGGGCAACA 

CCAAATTTCACTACTAACTGCAAGGATTGATAAACACCATGATTTCAGATTTGAAA^ 

ATGGAAAACTGCTAATGCACAATGTTAAGAGATCTCCATGAGAACCAGGCATTAATCCCA 

TATAGCMTGATTTAATATTGTTACCAATTTCAGGAAAATGATTTTTTGATAAATG 

ATTGTAAGATATACCTTTTTATTTCTTAGGAGCATGTGACCTGCATACGTAAACAG 

Sequence 1057 

AGGTACAGAGCCAGCCAGTGTTGGGCAGCAGGCTCACAGCCTCAATAGGGAGAAAAGACA 

AAGGCCTCAAAATGACAGGCAGCCTGACAGAGGAAGGAGTCTGACACCTCAGCTTGAGGC 

GTCTTTGGMTTCCTAGCTCATCTCAGAATTATATCTTAGAGTGATAATATGGGGTGGTA 

GCCAGTGGCCAAACAGCAAGAACTAAGAGTGGGCCCTTGCAAAAAAAGGTTGGGAAAGCT 

GGGCCCATATTGCCTGGTAMCCCTTGAGCCTGATGCTCATACAGCTGTCCCTTGTTTTA 

GCCAGGTCTTGACAGAAGGGTTACCAGCA 

Sequence 1058 

AGGTACAAAAAGTCAAGCCCTGAAGTAGTTTTTCCCCCAAAAGTTCTGATGTTAAAACTT 

TAAAAAAAAAAAAGAGAGAGGTAGTGTGTAAAAATATTTTTACTTAAAATGTTGAACCCC 

TGCACCACTTATATTATAAATGAAAGTTAACCTTCTACATACTAACAT^ATTTTGCCATT 

CMGAGTCAAATTTGTAAGACTCACAGAATGTTAAAGCCGGAATGCCATTAAAAATAGCA 

TATAAATGGAAGAACTTCGGTGAATTCAACAAAAAGTACCTCGGC 

Sequence 1059 

CCGGGCAGGTACTCAGTATAAAGTCCAGATGCCTTTACCATGACCTGCAAGGTTATACGT 

GGTCTGACCCATGCATTTCTCCGTGACCTCATCTTATACTCCAGGTTTCTCTGTTTTACT 

CATTCTTTTCCAGCCACACTAAACTTGCCGATCTTAGAATATGTTGAGTTCATTCTAGCC 
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CCAGGTCTTCAAATTTCCTCCTCCTTTCCTGAAATAGTCTTCTCCCAGATTTTGGTGTGG 

CTCTGTACCT 

Sequence 1060 

CCGGGCAGGTACGNTGGGGGGAGGATTGCATTCAGTNTAGTTCCTGGTTTTNGGCTGAAA 

TAACCTGACCGAGAGCATCACCCCCTGAACATGGACTTGCAGAATTCCACAGAAGAGAGG 

AGACTGGCCTAGACAGACAGCCAGAGCTGAGGGCCCAACAGGCTTTCTACCCTGGATGCT 

GCTCCCATGCCCTGACACGAGGCCCACTACAATGCTCTCCAAAATGTCCACAAAAGTGAC 

TTAAGTCAGGTTCCCCCAAACCAGACACCAAGACAAGAATCCATGTGTGTGTGACTGAAG 

GAAGTGCTGGGAGAGCCCCAGCTGCAGCCTGGATG TG AACTGCAACTCCAAAAGTGTGTC 

CAGACTCAAGGCAAGGGCACTAGGCTTTCCAGACCTCTACTAAGTCATTGATCCAGCACT 

GCCCTGCCAGGACATAAATCCCTGGCACCTCTTGCTCTCTGCAAAGGAGGGCAAAGCAAC 

TTCAGGAGCCCTTGGGAGTCCTCCAAAAAGAGTCTAGGGTACCTN 

Sequence 1061 

CGAGGTACCTTCATGCTCTAAATCAGATGATCTACTATCTGAAAAGGAAACGACAAATCT 
GACAACAAANGAATTTCACAACAGATAGGCAGTTGATAGCATGAGGCACTAACATTAAAC 
CCAAGTCTTTCAATGGCACTTGGAGTCCCAGGGTCTGCCCG 
Sequence 1062 

CTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAATTATAGCGACATACAC 

AGAGAATGCAGTTAAAAACTCTACCTGTGATACATGAAAGCTTCAGGGGGCTATATATAG 

CCAACAGTAATTGACTATTACACMGAACCAAAGAATTGGACCTTGAAAGGAAATATTCT 

ATCTGGAGATTAACACAAACTACTTAGTAACAAAAATGAACTATAACAAACATTAAGGAT 

MGTAGTCTTCTGTGGTAATAAMCTATCATTTTGAAAGGACAAAMGATTATTTTTAAC 

CCTGCTTAACTNTTATGNNGTMCTACATTCCAATTTAACAGCATTTAAATTAAAGA^ 

GG 

Sequence 1063 

GACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGCACACCAGGAG 

AGAAGATCATTATTAATGTGCTAATAGCAGCATTTTATTTTGAAACCCACTCTGCATGGT 

TACAGGGCTCAAAACAACATATTCTAACAGGAAGATACATTACCGAAATATTTTAATGAG 

AATATTTAATATGCATTGAGAGGTCCGCATTTTCTTGCAGAGACCTTGTAGGTAGCTCTT 

TGAGATTTCTGTCTCTATGCATTTAAGTGAAGGAGTTGGTTGGGTATTTTAGTTGGCAAA 

TTTTGCAGACATGTAGCTTTGGTAGTGGAGGGGGTAATAGTTACNCTGCCCGGGCCGGCC 

CGCTCTTNGGAANCTAGTGNGNATCCCCCGGGGCTG 

Sequence 1064 

TATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACCCCTTTTAAGT 
TTGCTTGGCTTTCTCAGATTCTGGTTTCAATTAACATGCTTATGATTTTCTGGGTTTG 
GGGCAGATTAGTAGGCTTTGGTGTGGC I Ml! I CTTTCTTCCTTTCTGCCTTTACGCCGT 
TTGGGTTGATGCATCTGTGTACCT 
Sequence 1066 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTGAACTAGCTCCTTCTGGTTAATTTGTTGATT 
GGATTGGAAATTAGAACATGGAGCTGGTCAATGCACGGTATCTGGTAATTGTGGATGGGG 
GAGATGACTGGGGCAGAACTGAGCTCATTTTTGCCAACATAGTACCT 
Sequence 1066 

CCGCGGTGGCGGCCGAGGTACCTCTGTGGAAATCCCTTCCTTCCAGAAGCCTCCACTCAC 

TTCTGATGCCAAGGAGCTCTGCGGCGCCCTGCACGCACCTTTACAGATGCAGGTGGCTGT 

TTCCTGTGTCAGACTGCAAGCTCCCGTGAGCTGGGTTCATTGCTCATCGTTGACTTGGCC 

TCCCCGCGTACCTGCCCG 

Sequence 1067 

CCGCGGTGGCGGCCGGTGCCACACTGGGCCAGGGTAGTCATCCTGAAATACATGTCCAGG 
GTCTTGTTTGTCTATGATGTGGGTGAAAGCTGCCTCAGCCCGCACCACAGTAGAGAGCGG 
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GACCACCTCACGAAAGTTTTAGCAAACTCCCAGGGTCTAACCTGAAAGCAGCCAGGAACA 
AAGACCTTTCCAGAAAGAAGGACATGAACAAACGCTTAAAGAACGACCTGGGCTGCCAGG 
GTAAGAACCCTCAGGAGGCCGAGAGTTACTGTGCACAGTACCT 
Sequence 1068 

CTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGCTCTTGTTAATTCCATTCAATAAA 

CACACACAAAGCCCACTCTCCTTCCTTCAGACCCTATTAATTCTAGTGTAAGAGAGATGG 

GTGATGTTCCACAGAAGAAGCTAAGTTGGACTTTGAAGGATAAAGAGACGTATGGGAGGA 

AAAGGCAGGAGGTAGAATTCTAGCCACAGGAAGAACCAGGTGAGAATAAGCTTGAAGATG 

TTTAGGGGAGAAGGAGTTGTCCCTTAGCCTTGATCCTTAGAAACAGACGTAGAGGTAGAG 

GAAGCTGTGGCTGCAAAGGCAGTAGGCAAGAAAGTGAAGCCAAACGCCACCCAAGTGGCA 

CTCTCATGCCCCTCTCTAATGTCAGGATATGTTTACAGCCAGCATGATTTCCATATACAT 

TCCTCCTTCACTTC 

Sequence 1069 

AGGTACTCACTTATAGATGGTTATTGGCCAAAAATACAGACACCATGCTATTATCCACAG 

ACTCAGAAAAGCTACACAAGAAGGAATGCACAAGTGAGGATGCCTGAATCTCACTTAAAG 

GAGTGCATAAAGCAGTTCCTAAGAGGAA.GACGGAGGGAGTGAACTGAATAGAGAAGAGAT 

ACGGAGGGCTATGGGATGGGTTTGGAATCAG 

Sequence 1070 

CTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATACGCGGATTTGT 

GTGTGTGTATCTATGAGAGGGGGAATTAACAAATTACATGGATTTGTGTATGTGTGTGTG 

CCTAAGAGACAAGGAAGGAAAACAGTGCAGTTGAGAAAGGTGAAGCAAATGTGGCCAAGT 

ACCTGCCCG 

Sequence 1071 

AGGTACI rTTTTTTTTTTTn ICCTGAGGGAGATGGATAGGTGAAGTAATGGTTGACAAG 

CATAAGTGGGAAGCTTCAAGTCCCAGGTTCTCCCATCAAATCATTTTCTCACAATCTTGC 

TCAGTATTCTCTTCTAGTAGGCCANAATTTGAAGGATTAGCTCTCTGTATATCTAATCTA 

AATCCTACTCCTTGTTTTAAGCTCTTTGATTTCAAATATTAATGATCTGGTACAGCATCG 

CTGGTGGTTTCAAAAAACGTAGTCATTCTTCTCACTGCAACAATGTAAGATAAGCAGGGT 

AGATCTGTTATTTCCAAATTAAAGGTGATTAAGATATATGGAGAGAGAACATGGCATGTT 

GAGGTTTATAGGGCTAGAA 

Sequence 1072 

AGGTGCTAGCTTGAGTCGACCCACGCGTCCGCTCAAGGAGAAGGAAGACCAAAGTATGCA 

GGAAGGACCCTAGGGCCTTTCACCAGCTTACATCCACCCACCTTGGGCAAGGATAGGTTA 

GATTCCAGTTTCCTGCCTCCCGCCATGGCTAGGCATTCTCAATCCACAGGAACATTCTTG 

AGTAGAACATTCCCAGAATGATCTGACTGATGCTTGCTAGCCAATAGATTTAAAGGTTAA 

TATGCTTAGCCTATAAGTTAGAAATGTAACCTTTCTGAGGTAACCTGTGCCCTTAAAAAG 

TATAAG 

Sequence 1073 

AGGTCACGCGTCCGCTGCTGGGCTGAACCAACCAAAGATGGAGTGGCAGAACCCTGGGAG 

CTGATCGGGGTTCTTTCTAGTAAGTAGTTGCCAGGCCTTAGAACTGATTGGTGTCACCTA 

CTTCAGTGTTCATTCATTTCAATGACAAGATCAAAATCAGATTTTACAGGATGACCAAGA 

TTCTGAGTTTCTCCCATGGCAACAAGTTCTCAGGTTAAAGTCTAGGAATGGTCATACTTG 

CAGCTTGTTCTTCCAGAAGACCTGACTTTGGTTCCCAGAACCCATGTTGGGTGGCTCACA 

ACCATCTATAACTCCAGCTCCCGAGAGCCAATGCCTCTGACCCCCAAGAGTGCCTGCACT 

CATGTGTATATATCCCATAATTAAAAATAATAAAATAAA 

Sequence 1074 

CGGACAGGTTGCAGAAAGGGAAGAAATGCCAGAAGGAACCAATGAAGGATGAGCCCAGCT 

TCCAAATTTTGATGAAAGCTTTCGTCTCTACATCAGAGAAATTCTGCAAGATCTTAGAAG 

GATGCGCTCAGAGAGATTCACACCTAGAAACACCATACTCAAAGGGTAAGGAGAAGAGAG 
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CCCCAGGGAAGAATACAGAAGGGGACACATGACTGGCCACACAGAAAGCTCATCAGGAAG 
AAAACATGACTCCCTGTCAGAAACCCAGGGTCCAGAAGGAGTGGGATGACGTTTGCTAAG 
TTATAAAAGCGAGCTGTGCACCCAGATTCTATAC 
Sequence 1075 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCCCAATCATTTGCCTGCATCCTTCTCTTCTCA 
ACAGACTCAAGAAGTAGAGACGCTGACTTTTTTGTCTTTGTATCCCTAGGACTTAGCACT 
GCCACCAAATACAACCAMGTTTGCTGAGCTGACATGATTAAGAGCCTATCTGTGGAAAA 
GTATG ATC GGTCTTTGTGMGTTGC AATCTCTC AG AGACTTGTTTTGTGCCTTTAC AAAT 
AATAGAI I Mill 

Sequence 1076 

CCGGGCAGGTACTACTGGCACTGAGCCAATGTATGCTATCAAGGAAAGCTTTATCTGTCA 
CTGAGCAAAGGGTGAAGTTCAATTAGGTCAGTTTTATCACTTCTTTTCCTACACACAAAC 
TATGAGGAGAGATCATTTCTTTTCTTTCTTTTATTTATTTA I MINI I GAGACAGGGTC 
CCACTCTGTCGCCCAGGTTAGAGTACCT 
Sequence 1077 

CGGN CAG GTGCTTG AGTCGACCCACGC GTCCGCACTAAGATTNTAAGTGAATAC C ACC AA 

ACCCAGGAATAAACTGA I I II I I I I I CCCTCTTAACTGGAAAATAACCTGAATGTNTACN 

CAGNTAAAAAATGAGATTGGNGGGAACCNTAAACGACCCGGGCTGNTGTTTGCTCTCCAC 

AACCTGATGGCAAGGCCCTGTTGNTGGAAATAACACCCACACGGTTNACTGAACATGGGA 

AGTGGAGCTGGTGCNTTTNTAGAGTCTTNAACCCCATTGGNTGGCGTTCTTGGAACCAGA 

GGTAGCCTGCATGC 

Sequence 1078 

AGGTACCCAGAAGAAGGTGCAGATATCTGATTGAAACTG7TCATCTTACCAACTCTTCCT 

ATTGCAGAGAAGACTAAGGTCCAGAGAGGGATAGTGACTTTCCCAGGTCACACAGGAAGC 

CACAAGAGCACATTGAAGCCAGATACACATTGTCAAAGCACGCATGTCCAGCCGAAGCAA 

GGACCCGAAATGCTGATTGTTTCCCCTGCAAGGAGGATCCTGAGTTTGTCCCCGGGAAAG 

GTTTAAGAGAACTGGCAGGMCCAGCCTTCATTCTTCTAGGTCCGGTTACCCTTG 

Sequence 1079 

CCGCGGTGGCGGCCGCCCGGGCAGGTACACACCACCACCTCTGGATAATTTTACTTT7TT 

TATTTGAGACAGAGTTTCGCTCTGGTTGCCCAGGCTGGAGTGCAATGGCGCGATCTCAGC 

CCGGGTGACAGAGCGAGACTCTGTCTCAAAAAAATAAAAAATAAAAAATAAAAAACAGCA 

TTCCTTGGGAGGCTAAGCGATCCTCCTGCTTCAGGCTCCCAAAGTGCTGAGATTACAGGC 

GTTGAGCCACTGCACCTGGCCTCAACTATTTTTTTAAATCATGTCTGAATCTTAATTCTC 

CTAGGGATACTTATTCCCCAAACTAACACAAGTTGAGTGACGGGATGGGCCCACTACATT 

TANTAAAAAGAAAGGTACCTCGGCCCGCTCTAAAAACTNGTNGGATCC 

Sequence 1080 

AGGTACAGCTAGGACAAACCTTGACTCCGGGTCGAGCGCTCATTGTATAACATGCAGTGG 

CATGGCCCGTGGTTAGGAGGAGTCTGAAATCAGATTGCTGAGGGTTCCGGATCTGGTTCT 

ACCACTCACAAGTGACTTCGTAGGCCTTTCTTATATTCTGATTTGTTATATCTGAACTGG 

AGCTTACACATGCACCAGAATCCTACAGGGATACTGTGGGGGACACACAGCTTGAGGTAA 

AGTGGTTACCAAGAGGCCAGGCAGTTGGGGGATCACCCATATCACAATAATTTGTCATTG 

AGCATGTAGTTTCTCTCTTCATGGGATGACA 

Sequence 1081 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGAGCTAACACTTATCC 
TCCAGTGCTCTACTAATTGCTTGGTGCTATGTTGACTACTGGGAATGAACCACAAGATAC 
" ATCACTGTAAACTGCAGGGAGNGCTTCACTGATCATGAGGTNAAAATCTCCCCATCATCA 
NAAAGAGAATTCTTTACTTAACTTAGTGGTACCTGCCCG 
Sequence 1082 

AGGTACACATTTTTTGTTGCAAATAGTCCTGCAGCTGGTTTTTCTAAAAAAAAAACCCCA 
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AGAATACATATAAATCTGAGTTCCCTATGGCGCTAGTNTACTGGGCAGACTTCTGGTGTG 
GGGTTCTTTCTTTNTTTTAATAGCCTTCTGAAAACCTCANTGCCACCAAATGCATGGATC 
GTAGCACCTAGTTCTAAGTGTGTGCTATGCTTTGGTCTAATATCCAAAGAGATGTCTAAA 
GCAANTGTGCTTG CTTTTC ATG A 
Sequence 1083 

TTAGGGCAATTGNAGCTCCCCGCGGTGGCGGCCGAGGTACAAGTTGGTCTCAAACTCCTG 

AGCTCAAGTGATCCTCCCACCTTGGTCTCCCAGGCGTAAGCACTGCACCTGGCCTTAGAA 

ACCTCTTTAAAAGTGGTTGTGGTATCCTCATAGAATCTAGGAATCCATTACTCAAAAAAG 

CAAGACTGCAATGATACACAGTACCTGCCCG 

Sequence 1084 

AGGTCGACCCACGCGTCCGTGTATCTCCTACAGTTCATTATGCAGCAGACACGGATGTAA 
ATGGGGACAAANATGAAAATACCNACTCAGACTCTTTGTGAGACGTTCANCATTTTTTCT 
GAAATGGCGCAATGTATTCATTTTCCA 
Sequence 1085 

ACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTCTACA 
GTGAMCTGTTTCTACTGTGGCTTMTTTGCAAMTCCAAGCAGATATTACTmAAAM 
TCCAAC TATCTCTTTCAAAGGATTTGTAAAGTTGCTTACAGACATAAG GTGAC ATC AGAA 
AAGCATGTTGGTATMTGCMTTAGCTM 

TTAAAGAAGGCATANATAAGTGCAAGTCCTACAAGTAATGAACTGTCCTAGAAGCAAAGA 
AATTCATTAAATAGATCTCCTTGATTTCTCCTTCATGGCCCCATGTCCATAGAGATTGGG 
GAATGTACCT 
Sequence 1086 

CGGGCAGGTACTGACTTAAAATAACAGAGTAACCTGTCTGGCTTACTTGTCTCTGAAAGT 

GAATGTTCAAAGTGGGTGGAATNGAAATCACATATTAACGATAGTCCATACTTATATGCA 

MGCCATATGATTCCCTAAGCTACTTAAACTGGMTAAAGACACTAATAAAAATTTCATA 

AAACATTGAACCTGGTGCCAGCCTCACATTAAACCCATAGGTTCTTCACGATTAAGGCTG 

ACMTTTTAGCTACTGCTTAGAMGTTGCAAACACAAATAGGGGGGAGTTTAATGTTCCA 

TCAGMATTMCCCAANCTGGTTTTGAAACTCTACTTGGCACTTA 

Sequence 1087 

AGGTCTTGAGTCGACCCACGCGTCCGCCTAGGTCTAAGAGCTGAAGCTATAACTGCTAAG 

AGGCCTGGGACTCTGTGACACTGAAGTTCAGCAANTGTCTTAANAAACAANAACAGNAGG 

GGCTTGGAGAGATGGCTCAGNNGGTTAAGAGCACTGACTGCTCTTCCTGAGGTCCTGAGT 

TCAAATTCCAGCAACCACATGGTGGCTCACAACCATCTGTAACGATATCCGATACCCTCA 

TCTGGTGTNGTCTGAAGACAGCTACNTACATAGAATAAATCTTAAAAAAAAACAAAAACA 

AAAACAGAGAMTAGGAAAAGGAATGGGGGAAGAAAGGGAAG 

Sequence 1088 

CGGCAGGTCTTGAGTCGACCCACGCGTCCGCATTTATTGTGGGCCTTACTTCATGCCTTG 
CTAGGCAANTAAAGATACTTTMNACAACCTATTN 

AAAGAACCTTATTTTATTGGGGTGNTTGGGNCAGGGTTAAACGNGCCAGGAAAATNGGGA 

GGNAAACCATGNAAAAAGNAATTMNCNATTTCCACCCATATTTTGGGGGGTTCCAAGAT 

GGGANAAATTGGAAAATTGGTTTTAAGTAAGNMTTTCNGGGGTNGGTTCCTTGGCCCAN 

ATGGCCTTTTAGGGGGTNNGGAAAAATTAACC 1 1 R I GGGGTTTGGGAACCCTTTTNGGA 

AGGNAAGGGGGAATTTGCAAANGNGGMGNGGGGGGCCTGTTTTTTCTTTGGGGTAAGGC 

CAAAAAANGNGGGGNAATTTGNGGTTTTTTGGCCAATTC 

CCNNAGGNACCCGGGGGANAATGGAANGGGGGGAAAAANGAACCANTGGTANGTCCCAAN 

NGGNAAAGGNAAAANNGGGNAAAGNGGGGGNG 

Sequence 1089 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCATGACACAAAGTAACTTCTCAT 
GGCTGGATGTGATGGAGGATATCCAAGCACCTGGCAGGCATTTGTAGAGATCCCCGCTAA 
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AGCCCCCTTCAGCTATAGGATTCTATGATTCTTTTCCTGTGAAAATTTGGAACACCGTGA 
TAGTGTANTTACCCTCTCCAGATCATAGGGGCACCATTTAATAACTTACTACATTAATGT 
ATTAAATTGTACCTGCCCG 
Sequence 1090 

AGGTACGCGGGGAAGCAATGACGTGGGGAGAGAGCGCGGAGGAAGGGAGAGATCCAGAAA 
GGTAGATTCTTCTGTGTAGGTGAGTGGATGAGATAATGAAGCAAGTTAATTTGGGCTCTG 
CTGACTTATTTATCTGTCCCGGGCACTTTTCTCCGTACCTTGCCCGGNCCGGC 
Sequence 1091 

CGGCAGGTACAGTAGAGACACCATCTCTGTTAGGAAACTGTTAGTGGCTACCATGTGTCC 

CTAAACTCCCTAGCTTGGTAGTCATAGCCTCACGTTCTGCTTCCAGTTACTTACCTGTCA 

ATCCTTTTCTTCCTACTGATCCCTAGTGCACTTTTAACACATCCTTATCCTCTTTGTCTA 

TGTCTGTTTGTCCTATTCTCTGCACCTGCCTTTCTCTCCCCTAGTCCTACTCCCGCATAC 

CTTAAATGCTGCCAATAACTTGATTATGGCTTAATTACTTAATTATGCCAACTGAAAGGG 

GTCTACCGTGTTAAAAGGGCTTTGGCCCCTA 

Sequence 1092 

NGATTACCCTNGTNCGCCCTTTTTCTTCCCTTTTCGNGGTAAAGCCGGTGGGNCGCCTTT 

TTCTTTCANTAAGNCTTCAACAGCCTNGATATGGGTNATTNCTTNCAANTTCTCNGTGTN 

GATMGTGGTTCTGTTTTTCNGNCTTCCCCAAATGACCTTGGGAGGCCTTGGTTGGTAGT 

CCAACCGAANACCCCCCCCACANGGTTTACAANNCCCCCGNACNCCGGCTTTGTCGCCTC 

TTTAATTTNCNGNNTTAAANCTTATTCCGGTTACTTTGTAGGTTTCCCAAACCCCCGNGN 

TTATAAGTAACCAACCNGAANCCTTTTAATTTCCGGCC 

Sequence 1093 

AGGTACTCTTCTGGTTGCCTTGACTCTAGTCCTGATCAGAGTGCCCAGCAAAGAACATTA 

TACGCACTGTCCCGGCACTGTCCTGTGACATAAAAAGCAATTGACCTGATATCAGATTCC 

CTGGGATAGGACCTGAAAAAGAGTTGAGGGCAGGCTCTGCATTTTGTCCTTTTCTGATGC 

AAATG AGAAC G AAATAATTTTTTTC AACTTACTTTATCCTATGGTG AGTAGAAGCC ACC A 

TGTTTCCTGTGGAAACCAGGGCGGTGGCATGGCTTCTGCTTCTGTCTGAGAGAACATTGC 

TTTCCTGGGACTTATCCCCCTTTTA 

Sequence 1094 

GGCCGCCCGGGCAGGTGCAGAACGCTGGAGAGGGTGCAGTCTCTGTGAGGGTGAGCGGTG 

TGGTGCCCTGGAAGGACTCCTTCCCACCAGTCACTAAGCAGCAGAGGCCAGGCAATCACA 

GGCTAGGAGCCTTCTGACCCAGGAATAGGCATTCCCGCATCCTTAGACCACACAATGCAT 

GCCCACTGTGTCACTGCAGAAGAAAGGATAAAGGTGGCCATGGCTAGCTGGCCATGCAAG 

GCACTCACCAGCGTTAATGAGACCTGGCCGGNCAAGGCAGTTACTCTGATGGTAACTAGG 

GCTTGTCTCCTTGCCATGAGAAGAGTTAGGGACACAC 

Sequence 1095 

CTGAGTGGCTGGGACTACAGGTATGCGCCATACCTAGCTAATC l I I I IGT Al i I U M GG 

TAGAGAGAAGTTTTCGCCTTGTTGCCCAGGCTGGTCCGAACTCCTGAGCTCAAGTGATCT 

GCCTGCCTCAACCTCCCAACCTGCTAGGATTACAGGTATGAGCCACCCTGCCTGGCCAGA 

TTTAATGAAACTTTATTAATCACTTAGTTNTGCTAATTACTATCTTTCCATAATGTTT^ 

AATACAACGCTATTGTAGTTAATACCGTTG I I I I I GCGGTTATTTGCTTACATGTCTGTT 

TCTGCTACTGAATTGTAAATCCCTTGAATATAGGGAAAATATATTTGTATCCCTACCATC 

TGAAAAGAGTACCTTCGGNCGCTCTANAACTAGTGGATCCCC 

Sequence 1096 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTAGCCCTGCTTGTCTG 

CAGATCCAGAGAGCCTTAATTGACAGATTCAACTGTGCCTATTCCTGAACCAATTCTGAT 

CAAGAAAGTGGTCCCACCAGTCAGTGCTGACAGGAATTAGTCTTTTGCTGTCAACTAATG 

GTTTCCAATTAAGGGCAACTTTTTCCACCCAGAGATATTTGTCAATGTCTGGAGACATCT 

ATGGTTGTCACAGCTGAGGGAATATTAATTAATAGCACCTAGTGGGTAAAGGTGAGGAAT 
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GTCATTAAATACCCTACAATGCAGAGGGTAACCCCCCACCAACAGAGTACCTCGGCCGCT 

CTAGAACTAGTG 

Sequence 1097 

CAGGCAGGTCTAGCTTGAGTCGACCCACGCGTCCGATTTAGAAGAGCCAGAAATAAAATA 

AMTATCTTGGTGTAACTTTAACTAAGTAAGTAAAAGAACTGTATGACAAGATCTTGTCT 

TTGAATTAAAAATTCTGAAGATGACAGTAGAAGATAGAAGATGCTCATGAATTGGTAGGA 

TTAACATAATAAAC ATGGTGA TCTTACCAAAGGCAATAAACAGACTCAACACAATAACTA 

TCAAACTTCCAACA I Mill I ATTAATCTTGCAAGAACACAACTTAACTTCATAAGTTGA 

ACAAAATAAAACAGTATATGTCCAGCCTGGCTTGGCCCTTCAT 

Sequence 1098 

TCTAGAGCGGCCGAGGTACTCAAAATCCTAACCAGAGCATTCAAACAACAAAAAGTAGGT 

TAAAGGGG ATAC AAAACTN GG AAAATG AAAG N AAGGTCAATTAATTATTCACCTAN TTTT 

GCCAGTATGGAATNATGNATTAGGTTTTTACCTTTTAAAGNTGACCNCCCCAAAAAAGGT 

TTC N C AACTCGGG AAG N AAAACCT AACCT AT ATAG N C NTTG AATTAAAAAACC AAAN CTT 

TTCCGAACCAAAAGGGGTNANGCCTTGGGGGGTTMTTAAAAMCCNTAAAACNTTTCCA 

AANTTTCCAATATTAAAGGNCCCCTTTTCNCCTTTTTTNAACCTTCCAA 

TTNAAAATACCAAGGGGGNTNTTGGGAAGGNAAAAAANAAAAATATAATTTTAAGNGGGG 

GGNAAAAAANTTGGGAACCCAACCCCCCCNTTTTTAA 

Sequence 1099 

AGGTACACACACACACACACACACTCTGCAGAAATTACTCACAGTTTTGGCCTGTAAGCA 

GTATTCATTCTTCCATTAMTCTGCTAGTATTAATATAGGTTTTCTCCCATTTTATATTC 

CTTTGACTATTAAAGTMGAATTCAGGACTTGGAGAGAAAGTGGAAAATTTCCATTTAAT 

TCAT TAAG TTTTAAATTATTAGCTATACCAGAACCCTTTATTATTATTTTATG 

GATATTTTGCCTATATGTATGTTTGTGCACCATATCCATGACTGGTTGCTTGAGGTCAGA 

AGTAGGCATT 

Sequence 1100 

GGGCAGGTCTCGTGCGGACGCGTGGGTCGACTCAAGCTAGCANATGGAAGCTAACTGCCC 
ATTGATAAGTAAAGTTTTTACAAGACGTAGMCAGTNTTGTTCTNGTNATACATTATCTT 
TGGTTN N ATTCTTATG AG AATC ATATTG AANCATC AGGCC ATN NTTTTAAAGGGTG G ATA 
TTGGCGCGTGNAACCCTTTGGCTCCCATTTCTTAMTACCAACCAGGTATTCCGTTTGGA 
ATAGNTTTGGTAACCAGGAAGTACCTATTCTTGGACCCCCATAAAAAAACCTTTAANGGT 
TCTTAC AC N NTGGTTTTGGAAGN ATCCTTATTAGGAAAAAAAAAGGGTNTTGGCTTTTTA 
TAATTN CTNTAACAGNTTCGGTTTTAATTATTGTTGGTTGGGGG GGTTTGGT 
Sequence 1101 

CGGGCTGAGCCGGGAGCCTGAAGCCCAGCTCNGAGTTATATTGGGGACAGGGCTCCTTTG 

GCTCNTGGTAATGGAGNAAAGGGCCTTGANGAATTTAAANGGGAACNCTTGAAGGGGAAC 

CAAANNAAAAGNAAAGGCCACCCCTTCCAATTTTMTNTTGGGGTTGGTTGGTTCCCCTT 

CGGAGAAATACCGAAAGNGGCNCCATNTNTTGGTTGGGGTTGNCCTTTANTTTCNMTTT 

TNCCCCTTNCCCCGCGGGGGGGCTTTAAAAAAACCAA 

Sequence 1102 

GACGAGGGCAACCCAACACCCTAGCCTAAAGNCCCGTTGACAACTGCAGNCATGGGTTGG 

CTTGGCCCACCGCCTTGGCANCCGNTCACGNAAGGAAAAAAAGNCCGCGGGCCCTTAAAA 

GGCGGCGGAGGTTCTTGGGTTGAACCTTGGGGCANCCCCACCCGGTTGCCANGGCCTGGA 

TTGGGGTNACCCNCAAAAGGCCTGTCCCCAAGGCCCGNAACCTTGGGGAAAANGAAATGG 

TTCCTTTTNGNGNNAAAAAAAAAAAATGGAACCCCGGTTGGGGGNANGGCCCCTTGGGGG 

GGCCTTTGGGGAAGTCCNCCCCGNAAGGGGTTCCCCGGGCCTGNTTGGCCGGGGNCCCCA 

AAAANTCCAAAATGGNCCANGGGGNTGGGGNCCAACCCCCG 

Sequence 1103 

AGGTACCTAGAATATTTATTTCAGATAATATCAAGATTTACCTATATTTCTACCAAAGAC 
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AATGATAGGTTTTTTTTTATATAGTTGGAGGGCAGGGATAGTGTTTAAAGGTTTATGATG 

TATAAGCACGCTTAGACACACCAGGAGAAAGTATCAGATCTCATTACACATGGTTGTGAG 

CCACCATGTGGTTGCTGGGATTTGAACTCGGGACCTCTGGAAGAGCAGTCAGTGCTCTTA 

TCCACTGAGCCATCTCTATAGCACTTTAAAAAAAATATTAAGGCATCATACAGCNTTTCT 

GCCTCTATGTATGCCCTGCA 

Sequence 1104 

CGGGCAGGTCTTTGTCTCTCAAACTTTTATACTGTCATAGTTCATTTTTCTTGTACAGAT 

GCTCTTTCAACTTTTTTTGAGAGTATATACATAACAATTGCAGTTGAAAGTAGGGTGGGG 

TTTAGTCTCCTGGTCACAGTTCAATAAGTACGGACGCGTGGGTCGACTCAAGCTAGCTTG 

TACACCTN 

Sequence 1105 

TCGACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACTTATAGTT 

GAGAGCCAAGTCTCCCTTATCATTGGTGAATGAGGATGAGCTACTGAAAACAAAAAGAGG 

GTCTTCTACTCAGCCTCTACCCCTAATATTTATATCAGAAGCAGAGATTAACTGTCCTTA 

CTCATTCACACGTTAATGGAAGAGAAGGAAGTTTCCTAGAAAAATCCTCCCGCTCCACCC 

TGCAAACTTTATGCTTTTCTGTTACATAATCAGGCAGGGGCAAGACCTAMCTATTTTGA 

ATTGGTGGTGTTGAGGCTAMTTCTCTGCTATTGACAGMTTGAGAATGTGATCAATTTC 

AGAGTAGCCATGTTACAMTTTTGTCCCMTTTCAATGGGGGAGGAATTATAACCAAATC 

AAGTAGTGGTTGGGAAGA 

Sequence 1106 

CCGGGCAGGTACGCGGGGAGTTCTCTCAGGCTCTCCAGAGCTCAGGACCTCTGAGAAGAA 
TGGAGCCCTCCTGGCTTCAGGAACTCATGGCTCACCCCTTCTTGCTGCTGATCCTCCTCT 
GCATGTCTCTGCTGCTGTTTCAGGTAATCAGGTTGTACCT 
Sequence 1107 

TCACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGTAGAT 

GGAAGGAAGAACTCGTGTGCTTAGACCTGACGCTGGGAGGAGATGCTGCCACCTAGGTTA 

CTTGTAGGACCCTATACGGCAACCTCCTTTGCCAGGAACTATTTATAAACATCCTGCAGG 

AAAATGCAGTAAAGTAGAAGAGACAGGGATATCCCAGAAGGTTATGCAAAACATCAAGAG 

AAGATGAGAGGAGTCTATATGTCAGAATACACATTTCCCACCTTGCCCAACAGTAGAAAA 

ACATAAGAAGAGAAAAACATTAAAAAATGACAAGGAAGTTAATGGAAGTCAGCAATGTGA 

TGGTGTTTGGGAGGTGGAGCCCTTCAGAAAGGTAATTAATGCCCTTGTAAGAAAGAGGCC 

AGNAGAGCTTGCCGCACC 

Sequence 1108 

GGGCAGGTACTTGTCCTGATGGGTCATCCAGAAGAGTTTCATGGAGATCAGTTCACAACA 

TTGGACAGACTATGAAACAMGTCTGTAATATCATTTATACTCAGTATTGGTAGGCTTTT 

TCCTTAATACCCCCACTCCAACAAAACAGACCTGCCTTCACTTCTCTTGAAAATATTATT 

ACCATCAGTAACTGCTTGTCTACTATGTCTCCAAGGCAAGCCACAGGCTATAAGCCAATT 

CTGCAAAAGGGAGCAGCAAMGAAATGCTTAGTGCAATTCCTCTATTTAAMGTTTGCTT 

G AGTTTTG ATTA \ I I I I GTTATTTTG 

Sequence 1109 

AGGTCACGCGTCCGACCTAGCTTGAGTCGACCCACGCGTCCGATTTCTGGTTTCTTTGGA 

GACTCGGTTGTGCCATGTGATGCTTTTCTGGAAACTGTCTTATGAGAGGMGCTTTTGCT 

GAAGCAGACACATGGGAGAATGTTTTGTTAAGAACAGACATGTAGTATTTTTCTAGAGGC 

AGCTTGAATAAAGAGCATGTGGTATTTTGCTAGAGGGGACACTTGAGAGAACATGTGACA 

TTTGGAAAGGATATATGTATAACCTAAAAGGACAGTGGCCAATGCTGTGTGGTATTTGGG 

TGTGCCTTACCATTCTTTGCTGGC 

Sequence 1110 

TCGTGATCTAGATCCCTATAGTGAGCGGACGCGTGGGTCGACTCAAGCTAGGTCGGACGC 
GTGGGTCGACTCAAGCTAGGTCGGACGCGTGACCTTCCCCAAAATTCTTCI I 1 1 I ICAGA 
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TGAMGTCATACACTCAGCTTTCATATTTTAATATACATAAAACAGCTCAGTGACTGGGA 

CACTTCCTAACCTCCATTTGGGTAATTCACTTCCCTCCAATAATCCTGAGTTATCATTTA 

TTATATTTFGTATTTTACCTTGGCTGCACTGGAGCCAGTTGGGCAGCCCCTCTTGGGCCT 

CACTCTCATGTTCCTACATGGGCAG 

Sequence 1111 

CGGGCAGGTCACGCGTCCGCATGCCTACCCGGGCJ I I I I CTGACTCCGTGAAGCTGATCT 

CTTCCTCATTCGAGGMGGGTAMCCCTTTGTAGATGGGTAAACCCTTGTCAGTCAGAGA 

CAGTAGCCATCCCCTGTAATTCCAGAGCGCAAGAGACTGAGGCAGGATAATTACCTAAAC 

CTAGGAGTCCAAGATTTGGCTAACGTAGTAAGACCTGTCTCGAAACAAAACAAATCTTGA 

GGGATGTGATGCTTGCTTAGTGTTAGATTGTGTGATTAGGATACACAGGGCACTCAGCTT 

CCTATCTCCACTGAAGCAGTCAATTA 

Sequence 1112 

CTACTGAGACCATTACAAAATTAAATGGTGACTTTTTAATCTGGAATTGCAAACCGACAG 

CANCCTGTCTGGGGAGCTCCATGTNATGAGTGTATGTCTCCAACAGNCTTCAATGACTGA 

CAGGTGAAAAGCTGTGCCACAGCCTGGAATTATTATGCTGACACACAAAAGAGGGCGGGA 

GTGACAGCGTGGTGATCTACCANTGCTGTCTGTGGGGNTAATGGTAAGCTTTCCCACTGC 

TAATGGTTTATNNANAGAACAAAGAAAAGGAAAACGGTGCANGGGAAAATCAATANNTAT 

TAACATAGTCATGGTAATGAATGCAATCCNTTATTTCTGGGAT 

Sequence 1113 

AGGTACATAAAACTTGTAGAGATTTACACACGCCAGCCGCACATCGCCCCTTCCCAATGG 
CTTTACATCTTGGTGACCTTTCAGAAGCCAACGGCACCCTTCCTGTTGTTACCCGTAATT 
GGAAAAGCATCAGAGATAAAAAAAAGTGC 
Sequence 1114 

CGGGCAGGTGAAACATTTTTATTATTTA I I I I I CTTTGTTACAAAAGG ACTAAGTGGTTC 
TGAAAGCCAAGGCAGAGTTCATGTGTCTGGGCTGACATTCCCATGGCCTATGAGCACTGA 
GCAGCCACAGGCTTTTCCAAGGAAGGCCCAAGGCCATCACATCCCCATCTACATCTACAA 
TACTTGAATGTATTTACATGTGTTGTTCTTTAAAMGGATTAC^ 

AATCTAAAAAATGATGGAGAAMGGMCACAGTTTATCTTACACATTTGCAGGATATACA 

AGGGTAACT 

Sequence 1115 

TTGTGTGGATTTACTTGTTGCCCACTGTCCCCATACCCTCCACCCCCACATGGTGTGCTT 

CCTCCAGAAAGGGACTACTTCTGCTGACCACACAGAAGACGTGTGTAAAGTCTGTGTATC 

AATGAATGGATTCTCATCTTTCATAG f 1 1 1 1 HI I AAATAGTTTTATGTAGTGTTTAACT 

AMTTTCACTTAAAAAGATATTTACCAGAAGCTGANAGTANGGTGTGATGAGGTTGGGTT 

CANGAAGGACTGNTATCACATGGCTTCCCTAAGGTTGTATATTACATTGCTAGGACACCT 

GACA 

Sequence 1116 

CGGGCAGGTACTTTTCTCTTAACTTACCTGACTGAAGGCATCCAATCAGAAAGGAAAGTG 

AACAAGTATAATGTTTCTAAGTGAGTCCAGATGGTAACAAGAATTCAAACATTATTCTTT 

TTGTATGTTTAAATAAAACAAATTTTACATTTTATTTGACCAGCAGATATAACTTCCTGG 

GCATAATTTTATTAGCAAAGAACTGATGATACTGGTAATAAAAAAAAAAAAAAAAATGAA 

AGGAAAAAGGAGAATGCACCTCAAACCAGMTACCATAACATGTTTTGTGGAAGCTGCAT 

TTTATTTGAAAAC . 

Sequence 1117 

ACTTTTTTT1 I CTCTGT AGG AAGTC AAC G AT AG AGTG AC ATGTTGGCTCTTTC ATTTC AA 

CTGANACACCTGGTAGCTATTGGATGGCATTTCTGTAATTATGAATAGTAGTTGTGATAG 

GCTTATTTGCACCTTGTACATTCTCTTACTTGGGAACGCCATCTCATGCCATTTCTACGT 

TTCTTATGCAGTTAGTTTGGGGCATGNGGTANTCCACTGGAAAATGACTCAAGCGTGGAA 

TGTATTTATGATCACAGATAAGGAAGTGGCCGTCCACAAAGCTTTGGATGGTTGTATTTG 
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TCATTTGAACA 
Sequence 1118 

AGGTACCTGCTTTGTGATACTGTGAAGATTAGATAAAGCAAATATCCACACAGGAAAGCC 

CTGGGTGTTAGCATAAAGCCGGAGTTGAGMGGCCAGCTTTCTATTTACGTGCTGAACGA 

TGAATAGTTAMTAGTTTCCGTCTGGCCCACGTCCTTACACACACCCCTTGGGTTTTCTT 

CTCTTCCCTGGTAGGACTGCATCGGACGCGTGGGTCGACTCAAGCTAGACCTGCCCGG 
Sequence 1119 

GCGGGCAGGTGTCCGCTCACTATAGGGATCTAGATCGGACGCGTGGGTCGACTCAAGCTA 

GCTTGTACAAAAACTTTCAATAAAATGCTTCAAATAATGATTGTCCGAAGCTTGCTAGAC 

ACTTTGTAAATGTCAGGAACACATTAAATATGTTAAArGAGCATGTCCAATATCCAGAAA 

ACAACACGAATGATTTACTATCAATTTCACACATGCCTGGGAATGTCTCCCTGATGCAAG 

ACAAAAAGGTCCTCTTTGACCAGAGCATACTGACCCCAAGTTTTGGTCCTGGCACAGTAT 

TCATGGGTGCCATTCAT 

Sequence 1120 

GGGTACAATGGAAAAGGGGCTGTGTTACTAAGACTGACCAAAGAAGAGATTCCAATATCT 

TATGATTTGTATGGTCACTGTTAACTGCAATTGTAACATGAACATATTGCCTATAAGCCT 

GTGCTCTCCCCT TTAAA CACAAAATAAAAAACAAGAAAAACAAAACAAAACAAATGAACC 

AAMTAGAAAACCTTTTACAGCGTGAGTTGTTTCTACTTAGGATTGGACATCAGTGCATT 

GATCTGAGTCTGGAGATCTTAGGCCAACTTAACAACCTGTGAAAACTTCTCAATGGGGTA 
A 

Sequence 1121 

CGGGCAGGTCTCCGGCGGACGCGTGGGTCGACTCAAGACTTTTAGCCTCTAAGCCATCTC 
TCTCACCCATTGATTTAAAATATTTTGCTTTGTGAATTTTGAAT^ 

ATAATTCTATCATTTTCTACCTTCATTTTTCTCCTTTCAATCGCTTCTATCTATTCTCC 

ATTTTGCTGG 

Sequence 1122 

CTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATGCTGTTCTATAT 

CTTGAI I I I I I I I II I CTACTTTACACATCTCATGAGTTTTATCCCC AGCACTGG GTATA 

GAATTGCCCCATTCTTTTCAAGGGCTTGCTGTAAAGTATTCTGGGAACTGGGCGCACTCT 

CTTTAACTCTTCCTCTGTTGATTGTTTTCAGTCTCCTTCTTTCTATAAAGCAAGTTAAAG 

GAAACATTCTTTTAACAATGTTTTTAACAAGTCTTAGCAATGGTTTTT 

ATAATCTAGGAGTAGAGTTGCCAGTTTTTAGAGTATGTGCTTTTGAAATTTTGTT^ 

TCACTTGGGCAGCAGTAAGTCATCTTTTGCTTATTGTCGNCGCTGTCTCGACCTGAAATC 

CTTTCTTTTTATATTCTTGCCTTCTTTT 

Sequence 1123 

GCNCGAGGTACTACTGGCACTGAGCCAATGTATGCTATCAAGGAAAGCTTTATCTGTCAC 

TGAGCAAAAGGTGMGTTCAATTAGGTCAGTTrrATCACTTCTTTTCCTACACACAAACT 

ATGAGGAGAGATCATTTCTTTTCTTTCTTTTTATTTAATTTTNN i II II II I GGAANAAC 

AISIGGGGNTCNCCACNTCTGTCCGCCCATGGTTTAAGAAGTTACCCTGGCCNCGGGGCNGG 

NCCGGNTACTTAGTAACCTAGTTGGGATTNCCCCAGGGCCTGCMGGGAAATTTTCGATA 

CTCAAAGCTTTATCCGGATACCCGCTTCCGACACTTNGAAGTGGGGGGNGGNCNCCCGGG 

TAACCC 

Sequence 1 124 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCTAACCTGGGCG 

ACAGAGTGGGACCCTGTCTCAAAAAAAATAAATAAATAAAAGAAAGAAAAGAAATGATCT 

CTCCTCATAGTTTGTGTGTAGGAAAAGMGTGATAAAACTGACCTAATTGAACTTCACCT 

TTTGCTCAGTGACAGATAAAGCTTTCCTTGATAGCATACATTGGCTCAGTGCCAGTAGTA 

CCTGCCCG 

Sequence 1125 
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TCACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGCTCTC 

TCAGATTAAGGATATGTGTCTGCTAGGTAAAGCGACATCTGGATTCATTGTGTAGGATGA 

MGAMCCAAATTCTGACTGCTTATTTGTGGATCCGCCAAATCTGGCACGATGCCTATCT 

CACGTGGGACCGAGATCAAGTCCTGGCCCG 

Sequence 1126 

TCACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGATC 

TCGGTCCCACGTGAGATAGGCATCGTGCCAGATTTGGCGGATCCACAAATAAGCAGTCAG 

MTTTGGTTTCTTTCATCCTACACAATGAATCCAGATGTCGCTTTACCTAGCAGACACAT 

ATCCTTAATCTGAGAAGAGCCCCCGCGTACCT 

Sequence 1 1 27 

CCGCGGTGGCGGCCGAGGTACTGTAGTTAAATGTGGGTTCTACATGGTTCTGCAGTTTCT 

AGCCCTATAAACCTCACATGCCATGTTCTCTCTCCATATATCTTAATCACCTTTAATTTG 

GAMTAACAGATCTACCCTGCTTATCTTACATTGTTGCAGTGAGAAGAATGACTACGTTT 

TTTGAAACCACCAGCGATGCTGTACCTGCCCGGGCGGC 

Sequence 1128 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAAGGCATTTTTGCTAACTGTAACTTCCCACTT 

AATCAACAAAAACAAAAACACTCATTTCTGAACATTCAGTGCATCCATGATTAATCTTAA 

TTACACCACAAAGGTATTTTTCMTGGTGATTTTGCGGGAGTGGGGTAACAGTTTCGAAA 

GCAACATTGTCAGAAACATAGTTGATTTTAAAGGTTCTTTCTGGTGACTTTGACTTCTGC 

Mil l l AGAAGACCTTACACAGAGTTGTATTTATTTCTCCTGGAATATTTCAAGCAATTC 

AGAGTGAAAGGGTATACATTCCAATTTGCGTATGAGATAAAATTTAGTTACATTGAGAAG 

CTATTTTCTTTAGTTACAGGGAAAAAATTGTAGGGCTTTTGGAAGCCTCTTTGATT^ 

ATAGGAGGA 

Sequence 1129 

CTATCAGTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGGTGCCACACTGGGCC 

AGGGTGGTCATCCTGAAATACATGTCCAGGGTCTTGTTTGTCTATGATGTGGGTGAAAGC 

TGCCTCAGCCCGCACCACAGTAGAGAGCGGGACCACCTCACGAAAGTTTATAGCAAACTC 

CC AG AGTCTAACCCTG AAAAGCAN N C AGGAAC AAAGACCTTTCCAGAAAGAAGG AC ATG A 

ACANACGCTTAAAGAACGACCTGGGCTGCCAGGGTAAGAACCCTCAGGAGGCCGAGAGTT 

ACTGTGCACAGTACCT 

Sequence 1130 

GGCGGCCGAGGTACAAGGGCAGCAGAAGCAACAAAGGCTGTGGCCAGCAACGAAGGCAGA 

GTCCCCGGTAGGCGGAGGTCCCCGTGCACAGTGTGGTGCAGCAGTGAACCCAAGATTTTC 

CTTCACAGGGCAGGGGAACTATTTCCTAATAAGCATTGGGCATTTCAGGCCACAANGGGC 

AGCCATTACTCAGCACTTGCACTGACCTTGGGAATTTGCTGGGCAAGGAAAATGACGCGG 

CCCTCATCATCGATGGCCACACCCTGAAGTACCTGCCCG 

Sequence 1131 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACACTTTCGTGG 

CTTATGGGTATCTTAGTATMCCTTTTCTGGTTGAGTGAACTGTGTCATTTCAAAAGCCT 

GAAGACATTGTGATGAGTGCTGCCTCCATAATGGCTACATTCTAGGGGCTTTGCCCTGAA 

TCGCATATATTAACTCAAAAAACAAACAGTACCTCGGC 

Sequence 1132 

CGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATGAAAACCCTAAAGCAAATGAACA 

GCTTAACCGGATATCACAAAGGCTACAACAATTAACAGAGGTTTCAAGAAGGTCGTTACG 

CAGTAGAGAAATTCAGGGTCMGTTCAAGCAGTTAAACAGAGTTTGCCACCAACTAAAAA 

AGAAGCAGTGTAGCANGTACCCTTGCCCGGGCGGCCCGCTCTAGAACTA 

Sequence 1133 

TCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCTAAAACTTAAAGTAT 
AATAATAAAACTAAAAAAAAAGTTATCCATGGCCTAGACAAATCAAAAAAGCATGTGACA 
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CTATATGACATCACATTTCMmCTTGTAGAMGGAACAAGAGTAGTTTAACACCAGCA 

GCATTCATATGCAGAGAGCATCATAAGTACCTGCCCCGGGCCGGCCGGCTCTAGAACTAG 

TGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnTGTTTCCTGTGTGAMTTGTTTATCCGCTCACAAATTCCACACCAA 

CATAC 

Sequence 1134 

CCGCGGTGGCGGCCGAGGTACCTACATTCTAAAAGTATAATAAAAATATACAACAATAAA 

AATGTAAACTGTAGGCTATAMGAAAAATMGMTGTTAAAATAATAACAAATTGTGCCA 

AGAAGTTATAAATCATCTGGTAGGCAGTCCACTGGTAAAATTAAGAGTTATACATAGAAA 

TGACACACTGAAATAAAACTTTTATTTCTGCCTCATTTCTCAACTAAAACACCAGTGACA 

CTTAGCACAAAGTTCACATGCACAACTGTTCAATCACAGTTTATAAAAGCACCTGAACAT 

AACACAAAGCCAAAGCAATCCTGGACAAATGGACTAAAGCTGGAGACATCACACTACCTG 

ACTTCAAAATATAATACAAATCTATAGTAGCCAAATCAGTATGGCACTGGCATAAAAACA 

GACACATAGACCAATGGGAACAGATAGAAACTCA 

Sequence 1135 

AACACACATACCATAGGGATGAAGATAAGAAAGGAATGATTGTGACTCTGGATCAAAATG 

CTCCCAGTTCCTGAAGGTGCATGATTTAACTGTGAATGCCAACTGAGATCCTAACATCCT 

ATACAACCAAAGATGAACTAGTCTCTGAACTGGTAACTCATAAAAGACAAATCCCACAGT 

ACCTGCCCG 

Sequence 1136 

AGGTACCCCCTCTCGTTCCCTTTCAGCTTTATCCGTGCTACATAGGGTTCACGTCACCCA 

TCTGAATGTCCATCACATCATAACATCAGATCATGGTTGAGAGCTGATTACATGCTAATT 

GCATAAACTCATTGAATTTTCACAACCAACAGACAAGATAGATATGGACTGACATCGCCT 

CCATTACACTCATGGTACCTGCCCG 

Sequence 1137 

GACTCACTATAGGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCATCGAJCCT 

AGTGGGACGCGATCCAAAAATATGCCTATTAAAGATAATGCTTTGGTTATGTTTAATGGG 

AAAGTCTATCTGTTGGCTAAAAAGGGGACAGATGTTCTGCCATCACAAATTGGCCAACAG 

AATTCTGTTCTCCTGATACTCCAGTAAGAAAAGACACGTTACAGACAGTGAGTTCAAGTC 

CAGTCACAGAAATATCCAGAGAGGGTTGTAAATATTGTTTTGGCTAAAAGTAAATCTTCC 

CAGATGGAGACAAAATCACTTTCCAATACCCAGCTTGCTTCCATGGCCAATCTAAGGGCA 

GAGAAGAATAAAGTGGAGAAACCATCTCCTTCTACCACAAATCCACATATGAACCAATCC 

AGTAACTACTTAAAACAGAGTAAGGACTTTATTCACAAATCCMTCTTTCCAGTTGGGAT 

TTAGTACCTGCCCG 

Sequence 1 1 38 

CGAGGTACTCTAACCTGGGCAGCAGAGTGGGACCCTGTCTCAAAAAAAATAAATAAATAA 

AAGAAAGAAAAGAAATGATCTCTCCTCATAGTTTGTGTGTAGGAAAAGAAGTGATAAAAC 

TGACCTAATTGAACTTCACCTTTTGCTCAGTGACAGATAAAGCTTTCCTTGATAGCATAC 

ATTGGCTCAGTGCCAGCAGTACCTGCCC 

Sequence 1139 

CCCTGAAGGCCGCTTACTTGGAACCGTATCTTCTAAATGCTCTGAAAACCTATCCAAAGG 

CAATTAAAATCACAAATATCATTGAAAACATCTAAGAACTACGTGTAAGCACTTCGGGAT 

AGTCAGGTAAAGGCGGCCTCTAATTATGAAGCATTTTGATAATTAATAGAATTCTAAGCC 

A AAGC TTAAATCCGGATACACATCATMTGGTTAAGCCAMTAACACTTMTCATTTTAC 

CTTTTATTATTGATAAAGACAACACACTGAAGTAGACAGACAGTAAAGAAAAAGTCCACA 

TCTTTTTATTGGGTCCAAATAACCTGATTGNAAA 

AGAATTTGTATGTTAATGACCTTAGTAAGATGTGGCTTTGGCCAGCCTCAAAAAGGGTTG 
CTCTNCAGCCAGGGCCCCTTTTTTCCAGTTACTGCTCCCATCTNCTTTAAGTATATTCAA 
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CTGGTGAGTGGAACTCTTTATCCCGTGTGACAAATGCGTATTGAGGCCCTAACATGCCCC 

CAGACCTCAAGATGTGCTGCTCATACATTTGAAGGTAGCCCTGACTTGTTTNGGGGGGNG 

TGGGGGNGGGCCTTTTCCATCAGTTGTNTTGGANGNTGTGGCCAGCAATCCATTCCAAAT 

TGNCTGAGTTTTCTTNATTTTACTTTNGGA 

Sequence 1140 

AATTGNAGCTCCCCGCNGCGGCNGCCGANGTCAATACTAGAGAAGGGAGAGAATATAGTT 
AGGACAATAAATTATTGAACAAGTG ATTTTTC TGTGATG ACTATACCC A7TTTC C AAAAA 
CTAATTTCA l l l l I GGTGTGTGTGTGAAGAAAGTTTGGAGGCTTCAGTTTCAATCACTGT 
CTCMTACTTMTTTTCCATTATTCACAGAGAAATTC 

AAGGMGTAATAATAAMGATGACTTATGCAGATCCCATGTGACCATCTTACATTCTTAC 
TCTTGTGCCACTGTTTCCCTGGCAGTCCGTTACTGGTTTAAAGAGCACTGGGATATAACT 
GTAGAMTACGGATTATTTTCTCACCTCTGACACTCATTCATGGTGGAGCCCTTACCTTA 
NTGCCAAGGTTATTGGCATTTMCTTTACTATTTAACCTTGCTGAACTACAGTGTCCTTC 
CTCAATGTCAGAAGAAGCTTAGATAGTTAAATAACTTCTAAGGATTATCTATCTGGATTA 
TTAGGAl l I l ICATCATTCTTGGCGTGATTGTCACAAAGAAAACAGGACATAGCTAGGTG 
GCAAGTGCATCCTCACTCAAAGTTAC I Mill 
Sequence 1141 

TCACTATAGGGCGAATTGGGAGCTCCACCGCGGNGGCGGCCGAGGTACCAGTGAGGCATT 

TCCTGATTTTTACTCATAATATTTACAAAGATTAAAAGAGATAAAATTCCAAATGTGAAG 

GCTTAMTTGTTCAAATCCTTACTCAAAGAAATAGTGGGAAAGGTGGGTAAGTCAGAGTC 

TGCTGCGTGAAGGAAAAAAATCMGTTAATCTCTTCTGAAATTATCATGATTTGTTTGGT 

TTAGATTCCCAGAAACANAAAAGANNAACANAAAAAAAGGTCCCT 

Sequence 1142 

ATAGGGCGAATTGGAGCTCCCGGCGGTGGCGGCCCGAGGTACAGAATGAGCTATAGCCTG 

CAGTGACCAGAAATCTCCAAGTAAGGGAGTATACACTGAATGTCATGCCCTTCATTAAGT 

TAAGGCCTGTAGTTNAAGTTCTTTCAGTATGTGCCATGAACAACAATAAACAATGTCCTT 

GGTGTACTCCTTTGAAAGATGGACATTGAAAGTTAAAAATCAGTCACTATATAGGGCCTG 

TTGATNGGAAMCCCATTTGNGGTTCTTACAAATTGTAAGCNGGAATGCCTT 

Sequence 1143 

CCGCGGTGGCGGCCGAGGTACACCGTAGCTTATTTCAAGTGACACAGACCAGGCAACTGA 
CAAATACCGACAAGTAMTATTGTTCATTTTCCTAGGTAATCTCTATAATCAAGGGGACT 
GAM n I ATTCTGGMTTTTCCCCAAATGTTTTCTTTGTAGAAATTATGTTGCTCTCAAT 
TGTGTGCATGGTAG TACCTGCCCG 
Sequence 1144 

CCGCGGTGGCGGCCGAGGTACTCTAACCTGGGCAACAGAGTGGGACCCTGTCTCAAAAAA 

AATAAATAAATAAAAGAAAGAAAAGAMTGATCTCTCCTCATAGTTTGTGTGTAGGAAAA 

GAAGTGATAAAACTGACCTAATTGAACTTCACCTTTTGCTCAAGTGACAGATAMGCTTT 

CCTTGATAGCATACATTGGCTCAGTGCCAGTAGTACCTGCCCG 

Sequence 1145 

AGCGGCCGCCCGGGCAGGTACTCTGACGGAGTTACACACCCCCAGCTATGAAGTAGTCTT 

GCCAAANCAATTGAACATGACATTTCATNAAGCTCTTANAACCGACTATACCAATTTATG 

GNATAAACAGAANGACCACGTGAAACAAAANCTCATTAANGTATGCTCATANNGCAAAAA 

TTCACGACTTATAAAAAANACTTTTNACTGGGACCAATAGCATCNGCANGTTTTCTTGNC 

ANACAAAAATATTGANANAAANTGTCAATAANGGGGTAATAGNAGGAACANGTANTAANT 

GTAGATAACANNAAATTGGAATTATGGAGGGNGNAGTAGTAAGGAATTCAAAGCGCATNG 

TCTANCAAACCGGTGGTTGGGGAATGGTTTAATTTTTN^ 

AANANAAAACCAAGGTTACNTT/WVAANAATTTTTN^ 

CTCTTANTAGGAAAAACAAACCGGCANAACCTTCCAAGGGTTTTTNAANAANANGGATNA 
ATTTTTTTTATTAAG N CCCTTCCN GGTAATTGG AAAC NTTTTG GGTNTTTTNTTCCAATT 
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NTCCAATTNTTAAAATTAAAAT 
Sequence 1146 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCMTTATGATTTT 

TAAAAATGCAATGGCATGMTAAATAAATGAGTATACCCATTTGTTGAT.CTTATTCCCTA 

TTACTTCAMTTTGTAAGTATCCTTAGTGGAAAAGA 

ATGTCTCAACATGATTGGGGTCTACTCAAGGGGGAGGGTGGGAGGAAGAAGAGGAGCANA 
AAGGATAACTATTGGGCACTGGGCTTAACACCTGGGTGGTAAGATAATATGATCAACAAA 
CCCTTGTTACACATGTTTACCTATGTAACAAACCTTCACATGTACCTGCCCGGGCGGCCG 
CTCTA 

Sequence 1147 

CCGCGGTGGCGGCCGAGGTACTTGCAAGAAACCTCAGGACTTGAGTAACAGCAACATGGT 

AAGTTTTCTAAGTTTTCTTTTTGTCTCCCATATACGCTGGGCTGTGCTGGAATCACCAAC 

AGGCACAGAAAAAAATGACAACAAAACAACAACAAAACCCCCAAGAATATCCTGTTGTCT 

TTGGCCAAAGTTCAGGAAAGGGGAGCCCCAACAGAGACCCAGTAGGAAGCTCTAGCCCCT 

GTTTTGTACCTGCCCG 

Sequence 1148 

TNGGAGCTCCACCGCGGNGGCGGCCGAGGTACTTAGCTTTGTCGTCTTTCTTTCCTCCTC 

TTGTTNGAGTTTAGTATTAATAAAAGTTGGACTGAGAAAACCTTTTTTTACM 

GGTTATTTTTAGTGTAAACGTTTTAGAAGTANAATATACATATAAAAACTGCACAGATCA 

AA TGTGTG CATCTCAAATGGTGTTCCATTTTCAAAATATGAATACATATGGGCAGCATTT 

ATATTTTTAAAAAGTCAGMGGTGCCTCCTCATGCCCTTTCCACTTCTCACTCATTGTCC 

CTCAACCCAGGCATAACTACTCTCCTGACCTCCMCATCATAMCTAGTTrCACAAGCTT 

TGAAACTTTATCCAAATGAGTCATACAGGATAGATGTTCACAGAGCCTAC7TAGATATTA 

TGGATCCTTGCCAGGTAT 

Sequence 1149 

AGCGGCCGCCCGGGCAGGTACCTTGCAAGTATTAGACTTAGAATAAAACTGTGTTGACTG 
ATATAACTGATGCATGGTTGGAATAGTAGGTTCTTGGGTAAATTAATCCTAACATCAATA 
GCTCAATTCCATCATAGTCCTTGAACAGGCAAAATGTATGCTAATCACAACAGCCGTACC 
T 

Sequence 1150 

AGCG G CCGCCCGGGCAGGTACCTTGCAAGTATTAGACTTAG AATAAAACTGTGTTG ACTG 
ATATAACTGATGCATGGTTGGAATAGTAGGTTCTTGGGTAAATTAATCCTAACATCAATA 
GCTCAATTCCATCATAGTCCTTGAACAGGCAAAATGTATGCTAATCACAACAGCCGTACC 
T 

Sequence 1151 

AGGTACTATTTTCAGATGGTAGGGATACAMTATATTTTCCCTATATTCMGGGATTTAC 

MTTCAGTAGCAGAAACAGACATGTAAGCAAATAACCGCAAAACAACGTATTAACTACAA 

TAGCGTTGTATTCAAAACATTATGGAAAGATAGTAATTAGCAAAACTAAGTGATTAATAA 

AGTTTCATTAAATCTGGCCAGGCAGGGTGGCTCATACCTGTAACCCTAGCAGGTTGGGAG 

GTTGAGGCAGGCAGATCACTTGAGCTCAGGAGTTCGGACCAGCCTGGGCAACAAGGCGAA 

AACTTCTCTCTACCAAAAAAATACAAAAAGATTAGCTAGGTATGGCGCATACCTGTAGTC 

C 

Sequence 1152 

CCGGGCAGGTACCCTTGGTTTCTCAGACAACTCACTGATTTATGGTCTTGAGACCATAAA 

CTCATTTTCCTTATATGAATGACATTTCCACATCCACAACAATACCACCAAATATATGTA 

TCTAGTTCTTACTAACTGCAAATCCTCAAAGTGAACTGCGTGCATTTTAATGTTGCGTAG 

TTTGCTGATTTATGATTTCCCTTAATGTACCT 

Sequence 1153 

CCGCGGTGGCGGCCGAGGTACCAACCAGGGCTTTGATTTTCATGCTGTCCCAAAGTTGCA 
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GACATGTTCACGACACAGCTGCGTTTGTTCCTTCATCTTGTTCAGCAACCTTTGGTAAGA 
/^CTACCATATTTAAGATTGGTATTCTAGATCCAAAAGAAAMTGCATGTGTTTGCTGTG 
TGTGCATGGGCACATGTATGTGTATGTCTGTCTGTAAGGATAGAGAATGGCTGAAAATTC 
ACAGGTTTTTAMTTCTTAAAAACAGG^^ 

AAGATTCTTAATAAAGTTNC ATG ACAAN G TAAAAC AAC TATTTNAATTGAG AATGTTAAA 
TAGCCTTGGTACCTGCCCCCGGCCGGGCGNTCTTAGAACTAGTGGGATCCCCCCGGGCCT 
TGGAAGGGAATTTNGATTNTTNAAGCCTTTATCCGAATANCCCGTCCCACCCTTNGAAGG 
GGGG 

Sequence 1154 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAGGAGGCAAAAAAGCAATCAGTATGACTTGAA 
CTGCTGGGTTTMTTACTCAATATAACACTTGCCATTTAAAGATCCATATGCCCATCAGC 
ATGGCAACAGTCTCTCATAAAGATTCCGGTATCATATGGCACAATTTGTACCT 
Sequence 1155 

CCGCGGTGGCGGCCGCCCGGGCAGGTACATACGTAACAGGGACTGGACCACTCCTCCCTT 

AGTTTGTTGTCCATTTCTCCAGTCAACATGTATTTACTGAACACCAGTTATGCATAAGGC 

CTTCTGCTGTCTCTCATTAGCAACGGGAGAGTGAACCAGACTAGAAAGGACAGACAGGTA 

ACTGTCTATATCTAAGAACTCTTTCTTTTGGAAATCAAAACTCATCTGCAGAGAATTAGT 

CTTGCCTCTGTCCAATTTTCCTCCTTATGGCCCCCAACTTCCACTAAGGGTCATGAGGTT 

GGAGGAGGGCTCGAGGGGCCACTGAGTGAATCCCTCTGCCTAGAATCAGCAGCACACTTA 

TAGGATCCTCAATGCACAAGAACCTGTCAACACANAACAGCTTTCAGAGAATACCTTCCC 

T 

Sequence 1156 

CCGCGGTGGCGGCCGAGGTACACGAGGGAGGCACGTGAAGTCTATCATTCCATGCTCTCA 

AATGTTAACTTTCTGCTTTATATACATAGCAATGCCTCATTTACCTAATGTTGGGGAGTA 

AGGTA7TATATGTGAAAATTTTATTTTAGCTATACCTTTTGTGGAAAAAI i I I I ACAATA 

TATGCTTTGGTTATTATTACTAGCATACTGGCAGTAGTTGAGTGAGGCTATGAGATGATC 

GAAAAATGTAGTTTGNTGTCCACTGCGC I I I I I GAATTATTGAATCTACTTGAGTCGATT 

CAGTAGTCAGTCTTTGTCTTCTTTCAGTGGNGGAAAGCCGCTCTCTTGCTGCTGTTAGTA 

TCCTTGCTGCCCTCTTTTAAATTATGCAGCTTCTTATCTGTGGAGTAGGAGCTTCCTGAC 

ATTGAATGTAAAACAGCTAAAATGGC 

Sequence 1157 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCATCATACAACATAATT 

TTAAAAGCTTCACTGTTTCAATAAAAAAACTAATATAATCATGTTTATCTGACAGACCCA 

TTGCTTTATTCATTATAATCAMTAG 

TGCAGGTTTTC I I I I IGl M l I IC I I II I GAGGTGTGAATACAGTAATAAAATACTGACT 
AATGCTTGAAAGGCTTGAAGAAGGGACTAGCGGTGGAGTCCAAGGACAAGAAGAGATGAG 
AAGCI I \ I I I I I CTTTTCTTTTCCTTTTG GTTTC AG AGGGAAG AAGCTATGTAAATTGG A 
CTGGGAATTAATATCCAGCTTCAGTCAGAGAATAAGAAATAGCTTGAAAGTACCTGCCCG 

0 

Sequence 1158 

CCGCGGTGGCGGCCGAGGTACTTTTAAAACATTTTTTCCTTGAGAAATCATTGCCTAAAT 

CTCATACATATGTGCTTCTCTGCAGAAAAAAAAATGAAATAAAATAAAMTCTTTATTTC 

CAATGAGCTAATGAGGAAAAAGGGGATGATGAAAAAACAGGGAGTGGGAGCAATTTTGGA 

AATG G AAAGAATGCTGAATACCTACATGATG C CTAATTCAGAAG AACATTCTGGTATGTA 

GGAC ATTATTTTCAACTCTATCACTTACGGCAGA I I I I I I II I I ATATGCAG ATGTATCT 

CTACTCTCTAAAGATGTACCT 

Sequence 1 1 59 

CCGGGCAGGTACTGTGTATCATCGCAGTCTTGCTTTTTTGAGTAATGGATTCCTAGATTC 
TATGAGGATACCACAACCACTTTTAAAGAGGTTTCTAAGGCCAGGTGCAGTGCTTACGCC 
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TGGGAGACCAAGGTGGGAGGATCACTTGAGCTCAGGAGTTTGAGACCAACTTGTACCTCG 

GCCGCTCTAGAACTAT 

Sequence 1160 

TCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATTTTCAAATGGTTA 

AAATGGTAAATTTTATGTTACATGTATTTTACCACAATTTTTTTATAAGA 

AAGAGAATCAAACAGAACCTGCAGAAATAAMGACATAATAATTGAAATTAGTTCGATGG 

ATGAATCAAATAGCAGATTAGCTCAGCTAAAGATGACTGTGAATACACAAAATGTAGCAT 

GGTGATACAAAAATTGAATATATCAAACAGAAGTTATGCAACATGAGGACAGAATAGAAT 

GTCCAACATATCTAAATACCATTCAAGAGGGAAAGAAAAGAAAAATGGAGAACCGAGGCA 

ATATTTTGGAGGATNTTATTGCTNGNACTTTTCC 

Sequence 1161 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTTATTCCCATTTTATAAGCAAAGAGGTAGGAC 

TGGAAAGGTTAAGAGCTTACTCAGCGAGAGAGAGCTGGTAAATAAAGGAGTGAAGATTCT 

AACCCAGATGGAATGTCATG CCGAAGCTATTAGC CACTAGAAG ATAATACCTC ACTTGC A 

CATTCTGCTGCTGCTGCTGCTGCACCTGAGGCCTTCACCTGGGCTCGGTGGTGAATGCCC 

TAAGATGGTTCAAAAACTGCATGTGAAAAACTGTTTTAGGCCAAGCTATCAACAAATCTT 

CTGGTGATATCAMCCTGGTGGCACAGATMGAGAMTTTGAMGTGACTTTCAAAAGCA 

GGGTGGAAGCCGTAGTACCT 

Sequence 1162 

AGGTACTCCATAATGGATGTGGGCCAGGGTAAAAGCAGTTGTTAATGTATACAGGAGAAT 

GCTCTCAACGTAAGAGGCTACTCCTAGGTTTACCACTAAGACAACCAAGAAGAGAGGAAC 

CAGCAACCAATTCAAAAGTTGGGACACCGGGTACCTTGCCCGGGCNGGCCGNTCTAGAAC 

TANGTTGGATCCCCCGGGGCTGCAGGNANTTCGATATNAAGCTTATCNNTANCCGTCGAC 

CTCNAAGGGGGGGGCCCGNTACCCAGC I I I M 

Sequence 1163 

CCGCGGTGGCGGCCGAGGTACATTrTGAACAATCTACTCTTCAAGTGGATCATTTACAAA 

ACAAGAACATAATTCCCTGAATCAGTAAATTTTATTTTCATGAAAGGATGACAAAGAAAT 

TCCATTTACACTATCACGATTTACATTTTCCTACAATTCATGCAGCTTTCTTCAATTCAC 

AATCTACAAACCACAACAATGCACAAATTTGATGTTAGAGGATACTTTAATGTTTAGTTC 

TCATCTCAACAGTCACTGTTGCTAGTGTCTCTCATTCTCACTCTTCACTGTTACTTAAAG 

MCAAAGMCTAAAGAAGCCAGAAAGAACTAGATCAAGCGATGAGACCAATTTTCAGGGT 

AAGGCATAATAACATTAGCTAGTCAAAAACTGGAGTATTTCTATCATTTAATCCTGCAAG 

ACMTAACCTNCTGTTTTCCCTCACTTTCCTTCCAATAACTTGACAGAAGTCTGGGGAAG 

GATGGATCCCAAGCAGAAAACCTTCACAAATGTGAAAGAAAATGGAAATCCTAGCATAGT 

CACATGACTAACTAGAAATCCCAGAACTCTGTCAACAACCTTTGTTTTCAATGCAGAAAT 

CATTTTTGTACCTGCCCGGGC 

Sequence 1 1 64 

ATTGGGAGCTCCCCGCGGTGGCGGCCCGAGGTACACTGATCGTTTCTGCAGCTGCAGATT 
CTGCTGTTCGACTCTGGTCTAAAAAGGGTCCAGMGTAATGTGCCTTCAGACTTTAAACT 
TTGGAAATGGATTTGCTTTGGCTCTCTGCTTATCTTTTTTGCCAAATACTGATGTACCTG 
CCCG 

.Sequence 1165 

CGAGGTACGTTGTCAGGACATTTTGTAATTCGGTATTCTTGGTCTCCTGGCCCCAATTCT 
AAAAAAAGTTCACGTTTTGATGAAAAAAAGATATGGGCATTATGGATGTTGAGGAGATCA 
CAGCTCAAAAGATTGACAGGGGCCCCAGGGCATATAAGAAACATATGGTACCTGCCCG 
Sequence 1166 

CGGCCGCCCGGGCAGGTACTGGTCTGNCTGCAGAGGCACCAAGATGATATCTTCCTCCAC 

TGTCAGAGTTATTTANGAAAAATAATCTCTGGCATANGCCGAGTGCTCATAAATACTTCA 

NGATGAGACAATAAAGCCGCTCCCAGGTCCTGGCATGGCCCAACCTCAGGACACTGGCCT 
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acacatttgcaggtagMgcamggcctgcgtcaggcctgtggccagcttttgcctgtgc 

gggaggctctaamtgaagacaatcttactccaggtctccttccactttgttgcctctag 

aatgtcatctccctcccctcctctnggaaaggagtcactccaccccaaccat 

Sequence 1167 

CTCTGCTCTCATTTCCTGCAGATTCTCTGAAGCCTATCTGTGCTTCTCAGTATTCTCTAG 

CGGAGTTGAAAAACGCCCCTTAGAGGTGCACAGTTAATTATAGAAGCTGTTAGCTTTCCC 

ATCTGTAGTCATGCCCTGGAGCTAAGAGAAGGACTTGAATACAGAAGAGGAGATGCCCCC 

TGTACCT 

Sequence 1168 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTACTCAGAGAATTTCATTGCTACTGCCTGTTC 

GTTAACCTAGCAAGTGAGTAAATTGAGGCTTTAATTAAAAAAAAAAACGGTTGTTGT^ 

7TGATATACATTTTGACATAGCTCAAGCTTCAACCCTGCCCAGTTCCTATGCTCAGACCC 

CATCTTCAAGGATGCTCCTCCTGCCTAACTCCTCTCTCAGAATATTGI I I I I ICCTTTCC 

CGTTTAATCCCCAGGCTCAGCTCCTGGGACCAATCCTGGTTCAGGCTCCTGTTGCCTGCC 

TCATCCCACAGCCGTCCATAATTTGGTGGTTACCGTCATGCACCTGAAGTGGTAGTGAGC 

ATTTATAGTAATAAAAATCATAGTCTCACACTGCCATT 

Sequence 1169 

CCGCGGTGGCGGCCGGGCGCCATGGCTCACGCCTGTAATCCCAGCATTTTGAGAGGTCGA 
AGCGGGCGGATCATGAGGTCAGGAGATCGAGACCATCCTGGCTAACACGGTGAAACACCA 
TCTCCACCAAAAATACAAAAAACTAGCTGGGCGTGGTGGGGGGCGCCTGCAGTCCCAGCC 
ACTTGGAGGCTCAGGCAGGAGAATGGCGTGAACCCGGGAGGCGGAGCCTGCAGTGAGCCG 
.AGATTGCGCCACTGCACTCTAGCGTGGGTGACAGAGCAAGACTCTGTATTAAAAAAAAAA 
AAGTAACTTTGAGTGAGGATGCACTTGCCACTAGCTATGCCCTGTTTTCTTGTGACAATC 
ACCGCCAAGAATGATGAAAAATCCTAATAATCCAGATAGATAATCCTTAGAAGTTATTTA 
AC 

Sequence 1170 

CCGCGGTGGCGGCCGAGGTGCTTTC I I I I I CTTTCTTTCTTC I ! I I I I I I I AGCCCTGAG 

ATGTATTTGACTTCTGAACACCTAGAACCCTTTGTTTTCTTCTTTCCTGTATTCCCTGAC 

CCTTTATTCCAGTTCCAGCTTGCTA I I I I I GTTTGATACATTTGCAGATTCACTG AAGAC 

CACCCAGCCCATATATAGCTAGCTCTTCTGTGAGATGTGTATTTAACCCTGAAAAAACTA 

ATGCAAGAAMTAGGAGAGATGACCCGAAGGGGAAAGACTCATAGGATTCTTTTCCCCCC 

TTGGCATTTGGCCAGGGATrrrCATACATTCTCNACCACTTAAGTTGAAGANGGCTTGAA 

GCCTGCTTTGGCTTTTAATMTAACCTGGNAACCCCTTGGCCCCGGGGCCGGGCCNGGTT 

CTTAGAAACTTAAGTGGGGNATCCCCCCGGGGCTTGCAA 

Sequence 1171 

GACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTAACATCAATAA 

GTCGAGAAAATTATATTAACTGAAAGAAAACAAAATAATAGAGAATTTTAT^ 

TTCTAATGTTTCTCTTCATGTTTGGAGAAAAGCTGCCACATAATTAAAACMTTCTTACC 

CTGTAAAACTGATTGTCTTCCMTCTCAGGAGGTTTTACATTAAACAGGGAATATAGAAT 

AANGAAACAGGCCTATGGCCCGGGCTCCGTGGCTCACGCCTGTAATCCCAACACTTTGGG 

ATGCCGAGGCGGACGGATCACCGAGGTCAGGAAATCCAGACCATCCTGGCTAACCGCGGT 

AAAAACCTAGTTCTCTTACCTAAAAAAATNCANNAAAANANAAAAGGAATGGAAAGGAAA 

AGAAAAAAAAAGTTACCTGCCCGGGGCCGG 

Sequence 1172 

CCGGGCAGGTACCCTCCAGGCCCTGTGTATTATTAAGATCTTTTAGTAGCGAGTTGCTCT 
TTCTCTGGGAAATCGGCTGTTAAAGTCAGAGGGAGCTCTTAATAGTTTGCATGGTATTTG 
ATTAAATG G AAC AGTTGG ATC AGT ACCT 
Sequence 1173 

CCCTTGTTGAACAGGCGATNTNTNNACCATGCNCACAATGAAATCAATACTCAGAGAAGG 
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CAGATAATTCTCCACGAAGCCAGAAAACTAATAAATGAACAACTTGGGTGAAATGTNCCA 
CCAG ACG G N GTGATATTTAGTAG CCCNTAAAGCTGCCANGGGGTTGAATG AC ACTATCTG 
MGATATGMCCANTTTGNTCTCCATAGGGAGGATTTTATCAACAGGAAACANATGCCTG 
GAAGGCATTGGATTT 
Sequence 1174 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGATCTCACTCAATCTTACTCCCTTGTA 

GAACAGGCGAtATCTTCACCATGCGCACAATGAAATCAATACTCAGAGAAGGCAGATAAT 

TCTCCACGAAGCCAGAAAACTAATAAATGAACAACTTGGGTGAAATGTCCCACCAGACGG 

TGTGATATTTAGTAGCCCANAAAGCTGCCAAGGGGTTGAATGACACTATCTGAAGATATG 

MCCAGTTTGCTCTCCATAGGGAGGATTTCATCAACAGGAAACAGATGCCTGGAAGGCAT 

TGGATTTGCTMGTGTCTATGCCATATGATTCTGCTGTTTGCGTTTGATTTAGAATGCTG 

AGCTGACTCAAAGTCAAACCTA 

Sequence 1175 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGNCNGGTACTGAACTTATT 
TTAAGGTCTCCAATTTATTTGAATATAAATTTGTTC 

TATGTGTATTCTGATTGTTTATCTGTGTTTGGCATATAATAAAAGTTGATTGAGCTGAAG 
G ACTTA AAAACCTGMNCACACCTTCCATGCTTTTCATTTGCTTTACCTNTGCATGGAAC 

ACTTTTAATCTGTCTCTCCTGTGTGAAACTTCTCTGAAACCCTTACCAGCTGNCTGTGCT 
GCTACTTGTCTG NTGTACCT 
Sequence 1176 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTACAGGGGACCGCCAGGGGCCT 

CGAGAATCGGTATCCTGAGTCCTCTTGAAGAGCAGTAGAGGTTGTTTCATTAAGTGCAAA 

CACATTGTTCTTAATTTGAAAACTGTGGGCAGAMCAGAAGCCCGAGACTMTTTTTCCA 

TT GCTA ACTCTAGATTCTCGGCCACTGGAGTCTGAAGATACTCTCTTTGAGAATGCATAT 

TATTTT GCTCAC AGCTAAAACATTTAAGTATCATAGCTGATCAGTGGAGTGAGATTAAAA 

GGTTTCTTTTTTGAATCATCAGCTAGAGATGTGCAAGGGAGACAGCAATGCAGGGTGTGT 

TTCAGAAGAGCTTGCTGAGGTGCTCGGCTCTTAGCATTTAAAAATGTGATNGTTGGTATA 
TCATTC 

Sequence 1177 

CCGGGCAGGTACAAATATTGAAGGGGAAGAAMTATAATTGTCCTTTCTATCTTTTTTCT 

CATAATTTAGCTCTTTCTCCACCACGTCATGCAGCTGAATTGAGCCAAGCATTAGTGGTA 

GGGATAAAAATAATGCCGGAATCACTCTCCCCCATTTCCAACTTCCACTCTCTTCATGTG 

CGCAGCTGCACTTCGTACCT 

Sequence 1178 

AGGTA CAATACTAGAGMGGGAGAGAATATAGTTAGGACAATAAATTATTGAACAAGTGA 
I I I I ICTGTGATGACTATACCCATTTTCCAAAAACTAATTTCATTTTTGGTGTGTGTGTG 
AAGAAAGTTTGGAGGCTTCAGTTTCAATCACTGTGCTC 

ACAGAGAAATTGTTTTTCTGGAAATTGGCAAGAGGAAGGAAGTAATAATAAAAGATGACT 
TATGCAGATCCCATGTGACCATCTTACATTCTTACTCTTGTGCCACTGTTTCCCTGGCAG 
TCCGTTACTGGTTTAAAGAGCACTGGGATATAACTCTAGAAATACGGATTATTTTCTCAC 
CCTCTGACACTCATTCATGGCGGAGCCTTAACCtTAGTGCCAAGGTTATT 
Sequence 1179 

TACCCAATTGTCCCAAAMCCTTTATTAACAATCAAAATCAACCTTGACTCCACTCAACT 

ATTATTCCAACTTCACCTAAAACTAAATTCACAANGTGTATATGGGTCTATTTGGGAATC 

TNTGTCCTGTCACAC TGGC CTGTTATTCCTGAGCTGAGCACTGTAGCATGGACCANACCC 

CACACACCTCTCTTCTTTTGCCAGAGGCTCATGGCACTTCTGTAGGTCA l I I I I I I ICTT 

TTTTGAATATACCCTTGGTACCTGCCCGGGCGGCCGCTCTAGAACTAGTTGGATCCCCCG 

GGCTGCAGGGAATTCGATATCAAGCTTTATCGATACCCGTCGACCTNGAGGGGG 

Sequence 1180 
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GGTACATGGAGGCAAAAMGCAATCAGTATGACTTGAACTGCTGGGTTTAATTACTCAAT 
ATAACACTTGCCATTTAAAAATCCATATGCCCATCAGCATGGCAACAGTCTCTCATAAAG 
ATTCCGGTATCATATGGCACAATTTGTACCT 
Sequence 1181 

AGGTACATCTCTAGCTGATGATTCAAAAAAGAAACCTTTTAATCTCACTCCACTGATCAG 
CTATGATACTTAAATGTTTTAGCTGTGAGCAAAATAATATGCATTCTCAAAGAGAGTATC 
TTCAGACTCCAGTGGCCGAGAATCTAGAGTTAGCAATGGAAAAATTAGTCTCGGGCTTCT 
GTTTCTGCCCACAGTTTTCAMTTAAG 

ACTGCTCTTCAAGAGGACTCAGGATACCGATTCTCGAGGCCCCTGGCGGTCCCCTGTAAG 

TACCTGCCCGGGCGGCCGCTCTAGAACTAGT 

Sequence 1182 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACTGATCGTTTCTGCA 
GCTGCAGATTCTGCTGTTCGACTCTGGTCTAAAAAGGGTCCAGAAGTAATGTGCCTTCAG 
ACTTTAGACTTTGGAAATGGAtTTGCTTTGGCTCTCTGCTTATCl Mil I GCCAAATACT 
GATGTACCTGCCCG 
Sequence 1183 

TGAATNAGGGGATAACGCAGGGAAAGAACATGTTGAGCAAAAAGGCNCAGCAAAAAGGCC 

AAGGGAAACCCGTTAAAAAMGGGCCCGCGTTTGCTNGGCCGTTTTTTTCNCATTAGNGC 

NTCCCGCTCCTACCTTGTACCGTAGCCATTACACCAAAAAAATTCCGACCGTCTCAAAGT 

TCNAGNAAGNGTTGGGCCGANANACNCCCGAACCAGGGACCTTATTAAAAGGAATACCCC 

AAGGGCCGT 

Sequence 1184 

NGGTGGGCGGCCGAGGTACCTNTCCACAAGCCTGAACATGGAGTAGATGCATGGCACGGC 
CAGATTAATGCAGGACGCAACGAAAGGCAGTAACAGCACCGCCCCAGGGTTACTGTGTGT 
CTTCAGGAACTCTAAGTTGTACCTGCCCG 
Sequence 1 1 85 

ATGCATGGCACGGCCAGATTAATGCAGGACGCAACGAAAGGCAGTAACAGCACCGCCCCA 

GGGTTACTGTGTGTCTTCAGGAACTCTAAGTTGTACCTGCCCG 

Sequence 1186 

AGGTACAAAGGGAATAGATAACAGGGACATTGATCTAAAGGGAGGTTAGGGAGGACTTCT 

TGGAGGAAGTGGTATTTGTGTCTGAGTTATATGGGATGTGTGGGATTTCTTAGGTGAAAG 

AGGGCAGAGCAATACGGTGGTAGGCTAGATCTGTGCAGCCGGCTTATTAGACAAGACTTG 

TTACCCTGCCCGGGCCGGCTCGCTCTTAGTAACNTAGGTGGNATTCACCCCGGGGCGTGC 

CAGGGAATTTCCCATTATCAAAGNCTTATTCGCATTACCCGTGCGAACCTCNTAGGGGGG 

GGGGCCCCCGGGTACCCCAGNCCTTNTTTGTCTCCCNTTTTMGATGMGGGGGTTTAAA 

TTTGNCG NCCGCCTTTGGGCNGTAAATCATTG NGTGCAATAAGACTTGTTTTN CCTGGTT 

GNTGGAAAANTTNGTTTAATTNCCGCCTTCAACCAATNTTCCCACCANCAAACNANTAAC 

NGNAGGNCCCGGGGNAAGGCCATTAAAAAAGNTNGGTAAAAAAGNCCCTGGG 

Sequence 1187 

NTCGGNCGTTCCNGCTGCGGCGAAGCCGGTATCANCTCAACTCAAAAGGCGGGGNAAATA 

CCGGGTGTATTCCCACCAGGAATTCCATGGGGGGAATAAACCGNCAGNGGNAAAAGAAAC 

AATGGTNTAGTCAAAAAAGGGCCCCANNCCAAATAAGGGGCNCACGGTAAAACCCGGTAA 

AAAAAANGGGCCCCGNCGATNTGGCCTGGGGCGG 1 1 1 1 1 1 1 1 I NNCCAATAAGGGCCTTC 

NCCGGCTCCCCCCCCTTGTMCGGAAGGCCAATTTCAACCNAAAAANAAAATTCCGGAAC 

CNNCCTTTCAAAAGGTTCNAANGAAGGGGTTGGGGNCGNAAAAAACCCCCCGGNACCAGG 

GGGAACCTTAATTTATANAGGGAATTAACNCCAAGGGGGCCGGTTNTTTTCCCCCCCCCT 

TGG 

Sequence 1188 

CGAGCGGCCGCCCGGGCAGGTACCTTGGCAATAAGTTGCTAGTTATCTCAGCCTATAAAA 
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TGTAGGGCATATGGACATTAAGATTATAAATTAGCTTTGGTGTMTCAAATTTTATAATT 
TGTAAACTGGACTATGTTGTGCATCCTTTCATAGTTTAGGATGATTMGAGTTTGGACCT 
TATAAGTAAAC TGCCACATTTTGAATCATAGGCCCCAACTGGGTACCT 
Sequence 1189 

GGCGGCCGCCCGGGCAGGTACTACTGACACTGAGCCAATGTATGCTATCAAGGAAAGCTT 

TATCTGTCACTGAGCAAAAGGTGMGTTCAATTAGGTCAGTTTTATCACTTCTTTTCCTA 

CACACAAACTATGAGGAGAGATCATTTCTTTTCTTC 

GACAGGGTCCCACTCTGTCGCCCAGGTTAGAGTACCT 

Sequence 1190 

NCACACAACAATACAGAGCCCGNNGTANCCATTATANTNTGTTAAAANGTNCTNGGGGGT 

GCCCCTAAATGGAGTTGTAGCCTAACTCCACMTTTAATTTTGCCGTTTGCGNCTTNACN 

TGCCCCTCTTNTCCNTGTCNTGGTAAACCCTGTNNTTTGCCAAGCCTNACAATTANATGN 

AAANTNTGGCCAAACNGTNTCGTGGAGTAGGAGNGGCGGGTNTTGGCCGGTAATTTGNNG 

CCGCCTTCTTTTCCCG 

Sequence 1191 

AGGCCAGCAAAAGGCCCANGAAACCCG1TAAAAAAGGCCCGCNGTATGCCTGGCCGTTTT 

TTCCATAAGGCCTCCCGCCCCCCCCTTGACTGAAGNCATTCACCAAAAAAATCCGGANNG 

CCTCAAANTTCAANTANGGGTGGGGCGTAAAAACCCCCGGACCAAGGGAACCTTNTTAAA 

MGGAATANCCCAAGGGCCGGTTTTTTCCCCCCCCTTTGGGAAAAGACCTTCTCCCTTCN 

GTTGGCCGGCCTTCNTTCACCTTGNTTNTACCCCGMCCCCCCTTGGCTCCNGCNTTTTT 

ACCCCCGGGAATTMCCCCTNGTTCCCCCNNCCCTTTNTTCTTCCCCCCTTTT 

Sequence 1192 

GGGCGGCCGAGGTACCCATCCAGGGCTCCAACATGAACCACACAATGGACCTTAGGCCCA 
TTCCAGCACACAATAACTCAACAAAGGATGCTTAAAACATTAAAAGGATCTTTACTATCC 
TCTTGGCAAAGTACCTGCCCG 
Sequence 1193 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGATGTGGGCTGAG 

AGAGCCAACCAAATCCAAGTGGGGTTTTGTCAGGAACGGTATTGGAAACTCTGAACTAGA 

TAAATAATTCAAAGGATATTAGCTAAAGTGATCAGGGACCAGGACAGGAAGGTACCTGCC 

CGGGCGGCCGCTCTAGAACTA 

Sequence 1194 

GGCGGCCGCCCGGGCAGGTACANGCATCGCTGGTGGTTTCAAAAAACGTAGTCATTCTTC 

TCACTGCAACMTGTAAGATAAGCAGGGTAGATCTGTTATTTCCAAATTAAAGGTGATTA 

AGATATATGGAGAGAGAACATGGCATGTGAGGTTTATAGGGCTAGAAACTGCAGAACCAT 

GTAGAACCCACATTTAACTACAGTACCT 

Sequence 1195 

CGCGGTGGGGCGGCCCGCCCGGGCAGGTACCCTCCAGGCCCTGTGTATTATTAAGATCTT 

TTAGTAGCGAG7TGCTCTTTCTCTGGGAAATCGGCTGTTAAAGTCAGAGGGAGCTCTTAA 

TAGTTTGC ATNGGTATTTTG ATTAAAATGGAACAGTTGGGATCAAGTAC NCTCGGN CCGC 

TCTAGGAAACTTAGGTGGGANTCCCCCNCGGTGCCTGCAANGGAAATTTCCGAATAATTC 

AAAGGCCTTTATTCCGGNATTACCCCGGTGCNNAACTCTACCGGAGGGGGGGGGGNGGGG 

CCNCCCNGNGNTTAACCCCCCAAGACCTTTTTATTGGGTNTCCCCCTTTATANTANAGTT 

TGGAAGGGGGGGTTTTAAAATTTTGGGCNGNCCGGCCNTT^ 

CAATTGGNGGTTCCAATNMGGCCTTTGGTATTTTTCCCTTNGGTGGGTNGGAAAAMTT 
ATTGGTTTTAATTTTCCCCGNCTTTCTACCNAAATTNTCNCNAA^ 

NNCNGTAANTCCCCCGGGGGGANGGCCANTTAAAAAAGNTNGNGTANAAAAAGNCCCTTG 
GGGGGGGNTGGCCCCTAAANTTGGAAGGTTGGGANGNCCCTTAAACCTTCC 
Sequence 1196 

TCGAGGTACGCGGGGGGATGTGGAGAGGACCATGTGAAGAGAGAAGCTGAGACTGAAAAG 
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GATTTATGTATTAATATTGACAGAAGCCAAGGAACACCATCTGAAGTTCTGACGGCAACA 
TCAGAAGCTAAGAGAAAAGGCATGGAAAAGATTTCTCACCTTAGTAGCATNCAGAGGGAT 
GAGGTTGGNTCNCTGCANNACAANCNCTNTGNTNTTATCTTGACCCTTCTNGACCCTTCNI 
CNNAAAACCTTGTAGNAGNGNGAAAAGAAAAATTTTTCCTTC 

CCANCAACCAGGCCTTTGGTNGNGGTTACCCCTTGCCCCCCGGGNGNCNGGGCTNCGTCN 

TrCTTTAGTAAAACNCTAAGGJTGGNGNNATCCCCNCCCCCCNNGGNCNTTGGCAANNGG 

AMNTNTTCNNAATTMTTTNNAAMGTCCTTTTATTTCNGAATTACCNCCGNTNCAGAN 

CNCNTTCTGATAGGGNGGGGGGGGGCC 

Sequence 1197 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATGCAGAAGATATACACCATCCT 

GTTGCCATATATCTAAG I I I I IAACAAI I I i I I CTTAATATCTATCTCCTCCCC AGATTC 

TGAAACAATAGATGCACAAATATGAAACGATTAAACTAACTTGATTAGCTGAAGCCAATA 

ATAACTACCCAAACAAAAGCTGAACACTTAAACCTACTCAGCCATAAACAGAAGAGCTTG 

AAAGCTGCAGGCTTTGTCCCTAAAGCTAGGGCCTGCTGTTGTGGTGTTAACATCGAGAAG 

CTTGAGATAAAGGTGTCTGTCCCATAGAAGGCTCAGTTAGGTAGGCAGTATCATTNGCTC 

ATTTGCTCCAGCCAGTNGGAGGGTAAGATTTrCCATTCTTAATTTTCTCATCCACTAGGA 

TCATCAGGTAGAGAATGGAGAGATTGGGGAACC 

Sequence 1198 

ACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGAACTTG 

TGTGCTTAGACCTGACGCTGGGAGGAGATGCTGCCACCTAGGTTACTTGTAGGACCCTAT 

ACGGCAACCTCCTTTGCCAGGAACTATTTATAAACATCCTGCAGGAAAATGGTAAGCCCT 

TGGTTAAATTTATTGGCTTCATTTATGTATC 

AGGAATAGAAACTTACTTTTGGAAATACAGTACCTGCCCG 

Sequence 1199 

AGGTACAGGCAAAACACATTTCTTGTTTCACACTTCCTTATAACACCCAGCAAAATACCA 

GGGAAAAATAGCAAATATGAATCAATCTTTAGTAAGTTTTATTCTAAACAAAAGGAGTGA 

CTGATGAATCACATGGNAACAGGAGGGGGATTGAATTGTTTTTGTTTTTTAAAATGGNGA 

TNCAAATTCCAAGGGAGGAGAAGATGAAATTATTACCTAAAACCCAGGCTTCTCCNCAAT 

TATTGNATCCCTTAAAACCTTTAGNCAACCCA I Nil I GCCCCGCCCTTGTTGGAAACAT 

GGCTTCTTTTTNCAAGGAACCTCCGTTCCAAATTGGGGTNGTTGGAAAAGNTGGCCTTTT 

TCCTTGGAATGGTTGGTTTTCCCCTTTAATTCAAAACCCCCCACCAATTCCTTTTGGN 

TMGNCCAACCTTCCCCTTAATTAACCT 

AMTTTTGGGGGGTGGGTTNCAACCNCNTTTCCCCTTGGTTAAGGGGGMTTGGCCTTTA 

AACCCCCCCCCAAAAAAATTTTCNCCC 

Sequence 1200 

AGGTACGTATCTAAACCCCGTGAGCTAGGCACTCCCCACCCATCTGCATACACTTTGTCA 
TCTTTGCAACTACAGCGTTTAACAAGGGCTGGGAGATTTCAACTAAAACAGACAGAATGA 
ACTTTCTTTTAGGGATCTTAGAACATTTTGCATGGATAGGTC 

ACCTATAGGGATAACTGTCTTTCTAAAGGTCTAAAAACAAACTGTCCCCAGTTCATTAAC 

CAACATATTTATTGCCACTAGAAATCAATTTTCAAAGAGTGGTAAAAATGACAGGGACTG 

TAGCCACTGCCCAAAACCCGACTGACAGGGTCTCAGCACTAAATCAAGTAAAATTTAATA 

AGTGTCACTGTCCCAGGTGTGACAGCTAGGATA 

Sequence 1201 

CCGCGGTGGCGGCCGGTGCCACACTGGGCCAGGGTAGTCATCCTGAAATACATGTCCAGG 

GTCTTGTTTGTCTATGATGTGGGTGAAAGCTGCCTCAGCCCGCACCACAGTAGAGAGCGG 

GACCACCTCACGAAAGTTTATAGCAAACTCCCAGAGTCTAACCTGAAAGCAGCCAGGAAC 

AAAGACCTTTCCAGAAAGAAGGACATGAACAAACGCTTAAAGAACGACCTGGGCTGCCAG 

GGTAAGAACCCTCAGGAGGCCGAGAGTTACTGTGCACAGTACCT 

Sequence 1202 
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CGACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGACGTATG 

TTGAAAGAGGTAMTAAATGTATGAAAATATAAAGCATTCAGAAGACAATTGTCCATAAA 

AAAGAACCAAGTGGGTATCCTGGAAATTCAAAATACAATGTCTGAAGTTGAGAATTAGAT 

TTGTTACAAACCAGATGGCACACATCAGAAATCTGAATGAGAAAACTCAGAAGCAAGTTA 

ACAGAAAATATCCAAACTGAATCACAGAAGAAGACTAAAAGAGAGCATGGGAGTTATGTG 

AG AC ACAGAC AAAAGTTC C AAC ATCGATGAAATTGGAATC CTGG AAGAAGAGAGAGGGGA 

AGGG CAAGAGAAATATTTAMGAAAAAAGTAATTG AGAATTTTC CAAATCTGATGAGAAA 

CTTTAAATCACTGAATCAAAAAGTTCACTAAGCCCAACAGGGTGGACACAAGTGCACACC 

CAAACNTTCCACACAAAACCCATCCAAGTACCTCGGGCCGCTCTAGAACCN 

Sequence 1203 

M M M M I I I I I NTNTTTTCGMTGACAGCCTANTCTTTATTGCAGGTTTTTTATTTTA 

ATTTAATACCAGAGAGCCCTAAGCCAGGATTTATTTTTCCCATATGAAAATATTTNGGTC 

TGACTCTGAAGTCTCANANCGGTNGCTGAATTCTTCAAAGTGAAGAGTGAAAAGCTTTGT 

MGTTTCTTCAAGGCATTTCATGTGTCATTGTTTTNTCCTCCCAATAGANANTGAANGTA 

ATGGTGATGTCCNCCAGCACATCACCTGGATTAAATTrNTGCTGNGCANGAAATGCATCN 

GCACCTTTACCGTNACCTAATNAATTTAACNGCCCAGGCCCNTGGGNAANCAATNCCNGC 

MGGTTCNCATTTTTTCTTNACCAAGAANGTTTNTTACTTTTCAGATCTCGAT 

ATTGNTGGTTTGCCTTGAAGTNCCCTNGCCGNTTCTANAACTANGNGGGATNCCCCCGGC 

TGCAAAGGAANTCNAATTTCTAANCNTTANTNNANTNNCCGGCAAACNTGAGGGGGGGGG 

C 

Sequence 1204 

CCGCGGTGGCGGCCCGCCCGGGCAGGTACAGACTTTATTCTCTATTTCAGGGGCCTTGTT 
TATTAAGCAGACAGAACTCTGCTTGATTATTCCATGAAGTCGCCAACCATTAATCATGCA 
CATTTGTTTAACAGGTCAMTGAAGAGAGGAAATTTGTGTGCAGTCACGGCAAATACTGC 
CTC ATTTT ACCC ACTGTG ATACTAAAGGTAC CT 
Sequence 1205 

CGCCCGGGCAGGTACCTCTAGGCAATTCATTACAGTAGCATTAAAAGACACCAAGGCCAA 

ATGCTTTGGGGTCACTCTTCTAAAACACCGTGTCTAGACAAGAGTCATCAAGCAATGATT 

GCTTGACTGATTTTTTAAAATTATAAGTCACACAAACACACAGTCATCTAATAAAAAAAA 

ACTGATTTTCTTAAGATTCTCAGAGTGGTAACACTCACAACTGTCATGTCAGCAAGTGGA 

ACTTTCTTCTAAGTAAACATCAGAATTCCAATCAGCCTTTGMCCACTGCAAATATTTCA 

CAGTCTCAAAATTCACCTACCTTAAAACCAAGCGCGTACCT 

Sequence 1206 

ATTGGACTCCACCGCGGTGGCTGCCNACGTACCATGTCTGCACCAANAGTGGCAATAAGG 
AGGAGGTGGCATGTGGTGGACNGANGGANAGCCANATTCTGTTTCAGGTTACANCATGCA 
AANAAGGGCGATGAGAATNTTTGCATTATACATA 
Sequence 1207 

CCGCGGTGGCGGCCGCCCGGGCAAGGTACCACTATTTGAGTTCTGCAGTTGGCCAGGGTT 

GTGGGAGGGGCCTCAAAGTTAAGTGGAGACACTGAGATGGACAAGACCAGATGCTATGCA 

CAATGGAAATGCATGGTTGAAGTTTTCCTTCCTGCCTGTGTGGGGCCTTAGGGCCACAAG 

GGCAGCCATTACTCAGCACTGCACTGACCTTGGGAATTTGCTGGGCAAGGAAAATGACGT 

GGCCCTCATCATCGATGGCCACACCCTGAAGTACCT 

Sequence 1208 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAACGATCTCTTTTGTGGTCTTCTATGTCAACT 

CTCAAACTAATCTCCTACAACTTTTCCACCCCCCACGCACCGTGCCACAAATCATCTGTT 

ACCTAACACATAAGTGACAACACAATTCTGTTCCTGCCATTTTCTGCTGAAGATTGTTCC 

AATTTACTTCTCTGTTGACATATAAAGTAGCCTTTAAAGCTTTTAAGGTCTTTA^ 

TAAAGTAGCTTCAACATGATTTTGATTCTGCTTTTTATTCTCATTAAAATGAAA 

TGGGATTGAACAATGAGAGNAAAATAATAGAAGAGTAGGTGGTGAAAAATTAAAAAAGTA 
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GGGCATAAGCAAAGTNGTCCCTATTGGTCACAAGACCGTCAATTAAANGGCAATGAAAAA . 
TACAG AAGC NTG AAATGGTGG AATTTA 
Sequence 1209 

CCGCGGTGGCGGCCGAGGTACTGATGTGGGCTGAGAGAGCCAACCAAATCCAAGTGGGGT 
TTTGTCAGGAACGGTATTGGAAACTCTGAACTAGATAAATAATTCAAAGGATATTAGCTA 
AAGTGATCAGGGACCAGGACAGGAAGGTACCTGCCCG 
Sequence 1210 

TCACTACTTAGGGCAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGACCATAGATAT 

CCCTGGGGGATGTAGTGGATTCAAAACATGAGTTGGATTCTCCAACGTGCCATCGGAATT 

CTTCTAATATTCANCAGCTTTGATTGTTGTTTTACATCTGACCTGTATTTAGATTTCTC 

GATTCCTGGAATATTTGTGGAAAATAAGTACCTGCCCG 

Sequence 1211 . 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTATGCTGTTTGGATTACTACAGCTTTGTAGTA 

TATTTTGATGTCAGGTGGTATAATGCCTCCAACTTTGTTTTCTTTGATCAAGATGGCATT 

GGTGCCAGCCTCGTGGCTCACACCTGTAATCCAAGCACTTTGGGAGGCCAAGGCAGGTGG 

ATAACTTTGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGGTGAAACCCCATCTCTA 

TCAAAAAATACAAAMTTGGCTGGGCATGGTGGTACCTGGAAACCCAAGATTCTTGAATT 

TACCTCCTGAATCTTCATTGCTTGTAAGCTGAGTTTCTCAGCTCTACTGTAACTTGCTTC 

TTAAGTTTCTAGTTATAAACTTCAGGTTGTAGACCCTGCCTTACTTAGTTGCAAAGAAGT 

TGTGAAAATGAGTGT 

Sequence 1212 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGAACAAAGGATCTG 
ANAAAAGTTGGGAAAGAGTANCTTGGGTCAGGAAAGGAGGAAGGTAGGTGGCACTATGGA 
AGTTGACCCTGTCAAGAAGAGAATGGTCANAAATCAGGAAGGGAACTTGTCTTCTAAAGA 
ATAGGTGGCTAANAGAAAGTAACAGTGACCTGAGGTGAAACCAAGTGGGTAACCATTGTA 
AACTTGTGAGTGCTGGACTCTGACAGCNCCCTGGGGTGTGGCTGTACCTGNCCGnnnnnn 
nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTAGGGAATTCNANTATAAANNTTTATCAA 
TACCGTCTNACCTCTTNGGGGGGGCCCNGTTACCCANCI I I I I GTTTCCCTTTTA 
Sequence 1213 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTAGGAAGGTTATTGC 

ACAGCTCCTTAACTGATCGGGGTCAGGGCAGAGTGGTCACTTTCCCCACAGCCAGGCTCT 

GGATTTGCCTCCTGTGAAGACACCATGCCTAGCACAGGCTGACGGGGCGGCTGCAGTCNA * 

ACCTTGCCTCCAGATTNATGAAACCAGTTAAGTAGCACAATTTCTCGTTGGCTACTTTCA 

CTTCAAGAGTGTCATGTTTATTGATGTGGAGCT 

Sequence 1214 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTATTGTCTCTCATCTTGTCCAT 

CATGTTGTTTCTCCTTTTTAGGCGTATATATATCTATGGAAGAAAAMGTTTAAATAG^ 

ACACTCATTTGATGAGGMTTTTGGTGTAAAGAAAGGGTAAAAATGGGAAATNNATAGGA 

AAAAAAAGTTTNCT 

Sequence 1215 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGACAGCAACTTC 

CTTGATCCCTGCCACGCACGACTGAACACAGACAGCAGCCGCCTCGCCATGAAGCTGCTG 

ATGGTCCTCATGCTGGCGGCCCTCCTCCTGCACTGCTATGCAGATTCTGGCTGCAAACTC 

CTGGAGGACATGGCTGAAAAGACCATCAATTCCGACATATCTATACCTGAATACAAAGAG 

CTTCTTCAAGAGTTCATAGACAGTGATGCCGCTGCAGAGGCTATGGGGAGATTCAAGCAG 

TGTTTCCTCAACCAGTCACATAGAACTCTGAAAAACTTTGGACTGATGATGCATACAGTG 

TACCTGCCCG 

Sequence 1216 

GGAGCTCCCCGCGGTGGCGGCCGAGGTACCTGCAGGCCTCCTACACCTACCTCTCTCTGG 
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GCTTCTATTTCGACCGCGATGATGTGGCTCTGGAAGGCGTGAGCCACTTCTTCCGCGAAT 
TGGCCGAGGAGAAGCGCGAGGGCTACTGAGCGTCTCCTGAAGATGCAAAACCAG 
Sequence 1217 

^^^ ^^^ATGTCATCTCCC CATCCTCTTGAGAGTGTCTGAGG AGGCCTCTTTTC 

CTTTTTTATTGCAATGGCAAGGTTGGAAGAAACTGTGACNAGTAAGAGGCAGAGACCCAN 

AGCTGAGTGTAGCATCATGTCTTCTAGAGACTCTCNCNNAANAAANCTGACTTNGGCCAG 

TGCTNTGGTTGGAAATGTATTCTGGATCCCCGCAGTACCTGCCCGGGCGGNCGNTCTAGA 
ACTAGTGGA 

Sequence 1218 

GCMTTCTNCAGTGGGAGTAAACCACAGACTACGAGGGAG7TTGACAGCTATTAAAACCA 

GGGCTCCACAGTTAGGAGGTAGCTTTGCAGTTTGGGGAGGGCTGTTTTCCATGATTGACT 

GTATTATGGTTCAAGTCANANGAAAGGAAGATCCCTGGAACTCCATCACAAGTGGTGCCT 

TAACGGGAGCCATACTGGCAGCAAGAAATGGACCAGTGGCCATGGTTGGGTNAGCCGCAA 

TGGGTGGGCATTCTCCTAGCTTTAATTGAAGGAGCTGGTATNTTGTTGACAAGATTTGCC 

TCTGCACAGTTTCCCAATGGTCCTCAGTTTGCAGMGACCCCTCCCAGTTTGCCTTCAAC 

TCAGTTACCTTCCTNACCTT 

Sequence 1219 

CCGCGGTGGCGGCCGAGGTACAGGTCCGAAAGTATGAAGAACACAGAAGGCAGGCCAGGG 

GCACTGTGAGATGGTAAAAGAGATCTGMGGGATCCAGAATTCAAGCCAGGAAGAAGCAG 

CMTCTGTCTTCTGGATTAAAACTGMGATCMCCTACTTTCAACTTACTAAGAAAGGGG 

ATCATGGACATTGAANCATATCTTGAAAGAATTGGCTATAAGAAGTCTAGGAACAAATTG 

GACTTGGAAACATTAACTGACATTCTTCAACACCAGATCNCGAGCTGTTCCCTTTG 

Sequence 1220 

ATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGTGGACGGCGTGGAGGTGCAT 
AATG CCAAG AC AAAGCCGCGGG AGG AGC AGTACTC AG NTAAAAT ATTTCTTCATAAATAG 
TTACAAAACCTGCCAAAACACAAAGGGGAAAGTATTTTT^ 

AATGTNACNCAACACAAGAAACAATGCTTAGAGACATGTTATTGTTTCCAGAAGAAAATG 

GTCAGTAAACTACTTAGCTGAAGACAAAAG 

Sequence 1221 

ACTACTTAGGGCAATTGGAGCTCCCCGCGGTGGCGGCCGTCGCACTCATTTACCCGGAGA 

CAGGGAGAGGCTCTTCTGCGTGTGGTGGTTGTGCAGACCCTCATGCATCACGGAGCATGA 

GAAGACGTTCCCCTGCTGCCACCTGCTCTTGTCCACGGTGAGCTTGCTGTANAGGAAGAA 

GGAGCCGTCGGAGTCCANCACGGGGAGGCGTGGTCTTGTAGTTGTTCTCCGGCTTGCCCA 

CTGCTCTCCCACTCCACGGCGATGTCGCTGGGATAGAAGCCTTTGACCCCCGCGTACCTN 
GGCCG 

Sequence 1222 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGATACCTCACCGTGGCTGCTGT 

CTTCCGTGGTCGGATGTCCATGAAGGAGGTCGATGAGCAGATGCTTAACGTGCAGAACAA 

GAACAGCAGCTACTTTGTGGGATGGATCCCCAACAATGTCAAGACAGCCGTCTGTGACAT 

CCCACCTCGTGGCCTNAAGATGGCAGTCACCTTNATTGGCAACAGNACAGCCATCCAGGA 

GCTCTTCAAGCGNAT 

Sequence 1223 

CCGCGGTGGCGGCCGAGGTAC i rTTTTTTTTTTTTTTTTTTI I GTTGCAGTTAGTTTATT 

GAAATGGAAATGAATTTGAGGTTTCAACAGTTGTCTAGCAGTTTCCACATAAAAAGACAA 

CACANGAAAGAAAAGTGGTTTCTAGCAATCAACAGATGAGTTCTGGCCATAGTCTGTAGC 

CCTNTGAGCCAAACGCCrrGGGTAAAGTTAATTACTCTTCATATTACACCAAATGCTGTC 

GTACCTGCCCG 

Sequence 1224 

CCGCGGTGGCGGCCGGAAGGAGGATGGTATCACTCAGGCTCTCAGGGTGACACTGAAGCA 
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AGACACTCATGGGGTAGGACATGACCCTGCCAAGGAGTTCACAAACCACTGGTGGAATGA 
GCTCTTCAACAAGACTGCGGCCAACTTGGTAGTGGAAACTGGGCAGGATGGAGTACCTGC 
CCG 

Sequence 1225 

AGGTACACTTTTGGCCAGGGGACCAAGCTGGAGATCAAACGAACTGTGGCTGCACCATCT 
GTCTTCATCTTCCCGCCATCTGATGAGCAGTTGAAATCTGGAACTGCCTCTGTTGTGTGC 
CTGCTGAATAACTTCTATCCCAGAGAGGCCAAAGTACCTGCCCG 
Sequence 1226 

CCGCGGTGGCGGCCGTGGTACTCCATCCTGCCCAGTTTCCACTACCAAGTTGGCCGCAGT 
CTTGTTGAAGAGCTCATTCCACCAGTGGTTTGTGAACTCCTTGGCAGGGTCATGTCCTAC 
CCCATGAGTGTCTTGCTTCAGTGTCACCCTGAGAGCCTGAGTGATACCATTCTCCTTC 
Sequence 1227 

AGGTACTTTTATTAAAAAATAGTCACGCAGACAGTGCCCTGGTGGCTCTGCCCCGCATCC 
CAACTCTGGGGTGGGGGAAAGGGGTCAACGTTTTCGCAGCCCCAAACCGGGCCATCACTT 
GCCCACCGAGTCGAATATGATGCGGTTCTGCTCGGCGCGCTCCCGCGTACCTGCCCG 
Sequence 1228 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATAAGTTCCCTATGGCTTATGGA 

GAGTTATTTATTAATTAACTTTATGGTAGGGCTAGTATGAATACCTTTTTM 

TGCTATTACAACMTGAAGATTCAMTGACTCCGCTTTGMGGATGTTTTCTCTATATGG 

TAAMTATATATGAAGAAGTCTTGATTACNGTGGMGATCACTTGACTCAGAATACTTCA 

ATGTATTTTGTTCACATTACCACTAAGCATATTATCAGTAAACTATTAACTGACTGCACA 

TTATGTAA 

Sequence 1229 

GGAGCTCCCCGCGGTGGCGGCCGAGGTACTCCCTTGTGGATAAACGCTTCTAGTTCTTGG 

CGTGTTTGGGTTGCTTTTCCTCCTAAAAACTGAATGGAACTAGAACTTTCTAGGAACCCT 

TTC AAG CTGC AGTAATG ATTTACTG AAGCAGAAAGAAC ATGGTTATTTTGAGTCTC AC N A 

NAATAAAANANANATAAAAGTNCANATTTNCANCTNTTTGAATCAGGTTCAGGAAAAAAG 

CAAACATGCACACCTAAAACTTAAMGACCGTTTCATCACTATGCATACGTTTATCATTT 

GACATCCATCAACTGTATACCTGGTTTCAAAAGTAAATTTAACTTGTGATCTCAGCATAG 

CTCATGTNCACATTTCATGCANAGGGTCCAGGAAACTCAAATCACACTTTGGTAAGTCAC 

CATGTTCACTCATTTTT. 

Sequence 1230 

CATTAGGCGAATTGGACTCCACCGCGGTGGCGGCCGCCCGGCCAGGTACTCCATCTAGTC 

TGGGTGACGGAGCGAGGGTCCATCTAAAAAAAAAAAAAGACATNGAAGNNCNGNAGAACT 

GAGTTAGAAACCTTCTTCNATTCCGTGTTTGGACCTTGGGAAGGCTGTTAAGCCTCCTGG 

TGCCTAGTTTTTCTCATCTGTAMCAGGCNTAATNAACACCATTTGCACATGGTTGGTAC 

CTN 

Sequence 1231 

AGGTGCGGTATCCTTGCACCTTGCTTCTCATACTTCAAGCTACTGCTTGGGTGGGTCCCG 

TTGTAAACTANATGAACACTCATCTGACTATGGATATCATCACTCCTAGATACTCCTGCT 

CATATGATCCACTTACCAGAGNNTCTGTNTGGACATCTTTTAACCAAGCTAGTAGTAACT 

GCCCG 

Sequence 1232 

CCGGGCAGGTACCANGCTGTAACCAATACGATTCTGGGGCAGGTTG TGGGCGAGTAGAAG 

AACCTCCTTCCCCTCTGCGACATTGAACGGCGTGGATTCAATAGTGAGCTTGGCAGTAGT 

GGGTGGGTTCCAGAAGGTTAGAAGTGAGGCTGTGAGCAGGACCTCCTTCCAGGGGACATG 

CAATCTGCAGGGAGGGGCTGAGGGGGGTCCCATGGTCTCTGCTGTCTTCTCTGTCCACCT 

CTTTGTAGAGGAGCTTGAGCTCCAAGGAATGCTCTGGTCAGGGCTGCTGTGACTGTTGGC 

CCTGCTGTCC7TCCTTCTTCTTGTCCCCGCGTACCTTCGGCCCGN7TCTTAAAAACTNGT 
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GGANTCCCCCCGGGCCTTGCAGGGAATTTTCGAATATTCAAAGCCTTATNCNATTNCCCN 

GCCGAANCCTTTCNAANGGGGGGG 

Sequence 1233 

TGGCCGAGGTACTTTGGCCTCTCTGGGATAGAAGTTATTCAGCAGGCACACAACAGAGGC 
AGTTCCAGATTTCAACTGCTCATCAGATGGCGGGAAGATGAAGACAGATGGTGCACGCCA 
CAGTTTCGTTTTGATTTCC 
Sequence 1 234 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAACGGCCTCAATTTCTCGGGCAGCAtCAGCTC 

CACTTTATCGAGGAGCCGGTGGTAGGTGGCCCGCAGGCCTCGGGCACCGGCGGCCGACAT 

CGCCGCCGAGGGATCGTTGGCAGCCGGGTGGGAGCGTGGCTGTGTTCTCGTCCCTGGCTG 

ACAACGAAGGGGAGCTAGTCACTTTTCCTGCCACGACGACTCGCGTCCGCCTCTCGCCTG 

GAGTACCT 

Sequence 1235 

CCGCGGTGGCGGCCGAGGTAC I II I M I I t I i I I M I I I I I I I I I I GNGTGGAGCATGGG 

ACCTTTATTCGTTAAGACATCAGGCTCCAGATATGAACTTTCAGCAAAAGCGCTTGCCGG 

GAGCAAAGGGACAGAAAAGCTGAGATGAACAGTGCCTGGCAGCAATCACAGCCGGGCAAG 

GGTGCTCCGAGCCTCGCATCCCC 

Sequence 1236 

ACTACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACCAGCAG 
AAACCTGGCCAGGCTCCCAGGCTCCTCATCTATGGTGCATCCACCAGGGCCACTGGTATC 
CCAGGCnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 
nnnnnnnnnnnnTTTGCAAGTTTATTACTGTCTCCAGTATAATAACTGGCCTCCGTACCT 

0 

Sequence 1237 

CCGCGGTGGCGGCCCGCCCGGGCAGGTACTGAACTAGCTCCTTCTGGTTAATTTGTTGAT 
TGGATTGGAAATTAGAACATGGAGCTGGTCAATGCACGGTATCTGGTAATTGTGGATGGG 
GGAGATGACTGGGGCAGMCTGAGCTCTATTTTTGCCAACATAGTACCT 
Sequence 1238 

GGCGGCCGCCCGGGCAGGGACTTTCTTCTGTGTGACAA7TACTGCACAGTCTTTCCCTCT 

GACAGCTACTGATGTAAGGCCACCCTGGTTAATAGCCTTAAAAGCATATTCTACTTGGTA 

GAGCCGACCCTCGGGTGAAAAAATGGTAATGTGGCGGTCAAAACCGGCGCTGGAACCACG 

GGACATGTTGGTACCT 

Sequence 1239 

AGGTACACTTTTGGCCAGGGGACCAAACTGGAGATCAAACGAACTGTGGCTGCACCATCT 
GTCTTCATCTTCCCGCCATCTGATGAGCAGTTGAAATCTGGAACTGCCTCTGTTGTGTGC 
CTGCTGAATAACTTCTATCCCAGAGAGGCCAAAGTACCTGCCCG 
Sequence 1240 

CTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTGGCCTCTCTGG 

GATAGAAGTTATTCAACAGGCACACAACAGAGGCAGTTCCAGATTTCAACTGCTCATCAG 

ATGGCGGGAAGATGAAGACAGATGGTGCAGCCACAGTTCGTTTGATCTCCAGCTTGGTCC 

CCTGGCCAAAAGTGTACCTGCCCG 

Sequence 1241 

CCGCGGTGGCGGCCGAGGTACAACTAAATATTAGCTAATGGTAGCACAAGTAGAACCTGT 

TTAAAATTTTTAAAAAACAACCTGTTCACTGTAAGAATGAGATACCCAAGGAAAGAACAG 

GAAAG AGGTGTA I I I I I AATCCTGCCTTTTGTATTTAGTGTG CCCC ATGACTAGCTTGTT 

CATCTAACATGCTTCTTGGTTCTCGGCAGGTGAATGGCTATGTCGTGMCCTTTTCTGGG 

CATTTGACACCCACAAGCAAGAGAGGAGAGCTTTGAACTTNCGAGGAAAGCATATTGTCA 

CACAAAGGTAACACTTTTTGGTGCTGGTCAGTGTTGTGTGCCAGGAAGGGGGTAAANGGC 

TNGGACCCAATNCCTTTTCCCCGCCCGTAANAAGCCTTTGAACTNGGGGGGTTCCCCAGG 
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GGAAAAGG 
Sequence 1242 

CACTACTATAGGGCGAAATTNGAACCTCNCCGCGGNGGCGGCCGAGGTNCGGGGGCCAGT 
TATTATACTGCTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 
nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTACTAGTGGCCCTGG 
TGGATGCACCATAGATGAGGAGCCTGGGAGCCTGGCCAGGTTTCTGCTGGTACCTGCCCG 

0 

Sequence 1243 . 

CCGGGCAGGTACGCGGGGGAGGAACTGCTCAGTTAGGACCCAGACGGAACCATGGAAGCC 
CCAGCGCAGCTTCTCTTCCTCCTGCTACTCTGGCTCCCAGATACCACTGGAGAAATAGTG 
ATGACG 
Sequence 1244 

AGGTACCTGCAGGCCTCCTACACCTACCTCTCTCTGGGCTTCTATTTCGACCGCGATGAT 
GTGGCTCTGGAAGGCGTGAGCCACTTCTTCCGCGAACTGGCCGAGGAGAAGCGCGAGGGC 
TACGAGCGTCTCCTGAAGATGCAAAACCAG 
Sequence 1245 

CCGCGGTGGCCGGCCGCCCGGGCAGGTACAGCTATTCAACATGTTTAAACAGATGTTTGC 

TGACAGCACTGACTTCTCCAAGAGCCTAAGAGAGGAGAGCAGCCAATTAAACTCCACTGT 

CTTCTAAAATTTTAATCCTAGATTGGTTTCAGATGGGTTAAGACAGTCATTCAACCCACA 

AGTATTTAAGGAAGGCACCTGGGACTGACTCAGTTCCTAGATTTAAACCTATTCTGTGAA 

CTGCCCACCCCTACCTTCAAAGTACCT 

Sequence 1246 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAAGAGTTACAGCCCTTATGACATGTTGGAAAG 

CATCAGGAAAGAGGTTAAAGGAGACCTGGAAAATGCTTTCCTGAACCTGGTTCAGTGCAT 

TCAGAACAAGCCCCTGTATTTTGCTGATCGGCTGTATGACTCCATGAAGGGCAAGGGGAC 

NCGAGATAAGGTCCTGATCAGAATCATGGTCTCCCGCAGTGAAGTGGACATGTTGAAAAT 

TAGGTCTGAATTCAAGAGAAAGTACCT 

Sequence 1247 

TCGCACTCATTTACCCGGAGACAGGGAGAGGCTCTTCTGCGTGTAGTGGTTGTGCAGACC 
CTCATGCATCACGGAGCATGAGAAGACGTTCCCCTGCTGCCACCTGCTCTTGTCCACGGT 
GAGCTTGCTATAGAGGAAGAAGGAGCCGTCGGAGTCCAGCACGGGAGGCGTGGTCTTGTA 
GTTGTCCTCCGGCTGCCCATTGCTCTCCCACTCCACGGCGATCCCGCGTACCT 
Sequence 1248 

TCACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAAGGGAATAGA 

TAACAGGGACATTGATCTAAAGGGAGGTTAGGGAGGACTTCTTGGAGGAAGTGGTATTTG 

TGTCTGAGTTATATGGGATGTGTGGGATTTCTTAGGTGAAAGAGGGCAGAGCAATACGGT 

GGTAGGCTAGATCTGTGCAGCCGGCTTATAGACAAGACTGTACCTGCCCG 

Sequence 1249 

CCGGGCAGGTACGCGGGGCCGCAGTAGTTGGAGTCTAAGGACTCGTGACAATCTTCGGGT 
GCCCTTCGAGAGAAAAGGGGAGGATGCCACTGGAGTCATCCTCTTCAATGCCACTATCCT 
TCCCATCTCTCTTACCCTCAGTACCT 
Sequence 1250 

CATACTTAGGGCGAATTGGAGCTCACCGCGGTGGCGGCCGCCACCATGTCCGCCTCGGCT 
GTCTTCATTCTGGACGTTAAGGGCAAGCCATTGATCAGCCGCAACTACAAGGGCGATGTG 
GCCATGAGCAAGATTGAGCACTTCATGCCTTTGCTGGTACCT 
Sequence 1251 

ATCACTNCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGACATCATTT 
TCAAGAAGTTTTGCCATAAAAGTGAAAAATGGTAATAGCTGAAAAAATTTAGCGTTGACA 
GAAGGCIl rm lAAAGACAGGAAAAATTATGGCGCAGAGGGAAATGCTGATGATACAAG 
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ATGGAGAGAGAGAAATTACTGGAGTAATGACTCTGAGCAGATGTGGATGGCATTAATAA 
Sequence 1252 

GCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACTACTGGCACTGAGCCAATGTATG 

CTATCAAGGAAAGCTTTATCTGTCACTGAGCAAAAGGTGAAGTTCAATTAGGTCAGTTTT 

ATCAC TTCTTTTCC TACACAC 

TATTTAl MUM IGAGACAGGGTCCCACTCTGTCGCCCAGGTTAGAGTACCTGCCCG 
Sequence 1253 

CCGGGCAG GTAC AGTAGTCAGCTACTTGGATATCAAGACTAAGTGCTGATATGAAGACTT 

GCCTTTGTGTTTTGTTATGAAACACGCAAGCATAAAGCAGGACTCAAGCACAAGGCTGAC 

TGTTCTACTTTGAATAACAGCTTCCTTGCAGTCTCCTCCACATGGGTGGTACCTCGGCCG 

CTCTAGAACTAGT 

Sequence 1254 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAGAACTTTTGGAAACACCGTCCCTTCAAATGC 

TACCTTACTTAAAATATGTAGACATTAGTCCATCCAACCTCTGCTACCAGGCAGTAACCT 

TTCATAGATCTTCCAAAGAAAATGTTCTTGATAAAACCAGAAGAATCTTCTCAGAGAGAC 

CCAGGGGAGTGATCTGACCCTCACTGCTATCTTGACTTAAGTTTTAGCTGCAGCCATGGC 

GACTCAAGTACCTCGGC 

Sequence 1255 

NCCGGGCAGGTACTAACCCAACAATAGAATTACCTATCCNTAACTCGTTAGCCCAACACA 

GGCGTGCTTTAAGGAAAGATTAAAAAAAGTAAAAGGAACTCGGCAAACACGAACCCCGCC 

TGTTTACCAAAAACATCACCTCTAGCATAACAAGTATTAGAGGCATTGCCTGCCCAGTGA 

CTNAAGTNAAANGGCCCCAC 

Sequence 1256 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCTGAATTTACCAGTATTTTCTCTTACGGCTTT 
TGGAT TTGACTCATAGTTGAAAAGGTCTACCATGTTCAAGGTCATTAAGAAAGTATCCTG 
MM ICTCCTAGATCTTGGGTAGCTTTTGGTGTTGACCTTTACATTTTTTATTCATTTG 
AATTTGGCCTGGTGTAAGACAGGCTTTCCCCTAGAATGCTGCCCAGGTAGCTCAGTATCA 
AATATTGAACTATCCCTATTTTCATTCCACTCTGATTTATAATGCMTTTTTATCCCACA 
TATATTTGGATCTATTTCTGAAC I i II I M i I I ! AACATACATATTTTTACACGTTCCAA 
CTCTTAGAAATGTTCCATGCTGTTTCTGTTTTTANGCTCCTTTTANGTACCT 
Sequence 1257 

CCGGGCAGGTACTCTTGTTTAACCATCAGAGGTGATTCCATCACCTTCACAGCCCCAGCC 

TCTGCTCCAGTCCCTCCCCAGCGAAAAGGGCCGCCCATGCCATCCTGCTGCTGGTGATTT 

GCTTTGTGGTCATGGACTCAGTGGACATCATTATTTTATTMCCGTGTGGTAGGTTTTTA 

ACTCAGTTATCCTGGATATCCAAAGGTTTGTGGTCCATCTTTAGGCTTCCGTTTGTTCTT 

TGGTACCT 

Sequence 1258 

CGNGGTGGCGGCCGAGGTACAAGATGCACTTGGTCACCCGGTTGTAGGCTCCAGGCTCCA 

TGAACCAACAGAAGCTTCTCTAAGTAGGAAAACTGTGCGATGGCGTAATCCGAGTTGTTG 

GTGGCCTGCATGCCTTCATTCCCACTGATTCCCACACCCACGTGGGCTGTCTGGATCATC 

CCGACATCGTTGGCGCCGTCTCCGATGGCGAGGGTGATGGCCTTCACCCGCTTCTTCACC 

ACATCCACTATCTCAGACTTCTGCAGAGGAGACACTCTGCAGCATATGACCGCTTTGCAC 

GAGAGTGCCAAATCCAGGAAACTCCTCCGGACTTCGAAGGAGAGCGCGACCTGCCCGGGC 

GG 

Sequence 1259 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCTCGGTTGCAAGCACAAGCA 
AATGTGCCAGGGTGGTTGATGCAGCTGTGGTCACAGGTCCTATCCAAAGAGCACTCATCC 
ACATCTTGGCAAGACTTCTCATCTGTTAATAATTTAAATCCTTTCTT 
Sequence 1260 
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AGGTACI 1 1 I I I IGTTTGI I M I I I I I I ! CACCTTAAGGGAGGATTTAATTTGCTCCCAA 
CTG ATTG TTC ACTTAAATG AAAATTTAAAAATG AATAAAAAG AC AT 
Sequence 1261 

AGGTACATTATATACAAATATATATACGATGCATATGTAGATGTATACATGTATGTATAT 

ACTGTTGTATAGAGATATGAGTATACATGTATGTGTATATCTCTTTATATAGATATAGGT 

ATAGATATAGATATAATC ATATAGGTTGTATAGC GTTTCC AAG G CTCTTTCATAACTAGT 

TCTCAACTGGGAGCAATCTTGCCTCCCATGGAACATTTTGGCAATGCCCAAAGATATCTA 

TATTTGTCACAACTGAGGGGGAGGGTTTACTAGTGACAGGTACCT 

Sequence 1262 

AGGTACTTTGATCTTGGACTTCCAGCCTCCAGAACTGCAGTGTTGGTCTGCAAGCTTCAA 
GAGCCAGTGCCCTGACTGCCAAGTGATTTGCCGAAGGGAATTATGGGGAATTCGCAGGTA 
TTCCTGATCTGCTATGTATTCCAGAAAGGCCATGGGCAAAAGGAAGTACCTGCCCG 
Sequence 1263 

AGGTACCGATTTAGCGTGGTCCCAGGCATGTCGGATGTGGGAGCATTCGATCCGCGGCGC 

TGCCTTCCGTCGAGCATTCCGGTTCAGACGCTTTCGGTTGACACTGTAACCAAACTTCTG 

CCTCCGGGTTTTGCCCTTGGCCTTGGGGCAATTNGCGCTGNACACACCCGGCACNCAGNC 

AAGACTCAAACAACGCTGCNCTCTGTACTNTTCAAAACNCTNNGTGTTCCCANGCCGATA 

ACCTTGCACCGGGGCCGGNCCCGTTTCTTAGGAACCTTAGTTGGGGAATNCCCCCCCGNG 

GNCTTGGCAAGGGNAAATATCCGAAATATTCAAANGCCTTTTATTCNGNNATAACCCGNT 

NCAGAACNCTTNTGAAAGNGGGGGGGGGGNCCACTCGGGNTTMCCCCCAAGANCTTTTT 

TATTGGTTATNCCCC 1 I I I I AAGNTTGGGAAGGGGGGGTTTTAAAATTTTTGATCNNCCC 

GNCCCTTTTGGGGGCCCGG 

Sequence 1264 

AGGTACCTATGATTCGATAATTGACAATGGTTATCGGGGTTGTCATACACTTGTGCTAGG 

AGAATTGGTTCTTGTTACTCATATTAACAGTATTTCATCTATGGATCTATAGATTAGCCC 

AATTTGTAATATAGGAATTTATTGAGGTTTGTGGAATTAGTGTGTGTGTAAGTATGTATG 

TTGAGCTTGAACGCTTTCTTTATTGATGGCTGCTTTTAAGCCTACMTGGTTAAGTGGAT 

TGTAGTTGTTTTATTC ACTATTTAAGGN TTTTTCCCG 

Sequence 1265 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAACTAAAAACTCTGG 
ACATTACATTAAAGCAAACATAACAAAACATTGAAGGGTGGGGAAGAAAAAGCAAGGTGG 
CTAGAAATGACATGGTGGTAAGTTCCCTGGGTTTCCTTTTTGCCCAATATATCCCAGACT 
TGAAGCTGAAGAAACCAGTAACCTAGAAACGCCAATGGGTACCTGCCCGGGCGG 
Sequence 1266 

AGGTTCTTCAGCAAGCCAGCTTCTCGGATGGCGGCTGGGCCATATTCCACTCCTTTCTTT 

TTCTGTGAATCATACAAATCCCACAGACATAAGAGAAGGTGAGAAAAGGAGCAGAAACAC 

TTTACACACAAATCTCTGATGCTACACGGAGTTCCTAACAGATTCCTCTGCAAATCTTTC 

ACAGCATCAAMCTTCTTCGGTAGAAGGCTCGTCGGTTCATTAAATATTAATGAATGGAG 

GMGGAAGATTTTGTTCTTGAAAGCAAATCAACCTTCTGGGTCCCTCACTGAGAGCCAGA 

CCAGAGGAGTCTGTCTTCCGATCGGGGAAAGGGCT 

Sequence 1267 

TACCAGTGGCCCTGGTGGATGCACCATAGATGAGGAGCCTGGGAGCCTGGCCAGGTTTCT 

GCTGGTACCTGCCCGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnNGCGCTTGGCCGTAATCATGGGNCATAGGCTGTTTTCCT 

GTGTGAAAATTGTTATCCCGCTCACAATTTCCACACAACATTA 

Sequence 1268 

ACGGGCAGGTACACATATTACAAGAGCCTGCTACTGGGAACTAACTGAATTCACAAAGAA 
ACAGTGATTCAAGTTCGTTAAAACTTTTGCTCTATCTTGAATAACAAGCTTATTACATGC 
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GAMCCTATTGGGGACCMCTGAATTCACCGTMTACTTAGA7TCCGTTCTTTAAATGTT 

GCTATAT ATTTA AATGCACACTCATATAAGCATGTCCCATTGGNAACTCTCTGAAATCGT 

CTAGAAATTTTGACTCCATTCGTGAAAATTTTTTNTACATCCGGGAACAGTCCACTTATT 

ACTTTCTGTGGCCTA 

Sequence 1269 

GGGGCAGGTACATGCGAATCCTATTGGGAACCTACTGAATTCACCATGATACTTAGATTC 

CGTTCCACAAAATGTTGCTCTATAATTGAAAAGCAAACTCATACAAGCATGTCCCATTGG 

GAACTCACTGAATTCGCCTAGAAATTTTGATTCCATTCGTGAAMTTTTTGTATATCCCG 

AACAGTCCACTTATTACTACTGCGGCCTACTGGGAACTAACCGAATTCACCATGTTACTN 

AGATTTCGGCTCACCCAAGTTTGGATAAATCTTTGA 

Sequence 1270 

CCGGGCAGGTACGCGGGGACCCGGCGCTCCATTAAATAGCCGNAGACGGAACTTCGCCTT 

TCTCTCGGCCTTAGCGCCATTTTrTTGGAAACCTCTTGCGCCATGAGNAGCCAAGTGGTA 

GGAAGAAAGCCGAAATGTTGCAANGCNTGAAANCCGCCAAANAGNAANGNAAAGAATTGA 

AGGGCAGNTAGTGTTCCAMGNTAAAAACNCGCCTAGNCTTTGTTTTGCCACCCCGCTGG 

GNAGNGCCCANCAAGGNAGTCCANNAAAACANTGTGNAAATTGNCACAAGAACTGCTTGG 

GGGGGANTGCCTTGGNTAANCCNTTCGNGGCNCGGCCTTCTTAGTAAACCTTAAGNTNGG 

GAATCNCCCACCCGGGGGTCTTGGCTANNGGAAANTTTCGGATTAATTCCAANAGGCCTT 

TTMTTCCGNATTAACCCCGTTCCGTAANCACTTTTGTAGGGGGGGGGGGGGCCCCCCNG 

GGTTATCCCCCCAAGACC 1 1 I I 1 1 I IGGGTG 

Sequence 1271 

CCGGGCAGGTACACTTTTGGTCAGGGGACCAAGCTGGAGATCAAACGAACTGTGGCTGCA 

CCATCTGTCTTCATCTTCCCGCCATCTGATGAGCAGTTTGAAATCTGGAACCTGCTCTCT 

GTTTGTGTGCCCTGCTTGAAATTMCTTTCCTAATCCCCCAGAAAGNAGGGCNCCAATAA 

GATATCNCTTCGGGGGCTCGNCNTNCTTAAGAAACCTTAANTTGGGGTAANTTCTCCCCC 

CGGGGGGCNTTGGCCAAGNGGANANTTATCCGNAATTAANTTNATAAGGNCCTNTTAATT 

NCTGGAATTAANCCCCGGNTGCCTNANNCTCCTNCNGTNAAGNGGGGGGGGGGGGGGACC 

CCCCCGGGGGGATNAACCNCCCCNANGCCCTTTTNTTTTGGTATTTACCCCCCTTNTTAT 

NANGCATAGGNAAGGGGGGGGGTTT 

Sequence 1272 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAACTTTAAACTGTATTGTATTCATGTTGCTAA 
ACAATATTGGCCTTCTCGATGATTTTATTCATGTTGCTCCAAAGTTAAAACCCTGTAGAA 
CTAAGTAGGTGAAGAGATATTTTGTATAAGTGCCACAGAAGAGAAAATATAATAAATTAA 
ATAGTGAATTGAGCATCACTAGAATAAAATAAAATGAGTAGGCATTCCTAAGATGTGAAA 
TGATCACCTMGATATACATGCTCCAACCATATTGATTTTAGAACAAAACACAGCAGCCC 
ATAACA GTTTG TGGCCTCCACTAACTGTCCTCTGCTGTCCCATCTAAGAGGGTTATGTTT 
CTCCTAI I I 1 1 AAAAATAAAATGGTAGTTTAAAATTAGCCTGACGGGATGTTTCCT 
Sequence 1273 

ATCATACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCTACT 

TACCAGCCCTTGAACCATTTCTATATGATGCTCCTCCCAATATTCTGAAACATGTGGTTG 

GACAGTTCAGTAAGGTAAGAAATGCCAGCCTCTAAGGGGAGGAAGAAAATTAAAATGATT 

TGTTGTTA I M II ll I ICTGTTGGGTCAGTAGCTTTTTCTGTAACACGGGGTCATGGTTT 

TACTGCTTAACTCTCACCAATCTGTATGGACAGAGAAGTTACCTTAATAAAATTCACATG 

TAATTAAAAAAAAAAAAAAAAAAAAAAAAAAAGTCCCT 

Sequence 1274 

AGGTACGCGGGGCTTTTCACAAGATGGCGCCGAAAGCGAAAGAAAGGAAGCTCTGCCCCC 
TTCTTAAAGCCTGAAGCCCCAAAAGCCGAAGGGCTTTTAAAGGCCAAGAAAGGNCAGTGT 
TGAAAGGTNGTCCACAAGCCACAAAAAGAAGAAGATCCCGCACGTCACCCACCTTCCnnn 
nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 
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nnnnnnCGACCTTCGAGGGGGGGGCCCCGGTACCCAGCCTTTTTGTTCCCTTTAGTGAAG 

GGGTTAAATTGGCGCGCTTTGGCCGTAAATTCATGGGTCATTAAGCCTGTTTTCCCTGTG 

GTGGAAAAATTTGGTTATCCCGCTTCAACAAAATTTCCCACACCAAAACATTAACCGAAG 

CCCGGGGAGGCATTAAAAGGTGGTTAAAANCCTGGGGG 

Sequence 1275 

CCGGGCNGGTACCACTCATGAAACCTGGCCAGGCTNCCAGGCTCCTCATCTATGGNGCAT 

CCACCCANGGCCACTGGTATNnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnhnnnnn 

nnnnnnnnnnnnnnnnnnnnnCNCCCTGAAGATNNTTACAAATTTTAATTATTGNCAGCC 

AGTTATTAATAAACCTGGNCCCTGGTGTACCCTCGGGGCCCGCTTCTAAGAAACCTAAGT 

GGGMTCCCCCCCGGGGCCTTGGCMGGGAAMTTTCNIGAATATTCAAAGCCTTATTCGA 

ATTACCCCGTTCNAACCTTCNAAGGGGGGGGGGGCCCCCGGGTNACCCCCAAGCCTTTNT 

TGGTGTCCCTTTTTAAGTTGGAGGGGTTATAAATTTTGCCGGCCGGCTTTTGGGGCCGAT 

AAAATNCAATTGGGGTNCNATTAGGCCTGGTTTTTCCCCTTGGTGGTTGGAAAAATTTNG 

NTNTNATTCCCCCGGCNTNCACCAAAANTTTTC 

Sequence 1276 

TCACTACTTAGGGCGAATTGGAGGTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGGGCGG 

GGGGATAGCTGTCCAAGGTCTCCCCCAGCACTGAGGAGCTCGCCTGCTGCCCTCTTGCGC 

GCGGGAAGCAGCACCAAGTTCACGGCCAACGCCTTGGCACTAGGGTCCAGAATGGCTACA 

ACAGTCCCTGATGGTTGCCCGCAATGGCCTGAAATCCAAGTACCTGCCCGGGCGGCCGAG 

GTACCAACAGAAACCTGGCCAGGCTCCCAGACTCCTCATCTATGATGCATCCGACAGGGC 

CACTGGCAnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnTTTTGCAGTTTATTACTGTCAGCAGCGTAGCAACTCGT 

ACCTGCCCGGGCG 

Sequence 1277 

CGAGGTACGGGGGCCAAGTTATTATACTGGCTGAAAGTAATAAAATTGCAAAATTTTAAA 

ACTGCAGGCTTGCTTGATGGTGAGAGTGAACTTTGTnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnTACCAGTGGCCCTGGTGGATGCACCATAGATGAGGAGCCTGGGAAGCCTGGC 

CAGGTTTTCTGCTGGTACCCTGCCCGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnTCGATATCAAGNCTTATCGATACCGTTCGACCTTCGAGGGGGGGGGCCC 

CGGTACCCAGCTTTTGGTTCCC I I I I I AGTGNAGGGTTTAAATTTGCCGNCGCTTGGGCG 

TMTCMTGGGTCATAGCTGTTTTCCTGTGTTGAAAMTTGTTATCCGCTCACAAATTTC 

CACACAACATTAACCGAGCCCGGGGAGCATTAAAAGTGTTAAAAGCCTTGGG 

Sequence 1278 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAGTGTCTTGATTAT 

AGTAGCTTCATAGCGAGTCTTAMGTCAGGTAGCAGGTAGCATAATTTCTCATTTGTTCT 

TTTTCMGATTGTGGTAGATATTATTTTTATTATTA I HUM I GAGACGAATTCTCACT 

CTTGTCCCCCAGGCTGGAAGTGCAGTGGTGTGATCACAGCTCACTGCCTCCGCCACCTCC 

TGGGCTCCCCGCGTACCTGCCCG 

Sequence 1279 

CCGCGGTGGCGGCCGAGGTAC 1 1 I 1 1 1 1 I IACI ITTTTTTTTTTTTTn I AAGAAACTGG 

GTCTCTCTGTCACCCAGGCTGGAGTGCAGTGGCAACGATCATAGCTAACTGCAGCTTTGA 

CCTCCTGGGCTCAAGTGATCCTCCTGCCTCAGCCTCCTGAGTAGCTAGGACTACAGGTGT 

ATGCCACCATGCCTGGCTMTACAAAAAAATTNTTTTAAGAGATGGGATCCCTCCCTTTT 

AAMTCAGAACTTGTTCACATGGTGGTTGCTTGTGGCAAAACGGAGTTCAMTTTTGCTC 

TCCTATTGCTATAATTCTGCTAGCAATCTGTTGAGGTGAAACTTGGGATCTGACTCTTCA 

GCAAGCAGCAAATGACCTAGTAACTCAGGGACAACTATTTTTGAACTTTAAGTGCCACTT 

TAATGCAGTTAGTT 

Sequence 1280 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCACCTATGTGGAGGAG 



Page 172 of 635 



WO 01/51628 



Table 1 



PCTYUS01/00798 



ACTGCAAGGAAGCTGTTATTCAAAGTAGAACAGTCAGCCTTGTGCTTGAGTCCTGCTTTA 

TGCTTGCGTGTTTCATAACAAAACACAAAGGCAAGTCTTCATATCAGCACTTAGTCTTGA 

TATCCAAGTAGCTGACTACTGTACCTGCCCGGGGCGGCCGCTCTAGAAACTAGTGGATCC 

CCCCGGGCTGCAN 

Sequence 1281 

TCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGTGGANAGGAC 

CNTNTN AAAAAAAAAAC NTN AAACNGAAAAGGATTN NTGTTTN AATATTGACAN AANCCA 

AGGAACACCATCTGAAGTTCTGATGGCAACATCAGAAGCTAAGAGAAAGGCATGGAAAAG 

ATTCTCACCTAGAGCATCCAGAGGAGAAGGTTGGTCCTGCAGACACCTTGTTTTCTGACC 

TCTGACCTCCGCAACTGTGAGGGAAGAAATTTCTGTTGCTTAAAGACACACAGCTTGTGG 

TACCTGCCCG 

Sequence 1282 

CCGGGCAGGTACGCGGGGAAGGAACCAGAGGCCACTTGTATATATAGGTCTCTTCAGCAT 

TTATTGGTGGCAGAAGAGGAAGATTTCTGAAGAGTGCAGCTGCCTGAACCGAGCCCTGCC 

GAACAGCTGAGGAATTGCACNTGCAANCCCTTNMNTGANNAACCAANTMGAAATTTCC 

CTTTGGGNANAAGCAANGCCCTACNGGGCAAACCTTAAAAAANTGNCCCANTNTTCCACC 

CNTGGNAAAACCTTTGTATTGGNGAAGGGGNGGGAAAAAACCTCCCCTNTNGGAATAGNA 

NTTTTTTGGAAAGGAACCMNAGNTTTNTTTTTH^ 

CCAGGAAAATTCTGTTTGAAAATTTCNAAAAAGNCCCCANCCAANTNGATTGGCAAAACN 

CTTAACNTTGGGGGCCCCTTAATTCCNTAAAACAGNCCAACCCCTTCCANNAAAGGGGGG 

CCAAAAAAAACNNAAAGGGCCAATAGCCCCCCNTTGGGNAAAATTGGCTTNTNANCCGTT 

TAAAAAAGG 

Sequence 1283 

CGAGGTAC I I I i II I I I I ! I I I I II I II M M I I I GGGGGNAATTGGGCATAATTTACTA 

GCCAATTTAGGAAGTTCCCCTCACATCAGTAACATACAGTACATCACCCAGTATGTCAGA 

GGACACAATGGCATAAGTTTGCCTTTGCAAGGTTTGAGGGATGGCCATTTCCCGACCTGA 

CTCAGGAAAGTCTGTAGCTGATATCCATCTTCAAGTTTGNGGNTCTTTCTCTCTATATAT 

ATATTTGAGTNCAGCAGTCATGCTGGAGTCCAGAGTAGGTGATTCTTTCTGCTTTAGCTT 

GACTCCTCCTTAAGATTGCAACTCTCTCAGTTTTACA I I I I U IG 

Sequence 1284 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAACTTCCTCCTAGCGTTTCTGGTTAACTATGT 

GAACACACAGGCAGACAGATCAAGCCGCTATGTGAGGGACAAATAAACATACTGAAGACC 

TAAAAATGTGAAGTCAGCTGAGCCTCAATTCAATCCAAACCTTTGGGGGGCTTATGCACC 

ATTAGTCTGTATCATGGCCTCCAGTCATTTGTCCCTCTAGAACAGGAACTCTGAAAGGCC 

TGACAAAGAAGAATGGACAATCACAGATGCTGGTTTTGCCTTCTCATTCCCTGCAAGTCT 

ACTGGATGCAGGAAGAGGGTGGGGGGGTTCAAGCTGCCTGCTTAGGAGCTTAAACATGAC 

MGGGATTTTCTAATGAAAAGCTAACCCAGAGATTCCTCTTTTTCCTATGATACCACAGC 

ATCCATCATTCAAAAACCGAAAAAGGGAC 

Sequence 1285 

CGAGGTACAAGGACCACCAGCATCAGCATCACCTGAGAACTTTTTAAAAATGCAGAATCC 

CAAGTCAGCTGAATCACAGTGTGAATTGTTAACTAGGTTTCCCGCTGATTTTCTATGAAA 

ATTTCTGGTCTATACGGAGTGTATATGACAAATATATACATGTGTGGCCATGCAGACATG 

CTTATTCtCACCTATGGCAAATAGAACACAGCTCTCCATGGTCAGGTGCTTCCATCCCTA 

ATGGCTTCCCACCAGTGAGAAAGACATTMGAGTCAGAATACTTTTGCTGTAATTCTCTG 

GTAAATTTTGGTACCTGCCCG 

Sequence 1286 

CGAGGTACAGAAGAAACTTGATTCTATAATTGATTTGACAAAAGAAGGCCTATCCAACTG 

CAATACAGAAAGTCCAGTATCCCCCCTGGAGTCACATTCGAAAGCTGCTTCAAACTCAAA 

GGAAACAACCCCATTGGCACAAAATGCAGTCCAGGTTCCTGAGTCCTTTGAGCACCTGCC 
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ACCTCTCCCAGAACCACCAGCACCACTACCTGAATTAGTAGACAAAACCCGAGACACACT 

TCCTCCCCAGAAGCCTGAGTCGAGCGGCCGCCCGGGCAGGTACTTTGGCCTCTCTGGGAT 

AGAAGTTATTCAGCAGGCACACAACAGAGGCAGTTCCAGATTTCAACTGCTCATCAGATG 

GCGGGAAGATGAAGACAGATGGTGCAGCCACAGTTCGTTTGATCTCCAACTTGGGCCCCT 

GGCCCAAAGTNTACCTTCGGNCCGNTCTAGAACTAGTG 

Sequence 1287 

AGCTCCACCGCGGTGGCGGCCCGAGGTACTGGAGGACACTGTGCTTGGTCTCATTGATCC 
TGTTAACTTGACTGACTGCTCGATGGTCCAGCCACATGGTGACGTTTCGATGGGAATCCC 
CTTCCTGGTTGACTGGTAATGGGTGAAACTGCTTATCAAAACAACCAGAGAACACGTGGC 
ATCAAACCCAAGTCCTCGAATTTGGTTTAAATCMTCCCTTGTACCTGCCCGTGTNTNTT 
AAATCTANAATGATGNAMGTTTTGAATAAGNNGACTATCTTACTTCATGCAAAGAAGGG 
ACAC ATATG AN ATTCATC ATC AC ATG AN ACAGC N AATACTAAANGTGTN NTTN G ATTATN 
AGAGTTTAGATAAATNTNTGAAATGCANGACCACNNNANGGNATGTTTATGGGG 
Sequence 1288 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTTCAAAGAGCCAGTTGGG 

TTGGCTGTGTTCTTTCTATTCGTGGAAAACAGCTGTGGGTGCCGGGTGTTGAAGTCAGGC 

TTTTGTTCCATGCTAAGGTAGGAAGGTTTGTTCTTGAGACCTATCACTCTTCTGTGGAGG 

AGGAGTTCACAGAATGGTGAGCCGCTTTATCACCATCTCGGAGGCTTTTGAAATATGTTG 

ATAATGACCTGGGAAAGAGGGCCGAGTGGGATGAGGGTGGGAGTAGGAAAGAAACATTCG 

GCCCACACAGTGTTCATATCTTCTCTTGTTTTTGTATCTGTGGCAACTGCAGATTAAATG 

AATTAGGTTATTCTTTTCCATTGCCTTCTACCAAATAAGAAGTACCTGCCCG 

Sequence 1289 

CCGCGGTGGCGGCCGAGGTAC I M I ill I I I ill I I I' I lit II II MM I AAGGAACTGG 
AAACTAAAAACTGCCCTGGCCTATCATAAGGGAAAAGATGGGAGTCTTTTGCTGTCCATC 
TTGAATGTAATTCACATTGTCCATGAGTGCTGATCTAACCCCATATGCCTNTGCCCGACT 
GCCTTTATGAAATATGGTAATTTAATGCTTAAACAACTGTTTCATTTGTGTACCTGCCCG 

0 

Sequence 1290 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGTTTGTAGATGGAAGGAAGAACCTGT 

GTGCTTAGACCTGACGCTGGGAGGAGATGCTGCCACCTAGGTTACTTGTAGGACCCTATA 

CGGCAACCTCCTTTGCCAGGMCTATTTATAAACATCCTGCAGGAAAATGAGTCTATATG 

TCAGAATACACATTTCCCACCTTGCCCAACAGTAGAAAAACATAAGAAGAGAAAAAAAAA 

AAAAAAAAAAGGTCCCT 

Sequence 1291 

TACCATTGCCCCTGGCNNNATGCACNATACATNAGGAGCCTGGGATCCTGGCCAGGTNTC 

TGCTGTTACCTGCCCGGGCnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnAGTGAGGNGTTAATNGNGCGCTNGGnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnGGAGCCATAAAAG 

TTGTTAAAGCCTGGGGGTGCCTAATGAAGTGAAGCTAACCTCAACATTAATTGCCGTTNC 

CGCTTCACTGNCCGATTTNCCANTTCGGGAAAACCTGTTCNTTGCCAAGCTTGC 

Sequence 1292 

AGGTACTGTTGAATGTAACATTAGAAATTGGGCGGTTAATTAACATTCCCTAAGTCATAT 

GCTTTAATTTGATAATGATTGCAATCCAAGCTCTGTGGTTCCGAAAATGAGATTTGTAAA 

TTCAGTTTGCATTCCCCCATCTTTAGGAGTACCTGCCCGGGCGGCCGCCCGGGCAGGTAC 

AGAAAGAAATATGTCATCTACATCGAGCGGGTGCAGCGCGAGAAGGCCAACGGCACAACT 

GTCCACGTGGGCATTCACCCAAGCAAGGTGGTTATCACCAGGCTAAAACTGGACAAGGAT 

CGGAAAAAAATTCTTGAACGCAAAGCCAAGTCTCGACAAGTTGGAAAAGAG 

Sequence 1293 
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CCGGGCAGGTACTCATCAGATGGMTGTTTTACCCTGCCGAGGTTTTAGTCATGATGTGC 

TGAGCTCTCTGCGTCTGACGTGACTGACTGGTAGCTGGGCGTCGGCGTGACCCTTCTTTC 

CCTCTAGGATCACATCATGGAGATATTTTCCACCTATGGGAAAATTAAAATGATTGACAT 

GCCCGTGGAAAGGATGCATCCCCATCTGTCCAAAGGCTATGCGTACCT 
Sequence 1294 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAAACTTTTACTTTGGTTTTGTTATCTGACTAT 
TGGGTCTTTGACACTTAATGGGCACTTC I I I I I I M l I I ATTTAATTTAG AAGTTATGTA 
ACTTCTAAAGCTATGGTTCTCAAGTGTGATATCAGGATTCCTGGAATTTCCTGAGAGCCT 

TTCATGGTGTCTATGAGGTCCTTGTTTTTCCAACTATGTATCTGTGTCAGGTCAGGTTTT 
CTTCATGTACCT 

Sequence 1295 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGTGGGATTTGTCTTTT 

ATGAGTTACCAGTTCAGAGACTAGTTCATCTTTGGTTGTATAGGATGTTAGGATCTCAGT 

TGGCATTCACAGTTAAATCATGCACCTTCAGGAACTGGGAGCA7TTTGATCCAGAGTCAC 

AATCATTCCTTTCTTATCTTCATCCCTATGGTATGTGTGTTCTGAAGTTTAACTGACAGA ' 

TGGCAGCTGGTACCTGCAGGCCTCCTACACCTACCTCTCTCTGGGCTTCTATTTCGACCG 

CGATGATGTGGCTCTGGAAGGCGTGAGCCACTTCTTCCGCGAATTGGCCGAGGAGAAGCG 

CGAGGGCTACGAGCGTNTCCTGAAGATGCAAAACCAG 

Sequence 1296 

GCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGAAGTTTGCGTTCATCGTCCACTT 

CMTGAGTTTCTGGCAGCCAGTGGCTGGGAAGGAGATGTTCAGCTTCATCTTGAAGCAGC 

TGAACGCCTCCGAGGCGCCACGGAAAAGAGGCCCCGCGTACGCGGGATTGGGATTGGGGG 

GA AAGAGG GAGCAACGGCCCATAGCCTTGGGGTTGGACATCTCTAGTGTAGCTGCCACAT 

TGATTTTTCTATAATCACTTGGGGTTTGTACCTGCCCG 

Sequence 1297 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGAAATCTGCAAGCCAGCTTCACCCCT 

CCCATCGAGTTCTGGTCAGGAGGCTTCTGGTTGAGTTCAAATTGCTATAAAGTTCAGCTG 

GAGCCTTCCTTCTCCCTGTGGCCTTTTCCCAGTAC I lllllll ICTTTTTN l TTTTTTT I 

CCMTACTTGTCTTAGCTTGTTTTTCATTTTCTATCTTGAAACTCT^ 

TTTTCTATCTTGNTTANTATCTTCCAATTTTCTTCTAAGTTTGTACCT 

Sequence 1298 

AGGGC GAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTAC l rTTTTTTTTTTTTTTTTTT 

I i I 1 1 GGAATGGGTTATGAGGAATTGTGGTCAAAAMCGAGAACTTCTTTTTGGTGTTAA 

CAGGGCTATGTCTGCTTTGCTCTAGTGATTCCTAACCTACTGTGCGAAGGAGGCATAGCA 

AGGCAAGACTGTTTAAGTATAACCATAAAGGGACTCACGCACTCTCAGGTGAAGTGCAAA 

AATACTGATCACGGTGCTCAAACATTCTGGAGGAGTAAGCATTCAAATTCAGAAATGTAA 

GACGATTTCTCAMGAAAAGAGGAAAAACCTATCATTAATTTTTTAAMTATACCTTCCA 

GAGCAGGTTTCTCAACAATTTTACATAGAAGACGAGTATTTTTGTCTCCTTAAGATTTGG 

ACTTAAGGTCAAAGTTGTTGAAGGTTGATCGGCTATTGTAAGCTTCAGGTGAAACCTANA 

GAGGAGAAGGGMGAAGATAAATCTACTATCCTAGATTCTTTTT 

Sequence 1299 

ATACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTGGCCTCTCTGGG 

ATAGAAGTTATTCAGCAGGCACACAACAGAGGCAGTTCCAGATTTCAACTGCTCATCAGA 

TGGCGGGAAGATGAAGACAGATGGTGCAGCCACAGTTCGTTTGATCTCCAGCTTGGTCCC 

CTGACCAAAAGTGTACCTGCCCGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnTGTTCCCTTTTAGTGAGGGTTAATTTGCGCGCTTTGGCCGTAATTCATGGTCATAG 

CCTGTTTCCTGTGTGAAAATTGTTATCCGCTCACAAATTTNCCACACAACATACCGAGCC 

GGGGAAGCCATTAAAAAGGTGTAAAAGCCCTTGGGGGTGGC 
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Sequence 1 300 

GCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGATCCATGAATACATTTCC 

AACAAGAGCACTGGCCAAGTCAGCTTCTTCTGAGAGAGTCtCTAGAAGACATGATGCTAC 

ACTCAGCTTTGGGTCTCTGCCTCTTACTCGTCACAGTTTC7TCCAACCTTGCCATTGCAA 

TAAAAAAGGGAAAAAGAGGCCTNCTGAGACACTCTCAAGAGGATGGGGAGATGACATCAC 

TTGGGTACCTGCCCGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnGCCGCCGCTTGGCCGTAATCATGGTCATAAGCTGTTTCCT 

GTGTGGAAATTGTTTATCCGCTCACAATTTCCAC 

Sequence 1301 

CCGGGCAGGTACAGATCTCTGCAGTTACAGGAACATCGTCTGCTTCATGGTAAATGCAAT 
TTCCTGTTCAGAAACCACACTAACATTACTTCTTTCAGTATrTCTGCCTCAMGTTGGTT 
TAAGATTATC ATC ATTATAMG AAAMTTTTGTNCTANTTGCTGGN CCAAAGGG AAAA . 
Sequence 1302 

AACTCCTATAGGGCGTTTTGGAGCNCCACCCGCGGTGGCGGCCGCNCTAGAANTAGCGNA 

NNCGGCNTTCNGNAGGANTANTTTATCAAGCTNATCNATACCGTNGACNCCGANGGGGNG 

CANNGGAACCCANACTTTGAGTAAnnnnnnnnnnnnnnnnnnnnnnnnnnnnnNANTAAN 

CATGGNCATAGCTGGGACCNGTAGTGAAAATTGTTATTCCGCTGCACAATTCCACACAAC 

CATACCGAGC 

Sequence 1303 

AGGTACACTTTTGGCCAGGGGACCAAGCTGGAGATCAAACGAACTGTGGCTGCACCATCT 

GTCTTCATCTTCCCGCCATCTGATGAGCAGTTGAAATCTGGAACTGCCTCTGTTGTGTGC 

CTGCTGAATAACTTCTATCCCAGAGAGGCCAAAAGTAACCTTCCCNGGGCGGGCCCGCTT 

CTTAAGAAACNTAGGTTGGGATCCCCCCCNGGGNCTTGCCATGGGAAATTTCCGAATTAT 

TCAAAAGGCCTTTATTCCGAATTAACCCCGGTTCTGAACCCCTACNGAAGNGGGGGGGNG 

GGCCTCCCGNGTTTACCCCTCAANGCCNI I I I I I 1 1 NGTTTTNCCCCC I II I I I AAGGTC 

GGNAAGGGGGGTTTTAAAATTTTNGGCCGGCCCGGCCTTTTGGGGTCNGGTTANAATTC^ 

AATTGGGGNTACCMTTAAAGGCCTTGGGTTTATTCNCCTTGGGTGGNTNGGAAAAAAAT 

TTGGGGTTTTMTTCCNCCGGNCTTTCNAACCAAANTTTTCCCCAACCCAACCAAAACCC 

AATTANCCCGGANGGCCTCCGGGGGGGTAAGGCCAATTAAAANAAGGTTGGTTAAAAAAG 

NCCCCTTNGGGGGGGGGNTTGNCCCCCTTAAAANTGGAAAGGTTGGGAAGCCCTTTAANA 

ACCTTCCAACCAATTTTAAA 

Sequence 1304 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAGGCCAAGGTCCT 
CTGTGACTCGCCGCCCACTACCCAAGTGAATGAGTCTCCCCTAGAGCTTTGCTACTCAGA 
GGGGTCTGAGGACAACAGCATGGGCCAACACGTGCACTCGAGCTGCCTGGAGATCTTGTT 
CAAAGGCAGATTCTGAATGAGTAGGTCTGGGTTGGAGCCTGAGAGTCTGTACCT 
Sequence 1305 

ATTGGTCTCCCCGCGGTGGCGGCCGAGGTACCATGCAGAAACCTGGCCAGGCTCCCAGGC 

TCCTCAl i 1 1 iGGTGCATCCACCAGGGCCACTGGTAnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTCTGCAAGTT 

TACTACTGTCAGCAGTATAATAACTGGCCCCCGTACCTGCCCGnn nnnn nnn nn nnn nnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnGTCATAGCTGTTTCCTGTGTTGAAATTGTTATTCCGCTCACAATTCCANCA 

CAAACATACGAGCCCGGNTAGCATAAAGTGGTAAAAGCCT 

Sequence 1306 

CCGGGCAGGTAC Ml I I M I M I I I M I I I M M I H > NGCTATAAAATTGGTATTTGTA 
TCTTCCTCCCATTCTGCGCTCCCCCAACCCTGCAGGAAGGTGGATTCTTCAAGGGAACTG 
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GAGGAGACTCTTCTGTTACCACCAAGCCAGGAAGTTTCATGGCTTGGGAAGCCCCGCGTA 
CCTN 

Sequence 1307 

AGGTACGCGGGAACACATTTCTTTGGGATTTTGCCCTTCCTGGGGTATAGGGGATCAGAA 

ATATTGATCCATGTGCACGCAGATAAMTGGCTTCTGCTAAACAGACTAAAATCTTTCTC 

TCTAGTC7TTCTCACTTGTACCTGCCCGGGCGGCCGCTCGACCGTCTTTCCCTTTCGCCC 

TCCACCTCCTTCCTGAGACCAGCTGTGTTTCAGCTCCCCCTTCCCTGGGCCCACACTGAA 

TCTCTGTTACCCTTACAAACCCCTGCTGCCTCTAAGAGTAAGGACCCCTATCCCATCCCA 

AACTCAAACCTAAGCACATGCAGCCTCATCCCATTTCTCCAAAACATGTGAAAGCAAAAG 

AAAATAAAAAACTMCTCAGCAATCTTGTTCTTTCCTACAGTGGNCrrTTGGCTAC 

Sequence 1308 

GATCATATAGGGCGAAATTGGANCTCCACCGCGGTGGCGGGCCGCCCGGGCAGGTACGCA 

GGGGGCCTCTGTAAGGCACATGGAGGTGATGATGTTTCTTAGAGAGAAGATCACTTGAAG 

AAACTTCTGCTTTAATGACTTTACGAAGCTGGCAATATTACAATCCTTGACCTCAGTTGA 

AAAGCAGTCATCTTCAGCCGTTTTTCCAGCACCTATAAGCCACCCCCCAAGTGTGGTNTA 

TTGCCCTCCCTCGGATTGCCTCCNGTTAGCTTCGGGCNCGCTTCTAAGAAAACCTAGGTT 

GGGAATCCCCCCCCGGGGCCTTGCCAGGGAAATTTCGGAATNATCAAANGGCCTTAATTT 

NGAA TACCCGG T TCGA ACNCCTTCNGGAGNGGGGGGGGGGGCCCCCGGGTTAACCCCCCA 

ACCCi i i i iTTG GTTTTCCCNCNTNTTTAAGTTGGAAGGGGGGTTTTAAAATTTTGGNCC 

NNCGGCCTTTTGGGGCCGTTTAAAAATNCCAATTGGGGTTTCCMTTAAGNNCNTTGNTT 

ATNTNCCCCTTGGTTGGATTGGAAAAAATTTTGGTNTAATTTCNCCGGCCTTCCAACC 

Sequence 1309 

GCGGNGGCGGCCGCCCGGGCAGGTACCAGCAGAAACCTGGCCAGGCTCCCAGGCTCCTCA 

TCTATGGTGCATCCACCAGGGCCACTGGTAnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCTGCAGTTCTGGAAGATTTCTTGCAGTTTT 

ACTTACTGGTCANGCCAGTATAAATAAAACTGGGCCCCCCGGTACCTTCGGCCCGCTTCT 

AANAMCTTAGGTTGGGAATTCNCCCCCNGTGGNCTTGCCAAGGGAAATTTTCGGAATAA 

TTCAAAAGCCTTTAATTCGNAATTACCNCGGTANCCGAACCCNTTNCGNAAGGGGGGGGG 

GGGGGCCCCCCGGGGTTAACCCCCCAANCCTTTTTTTN^ 

TTGGAAGGGGGGTTTAAAATTTTTGNCNGGCCCGCCCTT^ 

AANTTGGGGTTTCCMNTTAANGCCTTGGTTTTTNCCCCNTTGGTTGGTNGGNAA 

NTTGGTTTNAATTCCCCGGCCTTTCCAACCAAAATTTTTCNCACCCAACCCA 

Sequence 1310 

TGCCAGTGGCCCTGTTGGATGCATCATAGATGAGGAGCCTGGGAGCCTGGCCAGGGTTTC 
TTGTTGGTACCTCGGCCCGCTCTAGAACTAnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 
nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 
nnTAGTGAGGGTTAATTGGCGCCGCTTGGCCGTAAATCATTGGTCATTAAGCTGTTTTCC 

TGTGTGGAAAATTGGTTTATCCCGCTTCACAATTTCCACACAACAATACCGAAGCCGGGG 
GAGCAT 

Sequence 1311 

CCGGGCAGGTACAMCTTATGAAGAAGGTCTCTTTTATGCTCAAAAAAGGAGACAAGGTC 

TCAGGAAAGGAGTGGGTAAAACACTTATCTCCAGAAAGCCAGCAAGAGGGAAGAAGGAAG 

AAAACCCCATCCTGCATGGTATGATGCTGAGATTCCTCACATATCTTCAAAGCATCCTAA 

GAAAAATTGAAGGACCCATGTTTTCAGATCAAGATGCTCCAGCTAGTAACCTGAACAATG 

GAGAAAAAGGAACTCACAGCAGAACATCTACCTGGAAAATGCAACCACTTTGGGAAAAGG 

GGAAA TACT AAAAGTTTCAGAGAACAGAAGTGTGTCACATACAAAAGACCAACAATGAGA 

ATGGCTTTTAACTTTTCAATATCAGCTCTAGATGTTTAAAGACAATAA 

Sequence 1312 

AGGTACGCGGGGACACTTCCGGGCGCGGCGCCGGCTTGCTGCCACTGCAGAGCCCCGCCA 
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TGGAAGACACGCCGTTGGTGATATCGAAGCAGAAGACGGAGGTGGTGTGCGGGGTCCCCA 

CCCAGGTGGTGTGTACCTGCCCGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCGGGCTT 

GCAGGAATTCGATATCAAAGCTTATCGATACnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTGGTTCATA 

AGCTGTTTTCCCTGTGTGAAAATTGTTATCCCGCTCACAATTTCCACACAAACATTACCG 

AGCCCGGGGAGCCATTAAAAGTTGTAAAAGCCCTGGGGGTTGCCCTAATGGAGTTGAGCT 

AACTCACATTTAATT 

Sequence 1313 

TACCAGTGGCCCTGGTGGATGCACCATAGATGAGGAGCCTGGGAGCCTGGCCAGGGTTTC 

TGCTGGTACCTGCCCGGGCGGCnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTTGGCGTTAATCATTGGTCATAGCTGGTTT 

CCTGTGTGAAAATTTGTTTATTCCGCTCACAATTTTCCCACACCAACATACCGAAGC 

Sequence 1314 

ATANGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGGGGGCCAGTTATTATACT 

GCTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTACCAGTGGCCCTGGTGGATGCAC 

CATAGATGAGGAGCCTGGGAGCCTGGCCAGGTTTCTGCTGGTACCTGCCCGGGCGGCnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTGAAGGGGTTAATTG 

CCGCGCTTGGCGTAATCATGGGTCATAGNCTTGTTTCCTGTGTTGAAAATTGTTTATCCG 

C NTCAC AAATTCC ACAC AAACATACCG AAG 

Sequence 1315 

TACTTAGGGCAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGATGTGGAGAG 

GACCATGTGAAGAGAGAAGCTGAGACTGAAAAGGATTTATGTATTAATATTGACAGAAGC 

CAAGGAACACCATCTGAAGTTCTGACGGCAACATCAGAAGCTAAGAGAAAGGCATGGAAA 

AGATTCTCACCTAGAGCATNCAGAGGAGAGGTTGGTCCTGCAGACACCTTGTTTTCTGAC 

CTCTGACCTCCGCAACTGTGAGGGAAGAMTTTCTGTTGCTTAAAGACACACAGCTTGTG 

GTACCTGCCCGGGCCGGCCGCTCTAGAACTAGTGGGATCCCCCGGGCTTGCAGGGAATTT 

CNATATCAAAGCCTTATCCGATACCCGTCGACCTCGAAGGGGGGGGGGCCCCCGGGTACC 

CAAGCTTTTGTTCCCTTTT 

Sequence 1316 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGATGTGGAGAGGACCATGTGAAGAGA 

GAAGCTGAGACTGAAAAGGATTTATGTATTAATATTGACAGAAGCCAAGGAACACCATCT 

GAAGTTCTGATGGCAACATCAGAAGCTAAGAGAAAGGCATGGAAAAGATTCTCACCTAGA 

GCATCCAGAGGAGAGGTTGGTCCTGCAGACACCTTGTTTTCTGACCTCTGACCTCCGCAA 

CTGTGAGGGAAGAMTTTCTGTTGCTTAAAGACACACAGCTTGTGGTACCTnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCGTN 

CGACCTCGAGGGGGGGGGCCCCGGTTACCCAGCNTTTTGTTCCCTTTAGTGAGGGGTTAA 

TTGCG 

Sequence 1317 

TACCAGTGGCCCTGGTGGATGCACCATAGATGAGGAGCCTGGGAGCCTGGCCAGGTTTCT 

GCTAGTACCTGCCCGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnCGCGCTTGGCGTAATTCATNGGTCATAGCCTGTTTCCTG 

TGTNGAAATTGTTATTCCGCTCACAATTCCAC 

Sequence 1318 

CCGCGGTGGCGGCCGAGGTACGCGGGATTGATGATGGGAGGCAGTGAGTCTTGATGATAA 
GGGTGAGAAACTGAAATCCCAAACACTGTTTTGTTGCTTGTTTTATTATGACCTCAGATT 
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AAATTGGGAAATATTGGCCCTTTTGAATAATTGTCCCAMTATTACATTCAAATAAAAGT 

GCAATGGGAGAAAAAAAAAAAAAAAAAAAAGGTACCTGCCCGGGCGGCCGCTCGATGC7T 

GGTGCTGMGTATAATTAATATTTGAGTGTTTGCTATGTGCCATAAACTTTTCTAAATGC 

CCTTTACATACATTAGTTATCAAAACAACCCTNTCCTGGGTAGGGGCATTTATCCCCATT 

CTTCAAGATGAGGGAMCTGAGGCCACCAAAGAMTGGTTTCCAAGGTTTGGTACCCTCG 

GCCG 

Sequence 1319 

CCGGGCAGGTACCATGCAGAAACCTGGCCAGGCTCCCAGGCTCCTCATCTATGGTGCATC 

CACCAGGGCCACTGGTAnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnAGCAGTATAA 

TAACTGGCCCCCGTACCTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnAGTGAGGGTTAATTGCGCCGCTTGGCGTAATCATGGGTCATAGCCTGTTTTCC 

TGTGTGGAAAATTGTTATCCCGCTCACAATTTCCACACAACATACCGAGCCGGGGAGCA 

Sequence 1320 

AGGTAC I 1 1 I 1 I 1 1 I I I I I 1 1 I I I 1 I I I 1 1 1-1 I I 1 ! GTTTCTTGGTTTCTGAGTCCAGTC 
CCCTTTGCATAAACGCCCGGCGTGGCTCTCTCAGCGTCCCCGCACCTTCATGCNCAGCAN 
ATTCTTGGACAACTTTGTCTTTCTCT 
Sequence 1321 

AGGTACCAGCAGAAACCTGGCCAGGCTCCCAGGCTCCTCATCTATGGTGCATCCACCAGG 

GCCACTGGTAnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnAGCAGTATAATAACTGG 

GCCTGTGTACCTGCCCGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnNGGTTAATTGCCGCGCTTGGGCCGTAATCATTGGGTCATAGCNTGTT 

TTCCTGGTGGTGAAMTTTGTTATCCGCTCACAAATTCCACACNAACATTACCGAAG 

Sequence 1322 

AGGTACAGATAATGAGGGTTGTAAAATGAAAGAGAGAGGAGAGGTGAGACGAAATGAAGA 

CAGCTAATTTTTCATTCGACAAAATCAGGGTTCAGAGACATTTGTAGCTACACTGCCCAA 

TGCAGCTACTATTAGCCACATGTGGCTACTTAAATTTAAATTAATTAAAATTAAGTACCT 

GCCCG 

Sequence 1323 

CCGCGGTGGCGGCCGAGGTACCAAGTCAGGTTGTTCATTTGAGCCAACAACAGATTTCTT 
GGTTATTGTGCTATTGCCACAGTAGGTCCTATTTGCATTGAATTTGTGTGTTTTTAGCTT 
CTCTCCTTTTACGTGTTGCTTTATGCTACTCTCCAA 
AAGTTGTATAATTTTTCTTTGCTCTGTTTTGTTTTG 

TAfAAGCCATTTCCATCATCAGTTTCCTGTCTCTGCTGGGGGTTATCTTAGTGCCTCTCA 

TGMTCGGGTGTTTTTCAMTTTCTCCTGAGTTTCCTTGTGGCACTGGCCCGTTTGGGAC 

TTTGGAGTGGTGGATGCTTTTTTACACCCTTTCTTCCACATTTCTCATGCCAAGGTCACC 

CACCAT 

Sequence 1324 

ATAAAGATGATGTCTTTTTGTTATTGNCTGTCTGTTTNGTATGTGTCTGAGATAAGGGAT 

AGAGAGGAAACATCCGTCAGGCTAATTTAACTACATTITATTTTAAAAATAGAGAAACAT 

AACCTCTAGATGGGACAGCAGAGGACAGTTAGTANAGGCCACAAACTGTTATGGGCTGCT. 

GTGTTTTGTTCTAAAATCAATATGGTTGGAGCATGTATATCTTANGTGATCATTTCACAT 

CTTAGGAATGCCTACTCATTTTATTTTATTCTAGTGATNGCTCAATTCACTAm 

ATTATATTTTCTCTTCTGGGGCACCTTATACAAAATATCTCTTCACCTACTTAGTTCTAC 

AGGGTTTT 

Sequence 1325 
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CCGGGCAGGTACCTTCTTGGCAGAGCCAAAGTCACCCGTTCTGCTTTTGAGGGAAGGAAT 
TATGAGAAATTCCAACTTCTGCCTTCAAGCAGGAACTTTCCCAGCCATTTCCTTGAAAGG 
TTGTTTACAAGGTCACAGGCAGCCACAAACATGATAAAAATTCATTCCTGTACCT 
Sequence 1326 

CCGGGCAGGTACTCCAAAGATTCAGGTTTACTCACGTCATCCAGCAGAGAATGGAAAGTC 

AAATTTCCTGAATTGCTATGTGTCTGGGTTTCATCCATCCGACATTGAAGTTGACTTACT 

GAAGAATGGAGAGAGMTTGAAAAAGTGGAGCATTCAGACTTTGTTCTTTCGGCAAAGGA 

CNTGGGTCTTTCTATCCTCTAGTACCCTTNGGACCGCCTCTTAGAACCTANGTTGNGNAT 

TCCCCCCCGGNGGCTTGGCAANGGAMTTTCTTATTAATTCAAAGNCTTTTATTCNGCAT 

TAACCCGNTCCNGAACCCCTTCTGNANGNGGGGGGGNGGGGGGCCCCCCCGGGGTTANNC 

NCNCCAAGGNCC I II I I I I I I GGTTTTTTCCCCCCTTNTNTTAANGGTTGGNAAGGGGGG 

GNTTTTAANAAATTTTNGGCCCGNCCCGGCC^ 

TTTGGGGTTCCCNATTTNAAGGNCATTGGGTTNTnTNCCCNTNGGGTGGNTTCGGNAAA 

AAANTTTTGGTTNTAATTCCCCGGTCTTTCCAACCAAAA 

Sequence 1327 

CGCGGTGGCGGCCGAGGTACTGNAAATCGTCAAGGGTGAAAGGTCGATCGCNTGCCAGCT 

TGCTTCCACTNGATTCTCGGATTTTTTTACCGGTANTGATGTTGTAACATTCGTATCCAA 

GGCTCCGCAAAGTTNGNCACNTCGGTATGTGANAAGGCTNTCCTCATTCCCACNTATATT 

GGNGTTACCCTTGGCCCNCGGGTCCNAGGCCCNCCTTAATTAANAAACNTTANNTTTGNT 

A7TCNCCANCCGGGNCNTGNCMNNGTAAATTTTTCGTAATTAATACNANANGNCTTTTA 

ATTCCGNNATTANCCCGGTTCTNTAACCCCCTANGATAGGGGTGGNGNGTGNGNCCACAC 

CGGNGNTTANCACCCNAANNATCCTTTTTTNTGNATATTTNACCCNTNTTTNNATGNTTG 

GNAAGGGGNGGTAATTAAAAANTATNGACNCNTCCNTCCNTATTGGGAACNGAATAAAAA 

NTTCAAATTGNGGGATTCCMNTTNAGTCCATTTGNCTATTNTTCCCCCTTGGTTGGANT 

TGNTAAAAAAAA 

Sequence 1328 

GGTGAAGATGCTGAGCCGGAATCCGGACAATTATGTCCGCGAAACCAAGTTGGACTTACA 

GAGAGTTCCAAGAAACTATGATCCTGCTTTACATCCTTTTGAGGTCCCACGAGAATATAT 

AAGAGCTTTAAATGCTTNCCAAAACTGGAACGAGTATNTTGCAAMCCATTCCTTGCTTC 

GNTGGATGGTCACCGTGATGGAGTCAATTGCTNGGCAAAGCATCCAGAGAAGCTGGCTAC 

TGTCCTTTNTGGGGCGTGTGATGGAGAGGTTAGAATTTGGAATCTAACTCAGCGGAATTG 

TATCCGTACCT 

Sequence 1329 

CCGCGGTGGCGGCCGCCCGGGCAGGTAQGCGGGGGAGCAGAAGCAGAAAAGGGTTGTTGT 

TACCAACTTGAATGGTCACACCGCCCGAGTCAATTGCATACAGTGGATTTGTAAACAGGA 

TGGCTCCCCTTCTACTGAATTAGTTTCTGGAGGATCTGATAATCAAGTGATTCATTGGGA 

AATAGAGGATAATCAGCTTTTAAAAGCAGTGCATCTTCAAGGCCATGAAGGACCTGTTTA 

TGCGGTGCATGCTGTTTACCAGAGGAGGACATCAGATCCTGCATTATGTACCT 

Sequence 1330 

AGGTACAGATACATGGACACAATCATGGCAGCCAGCTCGAGGCCCCCAATTCCAGCTGCC 
ACACCACCCACGGTGACTNGCATTAGTTCGGATGTCATACAAAAGCNTGATNTGAAGCAA 
CACCTCTACGTTTTTGGTCGTAGAGCTC I I I I I GCTTTGNGNTGCAAGGATTTTCATTTT 
GGGGCTTGTTGNTTrGGGATTGNACNGNTTTGTTCATTTTGNNMNACCATNAAATTG 
TGTGTGTAAAN AANGG NCCAACCTTG ATTTTC ATTCTTNTTTGG N ATAGTAT ATGGGN G N 
TTGTAAGGTTTCNCCTTTCAAAAAAMNTTCCCCGNTTTANTTAGNATT^ 
AAANGACTCCAACCATGTNCAAACCTTTTGGAAGGCCCCCCTTTT 
Sequence 1331 

AGGTACGTGGGCCTGTCTGCAAACCAGTGTGCCGTGCCAGCCAAGGACAGGGTGGACTGC 
GGCTACCCCCATGTCACCCCCAAGGAGTGCAACAACCGGGGCTGCTGCTTTGACTCCAGG 
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ATCCCTGGAGTGCC7TGGTGTTTCAAGCCCCTGCAANGAAAGCANGAATGCACCTTTCTT 

GANGGCCACNCTCCAGCTGACCCCCGGCACTGNTTNTAGTAACGTAGCTGGGTAATCTCC 

CCGGGTGCCTGGCNAAGGGAAATTTTCCGNATANTTCAAAAGGCNTTTATNCCNNNTTAA 

CNCGGNTNNGTACNCCTTCNGTAAGNGNGGGGNGGGGCCNCTCNGGGTTTACCCCCAAAT 

CTTTTNTATGGNTTTTCCNCNTTTTNAAGCTTGGAAGGGGGGTTATAAMTTCTGGACC^ 

TCCGGCCTTNTNGGGCCNGGTTAACATTTCTAATTGNGTNTCCAATTAAGGGTCCNTGGN 

TTTTATTCTCCTTGGATTGGTTGGA 

Sequence 1332 

TTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAGTGAGGCATTTCCTGATTTT 

TACTCATAATATTTACAMGATTAAAAGAGATAAAATTCCAAATGTGMGGCTTAAATTG 

TTCAAATCCTTACTCAAAGAAATAGTGGGAAAGGTGGGTAAGTCAGAGTCTGCTGCCGAT 

GAAGGAAAAAAATCAAGTTAATCTCTTCTGAAATTATCATGATTTGTTTGGTTTATNAT^ 

CCCAAAAAAAAAAAAAAAAAAAAAAAAGTCCCT 

Sequence 1333 

AGGTACTAAAGCATTCATGGAGGCTCTTCAGGCTGGTGCAGACATCTCCATGATTGGGCA 

GTTTGGTGTTGGCTTTTATTCTGCCTACTTGGTGGCAGANAAAGTGGTTGTGATCACAAA 

GC AC AACG ATGATGAACAGTATGCTTGGG AGTN ATTCTG NTGGAGGTTTCCTTTC N ACTG 

TGGCGTGCTGACCCATTGGGTGGAGNCCCATTTGGNACAGNGGGCTACCCTTGCTCCGAG 

GNCNGGCANGCTTGCTACTAGNAANCTTAGATGNGATCCCCCCGNGGNCTGGCNAGGGAA 

CTTNCGTATTATCNAAGGCCTTATTCCGGATANCCCGCTCCGTACNCCTANGGATAGGGG 

G TGGG CNCCCCGGTTACCACCAANCATTTTNTGGTTTCNCCCTTTAAAGGATGNANGGGG 

CTTTTAAAATATNGCTGCCGGCCTTTGGGCCGGTTAAAATTCCAMTTGGGGTTCAANT^ 

AGTCCTTGGATNTTNCCCCTTGGTCGNTTNGAAAAA 

Sequence 1334 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCAGGGTCCTGGTGGGATGGTGAACCGTGACGA 

GTACGCCATCAGGGTAAAGAACTCGTCCAGACTGTCAACTGTGATCCTGGACTCGCTGTA 

GGTGAGTAGCGAGTTCTTAGATCCTAAGAGACTGATGCATACATGGGGAAAAACAAATAT 

AAAACCTGGCAGTTGTACCT 

Sequence 1335 

CGGGCCCGCCCGGGCAGGTACATTTCCTTGTAGACTCTGNTAATTTCCTGCAGCTCCTGG 

TTGGTTCTGGAGCAGATGATCTCAATGGAGAGAGTCCTCGTCGGTTCCCAGCCCCTNTCA 

TTGCGAAAGCTTTTTANCNTCANAAAGCCGTTCATTACCTGAGGCCAGGGTGGTCTTTCA 

AATAAGGGCCCCAAACAATTCAACCGGTNCTTCCAAGGGGTGGGGCCMGNATAAANGGG 

CNTGGANCTTNTCAAGCTGGCTTGNATTGNCAAAGNTNTCCCl I I I II i INGGGTGCCCT 

TACTACTNGGTAAGGGGCGNAAAAGGGCNAAATTAATNCCCTNGTTNCTTCTTGNTCGNC 

AANANTGNCNTNGCCGNGGTATTGGNNTCCANNAAAATTGGTNTGGAACCAAAATGGGGT 

NGGAACCCTTTCAATTCTCCMCCAACNCTTTTTTGGGNTACN7TAGGCAATNGNGGCCT 

TGNNTTTCCCAAAATTGGTTTACCACANAAGGCCAAATTCNCCCGNCCTTTCAAGANCAA 

TTTCNAAAACCGGTTTNAAGGTNAATTAAGGGGNNCTTTTTTGGGANCANGGGAACCCCC 

CNATTAATTTGNCCAACCTTTTGGGGGGGG 

Sequence 1336 

GTGGGCGGGCCNGCCCGGGCAGGTACAGCTGAAGCCTCGTAACCCAGACTACAGAAGCAA 

TAAACCCATTATTCTCAGAGCTCTCTGTGACTACAAACAAGATCAGAAAATCGTGCATAA 

GGGGGATGAGTGTATCCTGAAGGGACAACAAACGAGCCGCAGCAAGTGGTACCCTCGGCC 

CGCTTCTAAGAAACTAGGTGGGATCCCCCCGGGGCCTGCNAGGNMTTTCCGAATATTCA 

NAGCCTTATACGAATACCCGTTCGNACCTTCTGAAGGGGGGGGGGCGCCGGNTACCCCAA 

GAN fTTTTI IGTTTNCCCCTTTTNAGATGAAGGGGGTNTAAAAATNTGNCGCCGCCNTTT 

GGGCCGTTAMTTCAATGGGGTTCCATTAANGCCTGGTATTTTNNCTTGGTTGN7TGGAA 

AAAATTTGGGTNTAATTCCCCGGCCTTCCAACCAAAATTTTNTCCACCAANCAAAACCAA 
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TTANCNGNAAGGCCCCGNGGGGAAGGCCAATTAAAAAAAGGTTGGTTAAAAAAGGCCCCT 
TGGGGGGGGGTTGGCCCCCTTAAAATTGGMGGGTTGGMGGNCCTTAAAACCCTTCCAN 
CAATTTTA 
Sequence 1337 

CATATAGGGCGAATTGGAGCTCCAACCNGNGGGGGCGGGCCGCCCGGGCAGGTACCAACA 
GAAACCTGGCCAGGCTCCCAGGCTCCTCATCTATGATGCATCCAACAGGGCCACTGGCAn 
nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 
AGTAGCCTGAAGATTTTGCAGTTTTATTACTTGTTC 

CTCGGCNCGCTTCTAGNAACATAAGTTGGGAATCCCCCCCGGGGCCTGCANGGAAATTCC 

GCATANTCAAAGCCTTAATCCGGAATAACCCGGTCCGGACCCTTCNNTAGGGGGGGGGGG 

GCNCCCGGGTTACCCCNCMGACTTTNTTTGTTTTTCCNCTTTTTM 

TTTTAAATNrTGGNCGGCCNCCCTTNGGGCCGGGTAAMmCMTTGGGGTNCAATTAA 

GCCTTGGNTTTTNCCCCTGGTNGGNGGNAAAAMNTTGGTTTAATTNCCCGGCCTTCAAC 

CAAAATTTTCTCAACCCANCCAMNCCATTTAACTNNAGGCCCCGNGGGNAAGGCCATTA 

AAAAAG NTTGGTT AAAAAAAGCC N 

Sequence 1338 

AGGTACAGCTGGAGCAGAGTTGCACTTCACTGTGTAGAATGGCTTTATCTGAGCCACTAC 

ATTCTGCC ATTG ACTGCGTTN CTTC NC AATCTTTCCCAAGATCTCCC AC AAAAAAATGCA 

I I I I ICCCTGTCAGGGTNATGTTCATNAAACATACNCTTNTCAATAACANnTCCCAAAG 

GGTTTTGNTTTCC7TCAATNCCMCAAGGGNTCANACCGGNAGTTAAGNTTTTNGCCA7T 

CCATN C AAATAAAAANTCCCCTTTTC ATTCTTCCACCCACCTTATNTTCTCAACAN ANAN 

ACCCGGAACAGTTTANTTANAAACCCNAGGGNAAAANGAACCAAAGGAGGAACGNGGCCC 

ACCCAAAGCCTAATGGTCTTGGGGGTCCCTTTANGNAAATTANTGCCAAACAAANAAATG 

GTACCANAGTTTGGGGGNAGGAATGNGGAACCCCCCCCTTATCNATTCNAGCCTNGCGTT 

TNCCCCAAAGGGGTGGGCTTCAAACC 

Sequence 1339 

TCCTATAGGGCGAATTGGGAGCTCCACCGNGGTGGCGGGCCGAGGTACTATGGGAACAGG 

TTCAAAATGAATATAAGGGAGTTATTTATATATTAGGTAAAGCACAAAAATGATAATTA^ 

TGGACTGCACACTCACAGGTTAAAGGTCATTTGGGNGCAGNAAAAATGTCCGGACCCGTT 

TGGGGAAACAAGAACCATTAAACCTGCTTGTAAAAAGGAAGCCCAAGGGGGGAGGGGGTA 

MCCCAAAAAAGCCGTTATTTCAAGAAATANGCTTGGTGGGAAAAGCTTGGGGGGGGAGG 

ACCACCCTTGNAAAAAAAATCCTAAGCCANATTAAGGCCGTTGGAAAAAMTTCAACCTT 

TAAAAGNTTAAAAAAAAAATTAAAAAACCTTTTTCAAGGAGGGGGCCTGGAAAGGCCTTA 

TTTTTTGAAAATTTTAACCCANACGNAA 

AMGGAAGGGGAATTTAAGGTAAAAAAAATTTTCNTTTCCCATTGGCCNNAAAAMTN 
CAAACCAAGGTTAAATTGGGGCNTCCCCAATTCCAACCGG 

CCCCCAAMNGCCCCCGAAGGAGGTTNCACCNTTATTGGTTAAAGGTTGGAGGAAGGTTT 
TGGAAGGTTGCCANAATTTTTGGTTAMTTAMGGGAAAATTAAAAAAAMCCM 
Sequence 1340 

CCGCGGTGGCAGCGGCCGCCCGGGCAGGTACCATCACCCCTTCATGCTGGCCCTAAGCTT 

TCTCCAGCAGTCCCACTTCCTGATATTCGTTCTCTTCAGCAGCCTAAAATACAGCTTTCT 

TCTGTCCCCAAAGTAAGCCGCTGTGCTCATTGCCCTAATGAACCCTCCACTTCGCCAATG 

CGTTTTGGTGGTGGTGGTGGCGGTAGCGGAGGNACCTCGGCCGCTCTAGAACTAAANGGA 

TCCCCCGGGCTGCAGAGAATTCGATATC 

Sequence 1341 

CGCGGTGGCGGCCGAGGTACCAACATGTCCCGTGGTTCCAGCGCCGGTTTTGACNCGCCA 
CATTACCATTrTTTCACCCGAGGGTCGGCTCTACCMGTAGAATATGCTTTTAAGGCTAT 
TAACCAGGGGTTGGCCCTTACATCAGTAAGCNTGTCAGGAGGGNAANAGGACCTGTTGNC 
ANNTANATNTGTTCACCACCAGNAAAGAAAAAGTTACCCTNGCTCCNGGNGCGGGCCCGC 
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TTCTTAN GAAAN CTT ATGN TGGGATTCNCCNCTCGG NGCCTTGCCAAGGGAAATTTTCTG 

ATTATTTCNNANGNCTTTTANTTCNGGAATTANCCCGNTCGGATCCCTNCGGAGGGGGGG 

GGGGGGCNTCCCGGG GTMCC CCCCAAGGCTTTTTTTGGTATTCCCCCTTTTTAAGATTG 

GAAGGGGGGTTTTAAAATTTTTGGCCTGGCNGGCCCTNTTGGGGCCGGTTAAMTTNCAA 

TTGGGGTTNCAATTMGGNCTTGGTTTTTNCCCCTTGGTGGGTTGGNNAAAAAATATTGN 

GTTTAATTTCCCCGCCTTTCAACCAAAATTT 

Sequence 1342 

TCACTATAGGGCGAAATTGGAGCTCCACNCGCGGTGGCGGCCCGACATCCGCCGCCCGAG 

GGATCGTTGGCAG CCCG GGTGGGAGTGTGGCTGTGTTCTCGTCCCTGGCTGACAACGAAG 

GGGAGCTAGTCACTTTTCCTGCCACAGACGACTCGCNGTCCGCCTCTCGCCTGGAGTACC 

TCGGCCGCTCTAGAAACTANGTGGATCCCCCGGGCCTGNAGGGAATTCGATATCAAAGNC 

TTATCCGAATACNCGTCCGNANCCTCTGAAGNGGGGGGGGCCCCGGNTACCCCAGNCTTT 

NTTNGTATCCCCTTTTTMGTTGAGGGGGGTTTAAATTTTGGCNGCCGCCNTTTG 

TAATATTCAATTGGGGTTCCANTAAGCCTTGATNTTTCCCCTGTTGGTGAAAAAATTTN 

TTTAATNCCCGGNNTTCAACAMTTTGCCCACCAACCAAACCAATAACCGTAAAGCCCCG 

GGGGAAGGNCCAATTAAAMGNTNGTTAAAAAAGNCCCTTGGGGGGGTTGGCCCCTTNAA 

ATGGNAAGTTTGGAAGNCCTTAAATCCTTCCAACCMNTTTTAAAA 

Sequence 1343 

CCAATGAAAATTCCTGTTGCCATTCCAAGAATACCGCCAGCATTGGCCTTGACAATGCCA 

TTATAAAAAGGGACAAGCCCTGCNTTAAACCGGTTCATTGCCCAAGCCCTTTTAAGTTAA 

GAAACAAGGCAAAAAGGGTNCACCCCTNAATTTTCAATTGGAGGGGCCAAACCTTCCATT 

GGACCAAGGAAACCACCAAGGCCAAAACCAAGNAAAAGGTTAACCCCCTTTGGGGGNNCC 

GGCTTTCTTTAGAAAAMCTTMNGTTGGGGGAAATTCCCCCCCCCCGGGGGGGCCTTTG 

CNAAGGGGAAAAATTTTCCCGGAATTAATNTCNAAMGGCC^ 

ACCCCCGGGTCCGGNAACCCNTTCNGGAAGGGGGGGGGGGGGGGGGCCCCCCCGGGGTNN 
NCCCCCAAAAGCCN I II II I I I GGTTTTNCCCCCTTTTTTNANGTTGGAAGGGGGGTT 
Sequence 1344 

CGAGGTACGCGGGAGGAACTGCTCAGTTAGGACCCAGAGGGAACCATGGAAGCCCCAGCT 

CAGCTTCTCTTCCTCCTGCTGCTCTGGCTCCCAGATGCCACCGGAGACATnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nGTCAGAGTGTTAGCAGCTACTTAGCCCTGGTACCTTGCCCGGGCCGGCCCGCTCTTAGA 

AACNTAGTTGGGATCCCCCGGGGCCTGCCAGGNAATTCGATTATTCAMGNCTTTATCGG 

AT TACCG TTCGACCCTCGNAGGGGGGGGGCCCCGGGTACCCCAAGACGTTTTTTGTTTCC 

CCTTTTNAGGTGGAGGGGGGTTAAAATTTGCTGCCGCTTTGGGNCGGTTAMTCAATTG 

GGTCCAATMGCTTGGNTTTTNCCTTGGTGGTTGGAAAAMTTTTGGTTNTAATTCNCCG 

GCTTCCAA 

Sequence 1345 

GGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGCCATTACTGCAGGAAAAGGTCCCG 

GAGAGCTGAGCAGTCAAGATGTGTGACTTCACCGAAGACCAGACCGCAGAGTTCAAGGAG 

GCCTTCCAGCTGTTTGACCGAACAGGTGATGGCAAGATCCTGTACGTGCCAAGCATCCTC 

GTGCGACCGCGAGAGCCCGGGGAGCGGGGGCTTGCCGGCCGTTGCACTCATTTACCCGGG 

GACAGGGAGAGGCTCTTCTGCGTGTAGTGGTTGTGCAGAGCCTCATGCATNACGGAGCAT 

GAGAAGACGTTCCCCTGCTGCCACCTGCTCTTGTCCACGGTGAGCTTGCTNTAGAGGAAG 

AAGGAGCCGTCGGAGTCCANCACGGGGAGGCNGTGGTCT 

Sequence 1346 

CCGGGCAGGTACCATCACCCCTTCATGCTGGCTCTAAGCTTT7TCCAGCAGTCCCACTTC 
CTGATATTCGTTCTCTTCAGCAGCCTAAAATACAGCTTTCTTCTGTCCCCAAAGTAAGCT 
GCTGTGCTCATTGCCCTAATGAACCCTCCACTTCGCCAATGCGTTTTGGTGGTGGTGGTG 
GCGGTAGCGGAGGTACCT 
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Sequence 1347 

GGAGCTCCCCGCGGTGGCGGCCGCCACGCTGGTTTTGCATCTTCAGGAGACGCTCGTAGC 

CCTCGCGCTTCTCCTCGGCCAATTCGCGGAAGAAGfGGCTCACGCCTTCCAGAGCCACAT 

CATCGCGGTCGAAATAGAAGCCCAGAGAGAGGTAGGTGTAGGAGGCCTGCAGGTACCAGC 

TGCCAJCTGTCAGTTAAACTTCAGAACACACATACCATAGGGATGAAGATAAGAAAGGAA 

TGATTGTGACTCTGGATCAAAATGCTCCCAGTTCCTGAAGGTGCATGATTTAACTGTGAA 

TGCCAACTGAGATCCTAACACCCTATACAACCAAAAGATGAACTAAGTCTCTGAACTGGT 

AACTCATAAAAGACAAATTCCCACAGTNCCCTTCGGGNCGGTTNTAGNAACTTAGTGGAT 

CCCCCCCGGGG 

Sequence 1348 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAGATCTCTGCAGTTACAGGAACATCGTCTGCT 

TCATGGTAAATGCAATTTCCTGTTCAGAAACCACACTAACATTACTTCTTTCAGTATTTC 

TGCCTCAAAGTTGGTTTAAGATTATCATCATTATAAGAAAAATTTGTCTAGTGCTGGTCA 

AGGAAATGGGTCTGTGTTTCTTTGGCATGAGCTTTGGAATGGCTGTATTTGGAGGAGGGG 

TAAGGCAAGTCAGATCCACTACTGCGTATCATGACGCAGATGCACATGCACACAGATGCA 

CATGCTCGAAAACAGGCACCTGGAGATTTTAGNAAMTTCATTTGGGGGMGATTNGCCT 

AACTTTCCTTTCTACTTCAGTGGAATGAATTACCATTTCT 

Sequence 1349 

GGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCACACTCAGGGCAGTTTCCAGCTC 

CTCTCACAMCAGTAAATCTACACAACTTTCACAGCCAGGTCAGCAACTGGTTCAAGAAC 

CGCCGGCAGCGCGACAGGAACCCCTCCGAGACCCAGTCCAAAAGGTGAGCGCCAACTTTC 

CTCCTCCCCCGCGTACCT 

Sequence 1350 

CCGGGCAGGTACCAGCAGAAACCTGGCCAGGCTCCCAGGCTCCTCATCTATGGTGCATCC 
ACCAGGGCCACTGGTAnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 
nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTCTGCAGTTTTAACCTACCTGTTCAGCCAG 
TANTACATAACCTGGCGCCCCNGTTACNCTCGGGGCCCGCATCNTAGAAACCTTAGNTGG 

GNATTCTCCCCGCGGGGGNCTTGGCAGGGAAATTTTCGNATTATTCAAMGNCTTNTATT i 

CCGNATTNCNCGGNTCNGTACCCTTCTNAGGGGGGGNGGGGCCCCCCGGNTNACACCCAG 

GCATTTTTTNGTTTTCCCCCTTTNTAANN^ 

GGCCTTTTGGGGGCNGNTTAAAAATTCCAAATGGGGGATTCCAATTAAGNCCCTGGGGAT 
TTTCNCCTTGGNTGGGTTGGAAAAAAANTTNGGTNTTNATTTCCCCGGGCGTTCCAACC 
Sequence 1351 

CCGCNGTGGCGGCCGCCCGGGCAGGTACATCTACATACACATCAGTGCCGTGCCTTCCAT 

TCCCAGGGAGGGCAGGGTCACTAGACAACATCATTAAAATGACCCTTTATCAGAGAACGT 

TTTTCTGTTTAAGGAAATAMCTAGAGGTACAATTTTG 

GGATGTTMCCAGTATCAATAMTCACTTCGTATAATCTCAAAAAAAAAAAAAAAAAAGT 
ACCT 

Sequence 1352 

ACGCGGTGGCGGCCGCCCGGGCAGGTACCAGCAGAAACCTGGCCAGGCTCCCAGGCTCCT 

CATCTATGGTGCATCCACCAGGGCCACTGGTATCCCAGCCAGGTTCAGTGGCAGNNGGGT 

CTGGGACAGAGTTCACNTCTCACTCATTAAGTCAGCNCTGGCAGNTCTTNAAGNATTACT 

TGCANATTTTACCNACCTTGGTTCAGGCTAGTTANTANATTAAACNTGGGCNCCCCNCTG 

NTACNCTTCTGGGCNCGCTTTCCTTAGNAAACCTTANGTTGGGGATTACTCNCCCCTGNG 

NGNCTTGTCAAGTGGNAAATTTCCTGAAATTANTTCAANAAGNCCNTTTTAATNCCGTAA 

TGANCCGCCGGTTCCGTAACCTCNTTCCGTNAAGCGGGNGGNGGGGNGTCCCCCCTGNGN 

TTAACCCCCCAAAGGNCTTTNTNTTTTGGTNNTTCNACCCTTTTTTAAAGATNGGA 

GGGGTTTTNAAANTTTTGNCCAGCCGGCCNTTrTGGGGCCGGTTAAAAATACCNAATGGG 

GNTTCCAATTMGGCCNTTGNTTTTTTCNCCTTTGGNGGGTTNGGAAAAAAAT 
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Sequence 1353 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGNCATGGTACCACGGGCCT 

GGGCACTGCATCTTCCCAGAGAAGCAGACTCCACAGGGGCCTCATGACCATCTTTCTCTG 

GAGAGGCAGGCAAACTCCCTCGACAGCCAAAATCCAGAACCTCCCCCCGCGTACCTCGGC 

CGCTCTAGAACTAAGTGGATTCCCCCNGGGCTTGCANGGAAATTTCGATATCAANGCTTA 

TNCGTTACCGTNCGNACCCTCGAAGGGGGGGNCCCGGTACCCAGCCTTTTTGT 
Sequence 1354 

GGCGGCCGAGGTCGCGGGGGAGGAACTGCTCAGTTAGGACCCAGACGGAACCATGGAAGC 
CCCAGCGCAGCTTCTCTTCCTCCTGCTACTCTGGCTCCCAGATACCACTGGAGAAATAGT 
GATGACGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnAGCCACCCTCTTCC 
TGCAGNGGCCAGTCAGAAGTGTTGGCACCAACTTANGCCTGGTACCTGCCCG 
Sequence 1355 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGAGCAACGTGCCA 

TGCTACAGCCCCTGCTTGACAACCAGGTTGGTTTTAAGAACATGCAGAATGTGGAGCATG 

TTCCGCTGTCCTTGGACAGAGCCATGCNGCTGGTGAAAGATGTCTTCATTTCTGCGGCTG 

AGAGAGATGTGTCCTGCCCGGGCGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnTCAATAACAAGCTTANNTNTACNATGGGCCTNCATNGGGGGGGGCCGTTCAACCT 

TTCTGTTTGCTGTGTGCTGGCCAGAATGCCTNTTGGGCCGTCCTGTTGTTCTCTTCCCGT 

GTTGTACCTGCCCGGGCGGCA 

Sequence 1356 

GCCTGGGATACCAGTGGCCCTGGTGGATGCACCATAGATGAGGAGCCTGGGAGCCTGGCC 

AGG i i i CTGCTGGTACCTNnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnATNAAGCTTATCGATACTCGNCCGACTTTNCCAGGGGGGGAAACCTGTCGTGCCA 

GCTTGNATTAATTGAATCGGCCMCGCCGCGGGGAGAGGCGGTTTTGCGTATTGGGC 
Sequence 1357 

CCGCGGTGGCGGCCGAGGTACCCGGGAGGCTCGCATGTGTGGGTGCATGCCGTGGCAGGA 

GAAGGATGGGTGATCCCTAAATTACCTTCCGAAGAGCCATGCATGCGTCCTCTGGCTGCT 

CCAGGTGTAAAAAATGCCAGGGAGGGGACAATCATGAGTCCTAGGCAGGAAAGGTCCTCT 

GGAGAGAGGAAATGGCACACGCGCAAGCACGGAGGCATGAGGGGCCAGGGTCCTGGAGGA 

CACTGGGCCGGGACAGATCANACTCCAGGCAAGGAGGGAGATGGCGCCAGACTGTGATCA 

TAAGCTTCACGATCCCCGCGTACCTGCCCGGGCGGCCGNTCT 

Sequence 1358 

CTCATAAAGGGAATAAGCTTGCGGCCGCTTAATTAAAGATC rTTTTTTTTTTTTTTTTTI 

AACCTTTAATCTCTTTGGACTTTATCCTTGGGCTTTTCTGAAAGCACTTTCTGTCCAGCC 

CTTGCTGTCACCCAGCCAAACCTAGCTACCTCACAGGCCCCCAGCAGCTTTCCTGGCTCT 

GCTTCNAGCCCACCGGCCTTNCAACCCTTCCCATAACAATTGGTGCTACCAACCCTTCCN 

MTAACCCCCTGGTCCCCCCMCCCCTCCCCAATTACCTGGGNTCCCTTTCAACCCTTTC 

NCAACNAACNTT NGTTTCTC CNCAACCCCCNCNTTTTTAACCTTNTTTTTGGCCCTN^ 

CCCTTCNTTAACCI t I I I I I I GGCCTTCCCCATNGGGNTGNCGTTTTTTTCCCAATTTTT 

TGGCCCAACCCCTTTCNAAAATTGGNAACCCC^ 

TANGGGG l I I I I GGAAAAAAAGGNTTTTTTNCTTAAGCCTCCNTGGGGGTCTTNTAAAAA 

AANNAATNCANCCTNTTTTTTGTAAAAAANTTT 

Sequence 1359 

CCTCCTAAAGGGAATAANCTTGCGGCCGCTTCATTAATAAATAGACAAACTTCTTATATC 

TCTGCCTTCTGACCATCAAACTGAATAATTTTAATTCAACAGGTCTTATTTTCATTT 

TTGAAATTTTCCTAATTGCCTTCCTTT^ 

NGGAATAACCGCAANAAAMGAAAANGAANTGGGAAAAAMNTAAAAGTTAAAGGGGTAA 
GGGGAAATTANTAAAAAGGAAGGGTTTCCCTTCCCTTGGATTTNCNTAAAGGTTAATTCN 
GNGGMNCCCCAAAAAAACCTTGGGTGGGTTGGTTAATNCCTTTTTTCCAAGGG 
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GGGGGGGGAAGNAAGGGGAATTGGNGGCCAACCNCCCAATTCNTTTCTTCCCCCCTTCNA 

AAATTTGGCCCCMGGGGAACCNCCTTCCCCCCCCTTTNCTTNCTTTGGGAAGGGGAACC 

CNTTAANCTTTCCAACCNCCANTTGGCCCTTTCCCCCTTTCCCCGGGAACCCAANNAAAG 

GGGGCCCCCAATGCNAACCTTTTTCNAANTAAACCNTTGGGNTTNTTGGNAAGNGGGGNG 

GNGGAAAAAGNCCCCCAANGGNCTAAACCCCAACCAAAAAAAGNCCCTTGNCTAAANGGN 

GGAAGGGAAAAANTAAAGGGGGTTTTGGGGGGGAAAAA 

Sequence 1360 

CCTCCTAAAGGGAATAAGCTTGCGGCCGCTTAATTAAAGATCI rTTTTTTTTTTTTTTTI 

TTTTCCTTTNAAAACAGCGTCATACTAAAGTTTATTTTTCTTTACACM 

AGGCAGACMGGAGACATATATTGGAGTTTTCCATTTTTAATATTTGGTTAM 

CATTNTTTTTGNTAAANCAAACCGNCANGCACCAAAGGGCCAAACNAACCAATAAANGGG 

AAATAAAAAANGCCANAGGAAANAAAACCAMTAAAAANGCCCTTAGGGGGGTTTGGCNC 

CCGTTATATTGTGGGNGNCAATTCANGGGGGTTCAAGGGGCCCNTTGNGTTNCAANCTTT 

CCCCTTTAAAAAAAGGGGAAAAGGTNNGGGGGGGCCTTNTTTNTTTNGGCCCCCTTNTGG 

GGNGNCNANTTANAANGNCATTTGGGGGGGTTCCCCTCTTNGGTTTNCCCNTTAAAAGGC 

CAACCCCAAGGAANATTTTGGGGGAATNTATTNGNANACCCANGGNANGTAAAAAACCC 

Sequence 1361 

CTAAAGGGAATAAGCTTGCGGCCGCTTAATTAAAGATC I I I I I I i II I I I II I I M I 1 I G 

CAGGTTGGACGAGTTTGTATTAAGGCATTCTGACTCCTTCACTCTCCCCAGCTCCCTCCA 

TGTCAATCCCTTGGCAGAAACACTGTAGACTTATAAATGGATGACAGATGC^GGCTTTTG 

AGAGGGGACCAAAAAGGGAACCACCTTCTTCCCACTTGGGGGGTGTTGGGGGGTCAAGTC 

CATTCCTTGGGGAGGGGGNGCTTGGACCATTTAGAAGGGGGCNAAAAGCTTCAAGGGGNC 

TTTACCAGGCCCACNAGGGCCACCATTATTGCCCCATTCTNTCTTCCCTTGGTGCCAGGG 

CTTCAATTTATTNGGGGCAA I I I I I CCTTGCCAAGNCTTTTCC 

Sequence 1362 

CNGCCCGAGGTAC I I I I I I ICTTAAl ITTTTTTTTTTTTT1 I GAACCAAGCAGGTGTATT 

CTGGAAGAAMTCAGACACCCTCCCTGGTTAAGMTTAGTTAACTATAATGTATTGGTGA 

ATTGAGTTTATAGTCTTTTGGTGGGGGTAAAAGGGCCCTCAAAGTCATAGCAGGTTTAGT 

TGTACCTGCCCG 

Sequence 1363 

ATNGGAGCTCCCCGCGGTGGCGGCCCGAGGTACATTTCCTTTCTGTGGTTTTCTTTTGCT 

TCTTAGAAATTCTGTTAGTGGTTAGTAAAGAATTTGAAAGTACGCGGGGCTTTCTTTCCG 

CGCCGATAGCGCTCACGCAAGCATGGTTANCNTGCCTAAAACCCGCCGGACTTTCTGTAA 

GAAGTGTGGCAAGCACCAACCCCATAAAGTGACACAGTACCTGCCCG 

Sequence 1364 

AGGTACTCAGAGTGAGGATTTTCTGCAACCTGCGTTTGTCCCTCCAGCATCTGCTCTGGC 

TCCATGGCGGACCCTGCAAGTCACAGTCCCCGGATACCAGTCTGGGCGCCGGGAGCACCA 

CGAGCAGTAGAGCAGCAGAGTGGGACAGTCCACGACTGGGCGCCTACATGGGGTCTGGAA 

ACTCTACGACAGTCCCAAGTACCTGCCCG 

Sequence 1365 

AGGTACCTTGGTQTTACTTCCTTTGAAAGATGGACATTTGAAAGTTAAAAATCAGTCACT 
ATATAGGCACTGCTGTATGGAAAACGCATTTTTGTGTTTCTACAAATTGTAAGCGGGAAA 
TGCCTTTTGGAGTACCTCGGCCCGCTCTAGAAACTAGTGGATCCCCCCGGGGCT 
Sequence 1366 

CGNGGTGGCGGCCGCCCGGGCAGGTACTGTTTACTACACACTTAAAAACCCGAGTGTCNA 
GTGCCTTTAGGGGAAGAACACTTCTGCCTGCATGAAAATGTCTTAAAAGTGAAGGAAAAT 
TTGAAGAACGCTNTTGATCAGAAGTCCAGACTGTAAACATAATC 
Sequence 1367 

GGNGGCGGCCGAGGNACGAGGGCCTTGGTTNAGNGGGGGGAAAACCCCAAACCGCAAAAN 
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CNNCAGACNGCAGGCNGCNGNNNGTGAGAGNGAACGGGGNCCCACNCCCANANACACNGA 
ACCNGGCNGGGANACCAGNGGCCCNGGNGGANGCACCANAGANGAGGAGCCNGNGAGCCA 
GGCCTGGTTNCNGCGGGGACCNGCCCG 
Sequence 1368 

ACTATTAGGGCGMTTGGTAGCTCCACCGCGGTGGCGGCCGAGGTACTTTTTTTTGTTTG 

1 1 n i n n n icaccttmgggaggatttaatttgctcccaactgattgtcacttaaat 

GAAAATTTAAAAATGMTAAAAAGACATACTTTCTCAGCTGCAAATATTATNGGAGAATN 
GGGGCACCCACAGGNAATGAANAGAGGAAAGCANCTTCCNTAACCTTCAAAACCATT 
Sequence 1369 

CCGGGCAGGTACCAAAATGGTTTTTGAAGTTAGGGAGCTGCTTTCTCTCTTCATTCCTAG 

TGGGTAGCCCCMTCTCTCCATAATATTNTGCAGCTGNAGAAGTTATGNCTTTTTATTCA 

TTTT 

Sequence 1370 

GAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGGGCTCTGACCACTATTGGTTTTG 

AGACCACGATGTTGGGAGGGTATGTTTACAGCACTCCAGCCAAAAAATACAGCACTGGCA 

TGATTCACCTTCTCCTGCAGGTGACCATTGATGGCAGGAACTACATTGTCGATGCTGGGT 

TTGGACGCTCATACCAGATGTGGCAGCCTCTGGAGTTAATTTCTGGGAAGGATCAGCCTC 

AAGGTGCCTTGTGTCTTTCCGTTTGACGGAAGAGAATGGGATTCTGGTATCTAGACCAAA 

TCAAGAAGGGGAACAGTACCTCGGCCCGTTCTAGAACTAGTGGATCCCCCCGGGCT 

Sequence 1371 

CCGGGCAGGTACCAAAATGGTTTTGAAGTTAGGGAGCTGCTTTCTCTCTTCATTCCTGTG 
GGNGCCCCAATTCTCCATMTATTNTGCAGCTGGAGAAGTATGTCTTTTTTATTCATTTT 
TAAAl I I I ICATTTTAAGTGACAATCAGTTGGGGAGCAAATTAAATCCTTCCTT 
Sequence 1372 

NCCGGGCAGGTACTTACTTGATGTGACTCTCCTCTCATGCCTGGGCCCTGCTTACAGGTG 

TGATTGTGACACATAGCTTGGCCTAGCCCCTAGGTTATGTNACTCTCCTCTTATCCTTCA 

GTTATTTTCACAAGGGGCATTGCGACATATTGCTGGACTTGGGAACCCAGGTGATGTGAC 

TCTCCTCTACTGNCTGGAATACAACCANAAAAAAAAAAAATNAANANAAAAAGNNCCCTN 

CGNCCGCTCTAGAAANTAGTGGGATC 

Sequence 1373 

CCGGGCAGGTACCAAAATGGTTTTGAAGTTAGGGAGCTGCTTTCTCTCTTCATTCCTGTG 

GGiTGCCCCAATTCTCCATMTATTTGCAGCTGAGAAGTATGTCTTTTTATTCATT^ 

ATTTTCATTTAAGTGACAATCAGTGGGAGCAAATTAMTCCTCCCTTAAGGTGAAAA^ 

AAAAAAACAAACAAAAAANGTACCT 

Sequence 1374 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACCTGATTGCATTGCACTT 
AAATGCAGATTATTTTGGAGTTTGAAAAGGGACTATTAATGA 

TTTCTCl l I I ICCCTTCCCCGCCACTGATTCAGTGAGCTGGAGATTGGATCACAGGTATA 
ATTCAAGCTTTCATGTAGTCATGTAGATCACTAGACTCCTTGGTGTACCTGCCCG 
Sequence 1375 

ACTACTATAGGGCGAATTGGGAGCTCCCCGCGGTGGCGGCCCGCCCGGCAGGTACTTTGG 

CCTCTCTGGGATAGAAGTTATTCAGCAGGCACACAACAGAGGCAGTTCCAGATTTCAACT 

GCTCATCAGATGGCGGGAAGATGAAGACAGATGGTGCAGCCACAGTTCGTCTCATCTCCA 

GCCTGGCCCCTACCAAAAGTGTGCGGCCAGTTATTATACTGCTnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnTACCAGTGGCCCTGGTGGATGCACCATAGATGAGGAGCCTGG 

Sequence 1376 

AGGTACCTGNAGGCCTCCTACACCTACCNCTCTCTGGGCTTCTATTTCGACCGCAGATGA 
TGTGGCTCTGGAAGGCGTGAGCCACTTCTTCCGCAGAATTGNCCGANGAGAAGCGCNGAG 
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GGCTACGAGCTGTCTNCTGAAGATGCAAAACCAGCNTGGTCGGCCGCTNTAGAACTAGAT 

GANATCCCCCGGGCATGCAGGTAATTCGATATCAAAGCCTT 

Sequence 1377 

CCGGGCAGGTACCAAAATGGTTTTGAAG7TMGGGNGCTGCTTTCTCTCTTCATTCCTGT 
GGGGGCCCCAATTCTNCATAATATTTTGCAGCTGAGAAGTATGTCTTTTTATTCAI I I I I 
AAATTTTC ATTTAAGTG AC AATC AGTTG G G AGC AAATT AAATCCTCCCTTAAGGTG AAAA 
AAAAAAAAACAAAAGTACCTN 
Sequence 1378 

ATCACTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAACTT 

TAMCTGTATTGTATTCATGTTGCTAAACAATATTGGCCTTCTCGATGATTTTATTCATG 

TTGCTCCAMGTTAAAACCCTGTAGAACTAAGTAGGTGAAGAGATATTTTGTATAAGTGC 

CACAGAAGAGAAAATATAATAAATTAAATAGTGATCGAGCGGCCGCCCGGGCAGGTACCA 

GCAGAAACCTGGCCAGGCTCCCAGGCTCCTCATCTATGGTGCATCCACCAGGGCCACTGG 

TAnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnTCTTGCAGNTTACTACTGTCAGCAGTATAATNACTGGCCCCCGT 

ACCTTGGCCGCTTCTAAMCTAGTGGGATCCCCCCGGCCTGNMGGMTTTGATNTTTNA 

GCCTTTTTGATNCCCGNCGACCTTCANGGGGGGGGCCC 

Sequence 1379 

CCGCGGTGGCGGCCGAGGTACTTTTTCCTAGTAGTGACCCAGTAAATATCTNTNCAACAA 

ACTTGGCTTTGCTGGTGTGCAAAGTGTAATATAAGGAAAATTAGTGAC Al 1 1 I II I GCCT 

TTTAGTGTGTAACNTGGGAATGTATANGTATAGAAMCAGAGAGATAGTTATATATTTTA 

ACMCTTGAAATTCATAGGTGTTAATAGATTCCTNTTTTTGAAATTAAATATC^^ 

ATTTTGATCACTTACAGTGTTGTATGCACTACCTTA 

Sequence 1380 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC ITTTTTTTTTTTTTTTTI 
TTCCAAAAGCCTTGTTTTTATTTATATAGAGTCCTAACCACTTCGGTGGTAGGAGGGGTG 
GGAGAGGCTCC I I I I ICAA 
Sequence 1381 

AGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACTGTGTATCATTGCA 

GNCTTGCTTTTTTGAGTAATGGATTCCTAGATTCTATGAGGATACCACAACCACTTTTAA 

AGAGGTTTCTAAGGNCAGTTGCAGTGCTTACGCCTGGGAGACCAAGGTGGGAGGATCACT 

TGAGCTCAGGAGTTCGAGACCAACTTTGTCNTT 

Sequence 1382 

ACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACTTNNTTN I I I I I I I I I I 
TTTTTCTTCTTCTAGTTGTTTCTTGGTTTCTGAGTCCAGTCCCCTT^ 

GCGCAGCAGGTTCTTGGACAACTTTGTCTTTCTCTCGGCCGCCATGGTAGGTTCCGCCCA 

GCGCCGCACTC 

Sequence 1383 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCATGGTACCAGCAGAAACCTGGC 

CACGGCTCCCAGGCTCCTCATCTATGGTGCATCCACCAGGGCCACTGGTAnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnACAGAAGTTCACTCTCACCATCAGCAGCCTGCAGTCTG 

GAAGATTCTGCAGTTTACTACTGTCAAGCAGTATAATAACTGGCCCCCGTACCTnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTAGTGGAGGGTTAATTGC 

GCGCTTNGGCGTAATCATGGGTCATTAGGCTGTTTCCCTGTGTGAAAATTGGTTATCCGC 

TCACAATTCCACAACAAACATACCNAACCCGGGGAGCANTAAAAAGTGGTAAAAGCCCT 

Sequence 1384 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAACAGAAACCTGGCC 
AGGCTCCCAGGCTCCTCATCTATGATGCATCCAACAGGGCCACTGGCAnnnnnnnnnnnn 
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nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnAGCAGCGTAACCAACTGGCCTNGGTTAACTTTTCGGNGG 

ANGGACCAAAGGNGGAGATCAAACGAACTGTGGCTTGCACCATNTGNCTTNATCTTCCCN 

CCATTTGNTGAGCANTTGAAATTTTGGAACTGCCTTTTGNTTGGNGGGGCCTGCTGAATA 

ACTTCTATCCCAAGAGAGGCCAAAGTTACCTGCCCGGGGCCGGCCCGCTCTAAAACTAGT 

GGGATCCCCCGGGCTGCAANGAATTCGATATCAAAGCTTATCGATACCCGTCGACCTTCG 

AAGGGGGGGGGGCCC 

Sequence 1385 

CNAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCGCGGGGGACATTCATGCCCGGCGG 

TTCGGGGCGACGGACTCTCCATTCCAGAACCATGGCCCAATTTGTCCGTAACCTTGTGGA 

GAAGACCCCGGCGCTGGTGAACGCTGCTGTGACTTACTCGAAGCCTCGATTGGCCACATT 

TTGGTACCTGCCCGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnGTAATCATGGGTCATTAGCTGTTTCCTGTG 

TGAAAATTGTTATCCGCTCACAAATTCCACACAACATACCGAAGCCCG 

Sequence 1386 

GCNATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTNTTCTGTGTGACAATTACTGCA 

CAGTCTTTCCCTCTGACAGCTACTGATGTAAGGCCACCCTGGTTAATAGCCTTAAAAGCA 

TATTCTACTTGGTAGAGCCGACCCTCGGGTGAAAAAATGGTAATGTGGCGGTCAAAACCG 

GCGCTGGAACCACCGGGACATGTTGGTACCTGCCCGnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnGAGGGTTrAMTrGCGGCGCTTTGGCGTA 

ATCATGGTATTAAGCTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

ACATTACCGAGCCCGGGGAGCATAAAAAGTGTAAAGCCTGGGG 

Sequence 1387 

ATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAAGAAATGCATGAACTGGCAANTTATT 

TGTGACTGCAACAGCCCGTGCGTGTTCAGGAGTGTGTCTTCCAATCTGNCCTAAGGCCCA 

AGCAGCTGCAGCCTTAATATGATCTTCCGGTTCTTCTGACMGCANACTGACAACTGGGG 
TACCTGCCNG 

Sequence 1388 

CTTNGGCGGCCGCCCGGGCAGGTTCTTTGGCCTCTNTGGGATACGAAGTTTTTCANNCAG 

GNTCACAACACGAGGCAGTTCCAGATTTCAACTGCCCATCAGATGGCGGGAAGATGAANA 

CAGATGGTGCAGCCACAGTTCGTTTGATATCCACTTTGGTCCCAGGGCCGAAAGAAGGCA 

GACCAAAATATTGCTGACAGTAATCCCGCGTACCTNGGCCGGACCA 

Sequence 1389 

AGGGCGAATTGGAGCTCCACCCGCGGTGGCGGCCCGAGGTACGCGGGAGGAACTGCTCAG 

TTAGGACCCAGACGGAACCATGGNTTCCCCAGCGCAGCTTCTCTTCCTCCTGCTACTCTG 

GCTCCCAGATACCACTGGAGAAATAGTGATGACGCAGTCTCCAGCCACCCTGTCTGTGTC 

TCAGGGGAAAGAGCCACCCTCTCCTGCAGGGCCAAGTCAGAGTGTTGGCAGCAACTTTGC 
CTGGTACCTGCCCG 

Sequence 1390 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGANGTCCNNNNACACCCNNNGTGC 
ANNGGTCTTGCGTNGCCNATGCCNCCACCATTGTCCANAAGACTCAGTTAACATTATTTG 
CAGTACAGACGGTTATTGATCCACGATGATAGAAGAGGAT 
Sequence 1391 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGCCCGGGCAGGTACTTTGGCCTC 
TCTGGGATAGAAGTTATTCAGCAGGCACACAACAGAGGCAGTTCCAGATTTCAACTGCTC 
ATCAGATGGCGGGAAGATGAAGACAGATGGTGCAGCCACAGTTCGTTTGATCTCCAGCTT 
GGTCCCCTGACAAAAGTGTACCT 
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Sequence 1392 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTGTGCAATGCCTCAGGTTGCACCTGACTTATA 

TGCTGAACTACAGAAGGCACATTTMTTTTATTCAAGGGTGATTTGMTTACAGGAAGTT 

GACAGGTGACAGAAMTGGGAGTTTTCTGTTCCATTCATCAGGCTCTGAATGGCTTCCAT 

CCTGCACCACTCTGTACCT 

Sequence 1393 

TANGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACCAGGCTAAGGAGGTGCTAA 

CACTCTGACnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnTCCGGTGGTATCTGGGAGCCAGAGTAGCAGGAGAA 

GAGAAGCTGAGCTGGGGCTTCCATGGTTCCCTCTGGGTCCTAACTGAGCAGTTCCTCCCC 

GCGTACCTGCCCG 

Sequence 1 394 

CGAGGTACTGTCGTCTCCCAACTCCCACCTACAGGGCCCGGGAATCCTAGGGAGCGGATA 

AAGGCCTGGGAGAGGGAGCAAGATGGGCTCCGCCACGAGATCCGGCTGAAGTCCGGCTCG 

GCGCCGGCCTCAACTGCAAGAGGAAGGCACTCTCTCTCCCCAAGCTGAAACACCAGAAGA 

GCCACACT 

Sequence 1 395 

GACTTTGTTTG I nTTTTTTTTTTTl ICACCTTAAGGGANGACTTNATNTNTTNCCNTNT 

GACCGCCACTTAAATGAAAATTTAAAAATGAATAAAAAGACATACTTCTCAGCTGCAAAT 

ATTATGGAGAATTGGGGCACCCACAGGATGAAGAGAGAAAGCAGCTCCCTAACTTCAAAA 

CCATTTTGGTACCT 

Sequence 1396 

CGGCCGAGGNACGCGTTGCTTTTCACAATTTGGCNCCGAAANTTTAANAACGAAGCTCCT 

GCCCCTCCTAMGCTGAAGCCAAAGCGAAGGCTTTAAAGGCCAANAAGGCAGTGTTGAAA 

GGTGTCCACAGCCACAAAAAGAAGAAGATCCGCACGTCACCCACCTTCCnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTCNATATCAAGCTTATCCGATACCCGT 

NCGACCTCGAGGGGGGGCCCCGGGACCCCANCTTTTTGNTCCCTTTAACTGAGGGGTTAA 

ATTTGCCGCCCCTNTGGGCCGTAANTCATTGGGGCAATAAGCCTGGGNTCCCTNGGGGGG 

AAAMNTGGGTATTCCCGCCTCNACNAATTTTCNCCACAAACAATTACCGAAGCCCGGGG 

AGCATAAAAAAGGGGTAAAAACCCCTGGGG 

Sequence 1 397 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGCCAGGTACTGGAGGAGTGAGTCC 
CTATGCTGACCCCAATNCTTGCAGAGGTGATTCTGGCGGNCCCTTGATAGTTCACAAGAG 
AAGTCGTTTCATTC AAGTTGGTGTAATC AGCTGGGGAGTAGTG GTGTNTG N AAAAACC AN 
AAGCGANAAGCAGGTACCT 
Sequence 1398 

ACTACTTAGGGCAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACCGCGGGAGCTTCTCC 
TTGCCAGTTTCTCCCAGCAGGACCCTCTTCTTGTTTTGAAAGATGGTCGGCTGCrTTTGG 
TAGGCACGCTCAGTCTGAATGTCCGCCATCTTCC 
Sequence 1399 

GTACGCGGGGGAGGCTCGCGGTTNGAGTGCGGCGCTGGGCGGNTCCTACCATGGTTTTNG 
AGAGAAAGACAAAGTTGTCCAAGAATCTGNTGCGCATGAAGTTTATGCATAGGGGACTGG 
CTCAGAAACCAAGAAACAACTAGAAAGAAGAAAAGAAAATCATTAGTGAAGAGCACTGGT 
ACCT 

Sequence 1400 

GCGGTTTGGCCNCCGCAGGTACCAACANAAACCNGNCCAGCTCCCANGCTCCTCATCTAT 
GATGCATCCAACATTTTTCACTGGCAnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 
nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTTNAGNGTATTACTGNCAGC 
AGCGTAACAACTCGTACCTCGGGCnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 
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nnnnNGATATCAAGCTNATCGATNCCGNNAGACCTCGAGGGGGGG 
Sequence 1401 

tagggcgaattggagctccccgcggtggcggccgaggtac i \ v i rtttttttttttttti 

agagatgggggtttcaccgtgttggccaggctggtctcgaaqtcctgacctcaagtgatc 

cactcgcctcggcctcccaaagcgctgggatgacaggcgtgagccattgcaccaagaccc 

g ataggtctaaggg gtctc atttg agtcattgtatataa a i i i i u m i caatgtttttt 

gtctaaaaaactttagacagggaatgcactctccaattaccttcaactgctacctntcct 

tattttaatnnccnaaamcnnattttggcctttaacaattttm 

aaatttgtttnttnttaaaamt™ 

ggggggacccnnggggggggttcanttttcnccncaattttaaccngggtttccccngga 
aaacnggnngaaaaaaagtgngggggncnttttcccaattnaaaaaantnggggaaanaa 
acccttttaaannanggggg i itttti i nctttttnaccnnnncaaaaaatnnttttttt 

TTT 

Sequence 1402 

AGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACGCGGGGAGGAACTGCTCAGTT 
AGGACCCAGAGGGAACCATGGAAGCCCCAGCTCCACTTCTCTTCCTCCTGCTACTCTGGC 
TCCCAGATACCACCGGA 
Sequence 1403 

AGGTACGCGGGGGAGCTACAACAGGCAGGCAGGGGCAGCAAGATGGTGTTGCAGACCCAG 

GTCTTCATTTCTCTGTTGCTCTGGATCTCTGGTGCCTACGGGGACATCGTGATGACCCAG 

TCTCCAGACTCCCTGGCTGTGTCTCTGGGCGAGAGGGCCACCATCAACTGCAAGTCCAGC 

CAGAGTCTTTTATACAGCTCCAGCAATAACAACTACTTAGCTTGGTACCTGCCCGnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnCTGCAGGAATTCGATATCAAAGCCTTATCGA 

TACCCGTCCGACCTCGAGGGGGGGGGCCCCCGGTTACCCAAGCTTTTTGTTCCCTTTAAG 

TGGAG 

Sequence 1404 

CCGCGGTGGCCGGCCGCCCGGGCAGGTACCAAAATGGTTTTGAAGTTAGGGAGCTGCTTT 

CTCTCTTCATTCCTGTGGGTGCCCCAATTCTCCATAATATTTGCAGCTGAGAAGTATGTC 

TTTTTATTCATTTTTAAATTTTCATTTAAGTGACAATCAGT^ 

CCCTTAAGGTGAAAAAAAAAAAAAAACAAACAAAAAAAAAGTACCTnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnNCGACCCT 

CGAGGGGGGGGCCCCGGTACCCAGCCTTTTTGTTCCCTTTTAGTGGAGGGGTTTAAATTG 

CCGCCGCTTTGGCGNT 

Sequence 1405 

GGACCCCTTATCGACTNCTATAGGGCNNATTGGAGCTCCCCGCGGTGGCGGCCGAAGGAG 

AAATAGAACAGTGCAGGCAAAAGAAGAAAGGCGCGGGCTGGGTGGGAAGAGGATTCGGAC 

TCGTCACACTGCAGAGCAGCAGAGCGAGAAAGGATGAGAAGAGGCAGAGAAGGCGACGGC 

AGAAGAAAAAAGGAAAAACTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnGCGCCGCTTGGCCGTAATCATGGTCATAGCCTGTTT 

CCTGTGGTGAAAATTGTTATCCCGCTTCACAAATTCCCACCACCAACATACCGAAGCC 

Sequence 1406 

CCGCGGTGGCGGCCGAGGTACAGAAACATTTTAGAAAAAGAAAGGGAACTTCAGCAACTG 

GGAATCACAGAATACCTAAGGAAAAACATTGCTCAGCTCCAGCCTGATATGGAGGCACAT 

TATCCTGGAGCCCACGAAGAGCTGAAGTTAATGGAAACATTAATGTACCTGCCCGnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnCGACCTCGAGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnCGCTTGGCGTAATCATGGTNTAAGCCTGTTTCCTGTGTGAAAATTGTTATCCCGC 

TCACAATTTCCACACAACATTACNAGCCCGGGGAGCATTAAA 
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Sequence 1407 

ACTACTTAGGGCAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTGGCCT 

CTCTGGGATAGAAGTTATTCAGCAGGCACACAACAGAGGCAGTTCCAGATTTCAACTGCT 

CATCAGATGGCGGGAAGATGAAGACAGATGGTGCAGCCACAGTTCGTTTGATCTCCACCT 

TGGTCCCTCCGCCGAAAGTGAGCGGGGTACCTnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnGTTAATTGCGCCGCTTGGTCGTAATCAATGGG 

TCATAAGCCTGTTTTCCTGTGTG 

Sequence 1408 

CTACTTAGGGCAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTGGCCTC 

TCTGGGATAGAAGTTATTCAGCAGGCACACAACAGAGGCAGTTCCAGATTTCAACTGCTC 

ATCAGATGGCGGGAAGATGAAGACAGATGGTGCAGCCACAGTTCGTTTGATATCCACTTT 

GGTCCCAGAGCCGAAAGTGAATATAGGATGAGGCCAGTTGCTACGCTGCTnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnAGAGTGAAGTCTGTCCCAGACC 

CACTTGCCACTTGMCCTGGCTTGGGATTGCCCAGTGGGCCCTTGTTGGGATGCATCAAT 

AAGATTGAAGGNAGCCCTGGGGNAGCCTTGGCCCAGGGTTTCNTGGTTGGG 

Sequence 1409 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTTCCAGGAAGTGAAGTAAAACCTGGTCTTGGT 

TGATAGGCCCCAGGTTGGCTTGGAGCCATTCCAGGTTGAGAGGCAGGAGCCACAGTATAA 

TTAGTAGGCTGAGMGTTTGGGCAGTGTAAGTTTGTGCAGGATAATTGCTCGCCTGGTAC 

CTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnGCCGCCGCTTGGCCGTTAATCAATGGGTCATTAGCCTGTTTCCTGGTGTGGAAAA 

TTGTTTATTCCGC 

Sequence 1410 

CACGCTGGTTTTGCATCTTCAGGAGACGCTCGTAGCCCTCGCGCTTCTCCTCGGCCAGTT 

CGCGGAAGAAGTGGCTCACGCCTTCCAGAGCCACATCATCGCGGTCGAAATAGAAGCCCA 

NAGAGAGGTAGGTGTAGGAGGCCTGCANGTACCTnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCGTAATCATGGGT 

CATAGGCTGTTTCCTGTGTGNAAATNTGGTTATCCGGCTCACAAATTCCACCA 

Sequence 1411 

ACTACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGCCTTTGCCTTGAGTCCA 

TCCACGCTTTAGCAGCTGCTCCTCAGCAAACTTCATCCCACGACTCTTGACCTCTGGGGT 

GGCATTCATGGTGAGAAAAAAGTCTCTATCCTCAGACTCTCACCAGAGAAGAAGAGGTGA 

TATCATCCGTCATTGCTGGGTCCGCTCCGCGGAAACATGTGCCGGACCTGACTTGTGCGC 

CGCCATCTTCCCCCGCGTACCTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnCTTATCGATACCCGTCGACCTCGAGGGGGGGGGCCCCGGGTACCC 

CAGCTTTTTGTTCCCTTTTAGTGGAGGGGTTAATTTGGCGCCGC 

Sequence 1412 

CGAGGTAC I I U I M I I I I I I I I I I I I II I I I I I AAGNGATTTGCAGTTAGTTTATTGAA 
ATGGAAATGAATTTGAGGNTTCAACAGTNGNCTAGCAGTTTCCACATGAAAAAGGACAAC 
NCNAAGTAGMGTAAAAGNTGNGNTTTTCTTAGGNCCCCCNAAAAAANNTANNANTTm 
TTTGGGCCCCCGNAAAAAANANAAAAATTCCCCCCCNNAAAANGGGGNGGGGGGGNAAAA 
NATTTTTTnTl GGGGGNNGNNAAAAAAAAAAGGGGNGCCNGNTTTTTTN ITTTTTTTNG 
NNCCCCCCCC? I I I I I I IGGGGGGGGGGGGGGGGGGGGGGGGCCCCCCCCCCCCCCCCTA 
AAAAAAN1 H II ll H I I I AAAAAAAANNGGGGTNNTG I I I I \\ I I i GCCCCCCCCCTTT 
I I I I I IGGGGGGGGTN I M I I [ I I [ I AAAAAAAAAANNGGGGGNNTNNNNCCCCCCNTTT 
CCCCCCCCCNGGCCCCCCCNN rTTTTTTTTTTTTI ICCCCCCCCTAl Mill IGGGGGGG 
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GGGGGGGGTAAAAAAAAAANGGGGGTTTTTT^ 
Sequence 1413 

ATACGACTNCTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACC 

AGCAGAAACCTGGCCAGGCTCCCAGGCTCCTCATCTATGGTGCGTCCACCAGGGCCACTG 

GTAnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnAGCAGTATAATAACTGGCCCCCGT 

ACACTTTTGGTCAGGGGACCAAGCTGGAGATCAAACGAACTGTGGCTGCACCATCTGTCT 

TCATCTTCCCGCCATCTGATGAGCAGTTGAAATCTGGGAACTGCCTCTGTTTGTGTTGCC 

CTGCCTGAAATAACTTTCTATCCCAGGAGNAGGCCAAAAGTTACCTCGGGCC 

Sequence 1414 

AGATGATGTCTTTTTGTTATTGTCTGTCTGTTTTGTATGTGTCTGANATAAGGGATAGAG 
AGGAAACATCCGTTCAGGCTAATTNTAACTACATTTTATTNTTAAAAAATAGNAGGAAAA 
CC ATAAC NCTTNTTTAAAAAGGGNGTNNNNAAAN I MINN I GGTTTNTGGGGGTTGNG 
GGTTTTTAAAAAANAAACCCCCCCCAAAAAANGGGGGGTGTNTCCCCCCCCCCAAAAAAN 
NGGGGNGTTTAAAAAAAMGTGNGNNNGGGGGGNNTTTTAAAAAAAANNCCCCCCCCAAN 
AAAANGGGGGGGGTN I I I t I I M I l I M I ; I l AAAAAAAANNAGGGNGGGG I [TTTTTTT 
TAAANANNNGGGGGGGGANAAAAAAAANGGGGGGGGGGGGGGGGGTGNGGGTCCCCCCTN 
CCCCCCNCNAAANANNNCCCCCCCCCCCNTTCTCANI WWW I ATTATNTNTAAAAATT 
TTTCC CCCCCCCCCCN I rTTTTTTTI 1 NGGGGGGGGGGGGGGGGCACTNNTTNTTCNTTT 
MM I NN CAAAAAAAAA 1 M N N N N M I 1 ATTTTNTNTAGNGGGGGGGNTATTTATA 
NTTTTTTATACCCCCCCCCCC 
Sequence 1415 

AGGTACCTGCAGGCCTCCTACACCTACCTCTCTCTGGGCTTCTATTTTGACCGCGATGAT 
GTGGCTCTGGAAGGCGTGAGCCACTTCTTCCGCGAACTGGCCGAGGGAGGAAGCGCCGAG 
GGGGCTACCNAGGCGTNCTCCTGGAAAGATTGGGGGNCCCCCCCAAAATNTTAAAAGGAA 
AAAANNNA AAAANNCCCCCNCGCCCCCCAAAAAAANNGGGGGNTNCCCCCCNGGGGGNAT 

nrrrn igggggggggggggggggnntnccccccnggntttggggggggnntttncccc 

CNTCNCCCCCCCCNGGGGGGGGGNNCCCCCN I iTTTTTTI i CCCCCCCC I rTTTTTTTT) 
AAAAAAAAGGGGGGGGGGGNAAAAAAAAAAAAAAAAAAAACCCCCCCCTN l rTTTTTTT A 
NAAAAAAAAGGGGGGGNN I I I M I I I NNGGGGGNNNNGGGGGNNNNNNNAAAAAAAAAAA 
I I I I M M I I M I I NNNCCCCNNNNNNCCCCCCCCCNNNNNCCCNNNNTGNNCCCNTTAT 
AATN NCCCCGGGGGN GGNNGNGGGGGGGGGGGNNNNNNNANNTTNTTTTTANCCCCCCCC 
CCCC llilllllllll NGGGGGGGGGGGGCCCCCCCCCCCNNAAAAAAAAANGGGGGGGG 
GGG 

Sequence 1416 

AGGTACI MINN IGTTTG I WWWWWW I CACCTTAAGGGAGGATTTAATTNGCT 

CCCAACTGATNGTCACTTAAATGAAAATTTAAAAAGTGAAGTAAAMGNACATTACTTCT 

CAGCCTGGCAAATTATNTATNGGGAGGAANAAAAAAA I lllllllllllll I GGGGGGGG 

NGGGGGGGTATMGGGGGNNTNNGGGNTTTNCCCCNCTAANAAAAANTCCCCCNTTTNCC 

CCCCTTNNCCCCNTAAAAAAAAAAAACCCNCCNNAAAAAAAGGGGGGNNGGGGGGGNGNT 

TNAN AAAAAAAAAANAAAA I II N M M I NGGGGGGGGGGGGAAAAAAAANANNANAAAA 

AGGNGGNTTTNTAAAAAAAAAANNNGGNGNTGAAAAAAAAANNGNGGTCGNTNTAANAAA 

AATNANATAANTNNTTTAAAATNGGGGGGGNNTTCCCCCCCCANAAAAANAGGNGGGGGT 

GNCCCCCCCCCCTC M M I I M M M I M NTACNNANGGCCCCCCCCTTTNCCTNNTTTT 

llilllllllll I AAAAAAANAGAATAATTNTNNCCCCCCCCCCTTTTN I M M M I GTN 

NNTANNGNGANCCCCCCCCCCCNCNAAAATATCNNCACTNTTGNTGNGAAAAATATTTCC 

CCCCCCCCCCCCCCCC 

Sequence 1417 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCAAAATGGTTTTGAAGTTAGGGAGCTGCTTTC 
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TCTCTTCA TTCCTG TGGGTGCCCCAATTCTCCATAATATTTGCAGCTGAGAAGTATGTCT 

TTTTATTC A I I I I 1 AAATTTTCATTTAAGTGACAATCAGTTGGGAGCAAATTAAATCCTC 

CCTTAAGGTGAAAAAAAAAACAAACAAAAAAAAAGTACCTnnnhnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCGTCGACCTCGAGGnn 

nnnnnnnnnnnnnnnnnnnnnnTGTTCCCTTTAGTGGAGGGGTTTAAATTGGCGCCGCCT 

TGGCGGTAAATCA 

Sequence 1418 

AGGTACATGGGGGTCCGTGCGGGCAGAACCCAGGGCATGAAGATCCAAAAGGGCCTGGTT 

CAGCTTTTTCTCCAGGGCCATGGCAGCTTTCATGGCGTCTGGGGTTTTACCCCACTCATC 

TTCAGCTGGCTTCTTGATGTCCTGGAAGAGAGCnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnGTTAATTTGCCGCGCCTTGGCGTAATCATT 

GGTCATAAGCCTGTTTTCCTGTGTGAAAAATTGTTTATCCCGCTCACA 

Sequence 1419 

TATAGGGCGAATTGGANCTCCCCGCGGNGGCGGCCGAGGTAC I I I I I I I I GTTTGTTTTT 
M M M I CACCTTMGGGAGGATTTAATTTGC TCCCAACTGATTGCCACTTAAATGAAAA 
TGTTAAAAATG AATAAAN AAGTAC ANTACCTTTTTTC AAN C NTGGC AAAT AAN AATTTAA 
TNGNGNAGAAAATTTNTGGGTTNGGGGTTTGGGGGTTNTTT^ 

TACCCCCTNNNCCCCCT7TTNNCCCCCCTATAAAAAAANCCCCCCCCNTAAAAAAAAANN 
TTTATGGGGGGGTTTTTTTNAAAAANATAAAAAAAANAN I II I I I I 1 1 GGGGGGNTGTGT 
CNAAAAAANNCTAAAAAAAGGGGGNGGTNTNCCCNAAAAAAGGGGGGNGNTNTTAAAAAA 
ANGNGGGGGGGGGNTGTANAAAAAATAAATATTTATTTTTTAGGGGGGGGGGGTNGNCCC 
CCCCCTNTNAAAAAANNNANNANGNGGNCCCCCTNA I rTTTTTTTTTl NCCCCNCTTTCC 
CCCCNTNTCTACNTC rTTTTTTTTTTTTTTTI I GTAAAAANANNGNAATTAATNNCCCCC 
CCCCCCCCNN I I I I I I I I M I I NATCTNTNNNNCCCCCCCCCCCCCCTAANTTGTNCTTA 
ANATTATTNATAATAAANATNGTTTTTNTATTTTNCCCCCCCCCGTNNGTCTCCCCCCCA 

Sequence 1420 

CTTAGGGCAATTGAGCTCNCCGCGGTGGCGGCCGAGGGACCAAGGTGGAAATCAAACGAA 
CTGTGGCTGCACCATCTGTCTTCATCTTCCCGCCATCTGATGAGCAGTTGAAATCTGGAA 
CTGCCTCTGTTGTGTGCCCTGCTGAATAACCTTCTATCCCAGNAGNAGGGCCCAAAGTAA 
AANTAAAAAGGGGGGN I I I I H I I AAAAAACCCCCCCNCCCCCC I i I I I I I I II IC CNNG 
GGGGGGGGGGGGGGNGGCCCCCCTTCCCCCCCGGGGGGGGGCCCCNATTTTTTTTTTTCC 

ccci rrrrm inaaaaaagggggngnnaaaaaaaaaaaaaanatccccccn rrrrrm 

TNAAAAAAANGGG I I I II I II M I I I GGGGGGGGGGGGGGGGGGNNAAAAAA A 7 I I I I I I 
TTTNCCCCNNCCCCCCCNNCCCTNTTNCCCTTNTTNTCCCCGGGGGTNTTNGGGGGNNTA 
ATNNNGGNTTTTTTTCCCCCCCCNTN I I II I II I N NGGGGGGGGGG NCCCCCCCCAAAAA 
AAAAAAANGGNNGGGGGGGGGGGGGGGGTTTAAAAAAAAAAAAAAAAAAA 
Sequence 1421 

AGGTACCTTGGTGTTACTTCCTTTGAAAGATGGACATTGAAAGTTAAAAATCAGTCACTA 

TATAGGCACTGCTGTATGGAAAACGCATTTTGTGTTTCTACAAATTGTAAGTCGGNAAAT 

GNCCTTTTTTGTAGTATCCTTCGGGNCTTCCCCCGGGGGGGGNCCCCC I MUM I C CCC 

CCI I I II II rAAAAAGGGGGGGGNAAAAAAAAAAAAGCCCCCCCNTT^ 

GGGGNI I I I I I I If! I GGGGGGGNGGNGGGGGGGTTTTNAAAAAAAN I I II M I I I M NN 

CCCCNNNGCCCCCCNNGTCCCCCCTTNGNCCCCNNNTNCCCCCGGGGGGNNTNGGGGGGG 

NNTTNGGGNTTTTTTNCCCCCCCCC I M I I M II I GGGGGGGGGGNNNCCCCCCCCCNAA 

AAAAAANATNTTTTGGGGGGGNTTTTTAAAAAAAAAAAGNGAAAAM 

TNNNI Mill INNNNNGNCCCCCCCCCGGGGGGGGGGNGANAAAAAAAAAAAANTTTTTT 

TTTN AAAAAAAAAAA I I M M I M II M NCCCCCCCCCCCCANAAAAAATTNNAAAAAAN 
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GGGGGGGGGGGG1 I I I I1CNCCCCC 
Sequence 1422 

AGGTACCTGCACAATGTCTACCCAGAGATGTTTGTTCCTGACCTGACGCCCACCTTCTAT 
GGTGCCATCAAGAACCTNGGCACCAACCAATGCCTGGATGTGGGTGAGTAACAACCCGCG 
GNGG GGGMNGCCCCCTCMTCMTGGGNTTTTTNAAAAAAAMCCCCCCTNCCCCTm 
TTTTGGGGGG NGTNCCCCCGGCCCCCTTNCCCCGGGG ATTTGGGGGGTTTNGT I HUM 
C CCCCCCGGGG GGGGGGGGGGNNt I I I II II I I I I f CCCCCCCGGGGGGGGGGGCCCCCC 
CTTTTTTTTTTI ICCCCCCCN I I I N I I \ \ I AAAAAAAGGGGGGGGGGGGANAAAAAAAA 
AAAAAAAAAAANCCCCCCCCC n I I I M I I I AAAAAAAAGGGGGGGN I rTTTTTTTTT GG 
GGGGNNNNGGGTNNTNAAAAAAAAAAN I I M I I 1 1 I I I M l I NCCCCNNGGNNCCCCCCC 
CNNN NNCCCCT GTNGNCCCCCTNTTNNNCCCCCGGGGGTNNNNTNNNGGGGGGGGTTTTN 
NNGNi mum ICCCCCCCCCI nTTTTTTTI I TNGGGGGGGGGGGGGNCCCCCCCCCCC 
NAAAAAAAAAANANATTTNTTTGGGGGGGGGNTTTTGNANNAAAAAAAAA 
Sequence 1423 

AGGTACATATTTATGCCAGAAAGCTTGACAAACACTGAACAGTCTGTAAAAAAAATGGAA 
A ATCT CTTGACCATCAATTCAAATAMTATAATTTTAACTATTT^ 
ATTTTACCTTT NTAATTT AAANANTTAN A I I I I I M I M I IANAAAAAAACCCCCCCCCT 

m i m ii m rrccc cccci m n 1 1 n inaaaaaaagncaaaaangncaaaaangtaaaa 

CCN htnn ii l GGGGGGGGGGGNCCCCCCCCCN I I I I M I II I I NCNCCCCNTTTTTT 
iTTTTTTTI iNGGGGGGGGNGNTAANTATNTTCTTCNCCTGGGNGGGGGNN IIUUIll 
TTTCCNCCCNCCAAAAAAAAANGTGTTTNGG 

AAATTNCCCCCCNNNNGCCCCCNN I ITTTTTTm I NCCCCCCCCCCNTNGGGGGGTNTT 
TTAATACCTCCTNNNCNCCCCCCGGGGGGGGGGGGGGNTTNCCCCC I < rTTTTTTTTTNT 

cccccccccn nrrrn ittttnanaaaaaanaangggggnnnggtngngtggccaaaat 

NGNTTTATTTNTTTTCCCCCCCCCCCTTTTTTTTT 
Sequence 1424 

AGGTACTTTCi i u M iGTTTGI t I I I I I I I I 1 CACCTTAAGGGAGGATTTAATTTGCTC 
CCMCTGATTGTCACTTAMTGAAAATTTMNAAATGAATAAAAAGGACATACCTTCT^ 
GCNTGGCAAAATAATTTATTGGGGAGGGGNAAAAAAAAAAAAAA \ WWW \ NTTTTTGG 
GGGGNNGGGGNTTNNAGGGGNTTGGGGGNTTTCCCCCCCCAAAAAANAANNCCCCCCTTN 
CCCCCNNTTTNCCCCAAAAAAAAATCCCCCCCNAAAAAAMNNNANTTTTGGGGGGGGGN 
GNTTTAAAAAAAAANAN AAAAAA I I I I I I I I I 1 NGGGGGGGGGGGGGGGGGNGTNTTAAA 
AAAMNGGGNGNTTTTNANAAAAAAMNGGGGGGGNGNTAAAAAAAAANGGGGNGNNGTT 
TAAAAAAAATAANAAANANTTTTAAAANGGGGGGGGNNNGNTNCCCCCCCCCCAAAAAAA 
AAANGGGGGGGN GGGGCNCCCC CCCCCN < 1 M 1 I 1 I M I I I NNANNCNCNNNNNCCCCCC 
CNACNANTCCCCC I I I I I I I I I I I I 1 I II 1"TNAAAAAANNNTAAAAANNNCCCCCCCCCC 
CCCCCTTTTTTTTTTTTTTTTTNACACCCCCC 
Sequence 1425 

CCGG NCAGGTNC ACTTG AAACCAAANTTCTAAAACTN GTTTTTCTTAAAAAATAGTTGTT 
GNAACATTAAACCATAACCTAATCAGTGNGTNCACTATGCTTCCACACTTAGCNCAGACC 
NT TCTCACNACATT NTTTACTTGGGG I Ml I I I I I I CCCCCNAAAANTTTTAAAAAGGGG 

gn n u u n n i inccccnh m rnm ncccccaaaaaaaaaaaaaananggggnn 

TTGGGGGGGGNTTTCCCCCNAAAAACCCCN I MIIMII I NGGGGGGGGGGGNAAAAAAA 
NCCCCCCCTTAAAAAAATGGGGGGGGTNTAAAAAAAAAAACCCCCNCCTAAANAAAAGGG 
GGGNGCCCCCNAAAAAAAAAAAAAAANCNNNTTTTTTGGGGGGGNCNGGGNTTNCCCCCC 
CCCCCCN ITTTTTTTTTTTTTn 1 GGGGGGGGGGGGGGGGGTTTTTTTAAAAAAAAAGGG 
GGGGGNNG NTGGANAAAAACCCCTTTTTTTNNNN 1 I I U 1 1 I 1 1 NNNGGNNTGGGGGGNT 
rTTTTTTn I GGNNGNTGTGNNAGTNTTTCCCCCCNTTTTTTTCTCC \ WWW \ AN NAG 
ACCCCCCCCCCNTT 
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Sequence 1426 

CGCCCGGGCAGGTACAACCATCCAGCAGGTCCCAGAACAGTTTTCTTCTGGGCTCCAATT 
ATGAAATGGGGGTTGGTGTGTGCTGGATTGGCTGATGTNGGCCAGNACTCTGCAGGAANA 
AACTTAGGCACAAGGCNT CAATC TGTCTTGGGAN NGGGTTTTTANTTTTTNN NTTTTAGG 
GGGGGGGTGAAAAAAAANTTTTTNGGGGGGGGGGGGNGTNCCCCCCC \ I HUM IN AAA 
AA AAAANCCC CCCCCAAAAAAAAAAANNTTTTGGGGGGGNAGGGGGGGGNNNANATTTTN 
ANTTTTm I ISI I l \ 1 1 NNAAAAAAAAAAAN I I I I I ! I I i I II I ICACCCCCCCNNNGGGG 
GGGGGGGGGGGGGGGGG I I 1 I I I 11 I I I M I CCCCCCCCAAAAAAAAAAAAAAAAANGGG 
GGGGGGGGGTAAAAAAAAAANA I I 11 I I I 1 AANAAAAANCCCCCCCCC 11 1 1 I I I II 1 I I 
TTTCCCCCCCCCCAAAAAAAAAAAAAAACCCCCCCCCCCNA I MUM I NTTATTTTGGG 
NGGGGGGGGGGGGG I i I M I M M i I M I M ( AANNTTTTAAAAAAAA I M M M M I M 
TATTTTN7TNTTNAAAAAAAAAAAAAAA ! M M M I rTNNTGTTNTGTNGGGCCCCCCCC 
CCCC 

Sequence 1427 

TACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCACGCTGGTTTTGCATCTTCA 

GGAGACGCTCGTAGCCCTCGCGCTTCTCCTCGGCCAGTTCGCGGAAGAAGTGGCTCACGC 

CTTCCAGAGCCACATCATCGCGGTCGAAATAGAAGCCCAGAGAGAGGTAGGTGTAGGAGG 

CCTGCAGGTACCTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnCGTAATCATGGTCATAnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnCGCTCACAATTTCCACACAACATACGNAGCCCGGGGAGCATAAAGGTGTA 

AAAGCCTGGGGGTG 

Sequence 1428 

CTACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGATACCTCACCGTGGCTGCTG 

TCTTCCGTGGTCGGATGTCCATGAAGGAGGTCGATGAGCAGATGCTTAACGTGCAGAACA 

AGAACAGCAGCTACTTTGTGGAATGGATCCCCAACAATGTCAAGACAGCCCGTCTGTGAC 

ATCCCACCTCGTGGCCTCAAGATGGCAGTCACCTTCATTGGCAATAGCACAGCCATCCAG 

GAGCTCTTCAAGCGCATCTCGGGAGCAGTTCACTGCCATGTTCCGCCGGAAGGCCTTCCT 

CCACTGGTACCTCGGCCGCTCTAGAACTAGGTGGGATCCCCCGGGCTGCAGGGAATTCGG 

ATATCAAAGGCTTATCGATACCCGTCCGACCTTCGAGGGGGGGGGGCCCCCGGGTACCCC 

AACTTTTT 

Sequence 1429 

ANCACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGGGTCGAGAGGTGGTTAC 

ATTCGTTGAAGGACACCAGCTGCGGAATTTGCGGCTTTGGCAGATTGAAATCATGGCAGG 

TCCAGAAAGTGATGCGCAATACCAGTTCACTGGTATTAAAAAATATTTCAACTCTTATAC 

TCTCACAGGTAGAATGAACTGTGTACCTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnGCGGTAATCATGGGTCATTA 

GCCTGTTTCCTGTGTNAAAATTGTTATCCGCTCACAAATTTCCACACCAACATACCGAAG 

CCGGG 

Sequence 1430 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACACTTTCATTTA 

GTAATTAAATTTTAGTCAGATTTTGCCCAACCTAATGCTCTCAGGGAAAGCCTCTGGCAA 

GTAGCTTTCTCCTTCAGAGGTCTAATTTAGTAGAAAGGTCATCCAAAGAACATCTGCACT 

CCTGAACACGCCCTGAAGAAATCCTGGGAATTGACCTTGTAATCGATTTGTCTGTCAAGG 

TCCTAAAGTACCTGCCCGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnCGTCGACCTCGAGGGGGGGGCCCGGGTACCCAGCTT 

TTTGTTCCCTTTTAGTGGAGGGTTTAAATTGGCGNCGCTTGGCCGTAATCATGGGTCATT 

AAG 
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Sequence 1431 

GACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATCCCC 

AGTCGTGGCCCTCTGGACAAGTGGCGGGCCCTGCACTCATGAGGGCTTCCAATGTGCTGC 

CCCCCTCTTAATACTCACCMTAAATTCTACTTCCTGTCCAAGAAAAAAAAAAAAAAAM 

AAAAAAAANGTNCCGTCCGGTGTAATGGCGGCNCGGTGGAGCTCTAGGTGATGTTTGCAG 

ACAGAAAAAGTAAACTGTCAAGGAATTCANAGCATTGGGTACCT 

Sequence 1432 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAAATGGT 

TTTGAAGTTAGGGAGCTGCTTTCTCTCTTCATTCCTGTGGGTGCCCCAATTCTCCATAAT 

ATTTGCAGCTGAGMGTATGTCTTTTTATTCATTTTTAAA 

AAGTTGGGAGCAAATTAAAATCCTCCCTTMGGTGAAAAAAAAAAAACAAACAAAAAAAA 

AAGTACCTCGGCCCGCTCTAGAACTA 

Sequence 1433 

ACTACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACCAGGCG 

AAGTTGGTGCCAACACTCCGACnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnCGTCATCACTATTTCTCCAGTGGTATCTGGGAGCCAG 

AGTAGCAGGAGGGAAGAGAAGCTGCGCTGGGGCTTCCATGGTTCCGTCTGGGTCCTAACT 

GAGCAGCTCCTCCCCCGCGTACCT 

Sequence 1434 

AACGACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTTTCTTTT 

TTGTTTG I I I I I I I I I ICACCTTAAGGGAGGATTTAATTTGCTCCCAACTGATTGTCACT 

TAAATGAAAATTTAAAAATGAATAAAAAGACATACTTCTCAGCTGCAAATATTATGGAGA 

ATTGGGGCACCCACAGGAATGAAGAGAGAAAGCAGCTCCCTAACTTCAAMCCATTTTGG 

TACCTGCCCGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnCGCTTGGCCGTAATCATGGTCATAGGCTGTTTCCCTGTGTGA 

AAATTGTTATCCCG 

Sequence 1435 

CGGCCGAGGTAC I" I ( I I I I I NTTTG rTTTTTTTTTTTI CAC CTTAAGGTN NGATTN AATT 

TGCTCCCAACTGATTGTCACTTAAATGAAAATTTAAAAATGAATAAAAAGACATACTTCT 

CAGCTGCAAATATTATGGANAATTGGGGCACCCACAGGAATGAAAGAGAGAAAGCAGCTC 

CCTAACTTCAAAACCATTTTGGTACCTGCCCGG 

Sequence 1436 

TCATACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACTCCAG 

GCGAGAGGCGGACGCGAGTCGTCGTGGCAGGAAAAGTGACTAGCTCCCCTTCGTTGTCAG 

CCAGGGACGAGAACACAGCCACGCTCCCACCCGGCTGCCAACGATCCCTCGGCGGCGATG 

TCGGCCGCCGGTGCCCCNAGGCCTGCGGGCCACCTACCACCGGCTCCTCGATAAAGTGGA 

GCTGATGCTGCCCGAGAAATTGAGGCCGTTGTACCGTATATAAAAGACAATTGCTCACAA 

TGATAGCACTGAAGCACTGAGAGATATCAAAGTACCT 

Sequence 1437 

CTATACNGCGMTTGTTTTCCCCGNGGTGNCGGCCGCCCGGGCAGGTACACGGGAGATGA 

AGGTGCGCATCCTCTCCGNNCGTAAGAAGCCTCTGGACATTGACTACATGGGGGAGGAAC 

AGCTCCGGGAGAAAGCCCAGGAGCTGTCGGACTGGATCCACCAGCTGGAGTCTGAGAAGT 

TC G ACCTTG ATGGGC N AAANCCTG AAACAGC AAG AAAATATG AG ATC AAC GTGCTGTACC 

TCGGC 

Sequence 1438 

GACTACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACCNTGCA 

GAAACCTGGCCAGGCTCCCAGGCTCCTCATCTATGGTGCATCCACCAGGGCCACTGGTAn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 
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nnnnnnnnnnnnnTCTTGCAGTTTACTACTGTCAAGCAAGTATAATAACTGGCCCCCGTA 

CCTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnGAGGG 

TTTAATTGCGCGCTTGGGCCGTAATCATGGGTCATAAGCCTGTTTTCCTGTGTGAAAATT 

GTTATCCC 

Sequence 1439 

GACTNC TATA GGGCATTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTAGAATATCAAC 

AGAGTC7TTTAAATGAAANTGATGGAAAATCAAGATGGTTCTACTCACCGTGGTTGTTGG 

TAGAATGTTACATGTATCGAAGAATTCATGAAGCAATTATCCAGAGTCCACCAATCGATT 

ACTTTGATGTATTTAMGAMTCAAMGAGCCAAAATTTCTATGGGTCACAGGAATCCAT 

CATTGCi I lATGTACCTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnCGCTTGGCCGTAATCATGGGTCATTAGGCTGTTTCCC 

TGTGTGAAAATTGTTATC 

Sequence 1440 

TATCACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACAAACTGTT 
CAATCTTCTCAAAGTGATGGACTCGGAAGATGCCACGGGTGTCACGGCCATGGGAGCCCA 
CCTCCTGACGGAAGCAGGTAGACAGGCCAGCATACTTGATGGGCAGGTCCTC 
Sequence 1441 

CTATCACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGC 

GGGGATCCAGAATACATTTCCAACAAGAGCACTGGCCAAGTCAGCTTCTTCTGGAGAGTC 

TCTAGAAGACATGATGCTACACTCAGCTTTGGGTCTCTGCCTCTTACTCGTCAAGTTTCT 

TCCAACCTTGCCATTGCAATAAAAAAGGAAAAGAGGCCTCCTCAGACACTCTCAAGAGGA 

TGGGGAGATGACATCACTTGGGTACCT 

Sequence 1442 

TCACTACTTAGG GCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTTTGTTTGTT 

nTTTTTTTTI \ I CACCTTAAGGGAGGATTTAATTTGCTCCCAACTGATTGCACTTAAAT 

GAAAATTTAAAAATGAATAAAAAGACATACTTCTGAGCTGCAAATATTATGGAGAATTGG 

GGCACCCACAGGAATGAAGAGAGAAAGCAGCTCCCTAACTTCAAAACCATTTTGGTACCT 

GCCCG 

Sequence 1443 

ATCACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTTTTTTTGT 

TTGI I II I I I I I I I ICACCTTAAGGGAGGATTTAATTTGCTCCCAACTGATTGCACTTAA 

ATGAAAATTTAAAAATGAATAAAAAGACATACTTCTCAGCTGCAAATATTATGGAGAATT 

GGGGCACCCACAGGMTGMGAGAGAMGCAGCTCCCTAACTTCAAMCCATTTTGGTAC 

CTGCCCG 

Sequence 1444 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAGCAGAAACCTGGCCA 

GGCTCCCAGGCTCCTCATCTATGGTGCATCCACCAGGGCCACTGNA 

Sequence 1445 

ATAATTGCGCGCTTGGCNGTMTNCATGGTCATAGCTTGTTTTCCTGTGTTGAAAATTGT 
TANTTACGCTCACTAATTTCCAACACAAAN CAATTACTGN AGCNCGGGG N AGTC ATTAAT 
AGTTGGTTAAAAAGNCTCTGNGNGGNTGNCNCTTAAAATGGAAGNTGCAGGCCTTAAACN 
TCAACCATTTTANATTTGGCNGTTTTGACTGCNT^ 

CAAGNTNCCGCGGNAAAAMCCACTTGGTTCCCNTTGNCNCCAGTCTTGGTCAATTTTAT 

ANTGNAAAATTCNGGGNCCCAAAACTGACNGTCCGTGNGGGGAAGGAAGGGNTCGGGGTT 

NTTTGGCCGGTNTATTTNGGGGGGCCNGCCTTACTTT1TCCCGGNCTTNNCTCCTTCGGN 

NTTCCAACTTGG 

Sequence 1446 
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CCGGGC AGGTACACAGTAAACAGCATTCCTCACTTCG CTGAACAGCAN NAGGCAACTCGT 

TTCTAAGTTCCAAGCTCACTCTTCATAGAGNCTATCCACCTTGGGCTNG 
Sequence 1447 

ATTGGTTATCCGCTCACAAATTCCACAACAACCATACGAGCCGGGGAAGNCATTAAAAGT 

TGGTAAAAGCCTGGGGGTTGCCCTAATGGAGGTGAGNCTTAAACNTCACAATTTAATTTG 

CGTTTGGCGGCNTCACTTGCCCCGGCTTTTTTCCMGTTCGGGGGAAAAACCNTGGGTTC 

CNNTTGGCCCAANCNTTGCCAATTTTAAATTGGAAAATTTCG 

Sequence 1448 

TACCAGAGGCCCTGGTGGAGGCACCATAGATGAGGAGCCTGGGAGCCTGGCCAGGTCTCT 
GTTGGTACCTGCCCGGGCGGCCGNTCGACTGTTGGACTTGAACCAGCGCGCGTTCAAGAA 
GTTTGCCGGGCAACCGCACCAGCGCCTTNGNCGAACTTCGACCGGCCAGCC 
Sequence 1449 

CTACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACCTGAGGCACCAAGG 

TGGCCTTCGCTCTCTTGACATCTTGGTCGTCTGCGTCCGGCGCTTTCCTCTTGACGGACT 

GAGGCGCTGAACTGGCGGACGAACCCTGGCTGGCTTCCTTCGCGCCTTTCAGCTCGGCCA 

CTTTCTCTCGGTGCTGCTTTCCCAGGACGTGAGTCTGCCACAGGAGCTCGCTCTTAACCG 

GAGTGTTACACAGGGCACAACTCAGCTGCCCCAAACGGTTGTACCTGCCCGnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCGT 

CGACCTCGAGGGGGGGGCCCGGTACCCAAGCTTTTTGTTCC 

Sequence 1450 

TACCAN TGGC CCTGGTGGATGCACCATTAGATGGAGNGNAGCCCTTGGNGAGCNCTNGGC 

CMGGATTTTCTTGCTTGGGNTANCCCTNCGGGCCCGNTTCTTAGAAAACTTATGTTNGG 

TATTCCNNCCCGGGGNCTTGCNAAGTGNAAATTTCGNAATMTTCCANAAGCCTTTTATA 

CGGTATTAACCCCCGGATCNGAACNCNTTCNTAAGGGGGGGGGGGGG 

Sequence 1451 

ACGACTNCTATAGGGCGAATT GGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTT 
nTTTTTTTTTTTTTTTTTTTI INGGCACAGTATCTCACTCTATCAGCCAGGCTGGAGTG 
CGGNGGCATGATCACCGCTCACTACAGCCTTGAACTCCTTGGCTCAGGNGATCCTCCCAC 
CT CANCCT CCCAAGTAGCTACNACTACAMTGCGTGCCACCACACCGGGTAATTTTTTTG 
GAi i n l I GTAGGG AC N AGGTTTCACCATATTGCCC AGGCTGTATCCCCTNTTTAN ACTC 
ATGCACTCTNTGTGGAGCAGGAACTACACTACTCTAAGNGATTCTGATTCTCCTCTNAAA 
TCCCATCGGTCACCCCAANAAAAGCACTCACTACAATTCTTTCCCATAAGGAGAAATAAT 
TAT 

Sequence 1452 

CCGCGG TGGCG CGGCCGCCCGGGCAGGTAC I rTTTTTTTTTTTTTITTTTTI I GTCCATC 

TGCCCAGTTTTATTGGGAACAAGGGCATTATAACTGCTATCAAAGAGAAGGGAGCCCAAG 

GG CTCCTTQ TGTANCAAGATCCTTCTTCANAGTTTNAGCCAGGGCTGGGAGGGTAAGANA 

CCCi mi ICANGCAGGGTCACACTACCACCCTCAGCATGACTTCCCCAAAAAGTTATCC 

TCCTTTAGCTCAGCACTTGGCACTTNAGGAAGAAAGGACCAA 

Sequence 1453 

CGGGCAGGTCGAGCTCCGGCGCACGATGCAGAACCTGGAGATCTCGCTCTTGCCGCTCTG 

CACCAGTTCGCTGTTGGTGGCCACCTCGCGGTTCAGCTCCTCTGTCTTGGTGAAGAACCA 

ATCTTCGGCATCCTTGCGGCGGACGCGTGGGTCGACTCACCTnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTACCGAAGCC 

GGAAGCATNAAAGTNTAAAAGCCTGGGGGTGC 

Sequence 1454 

AGGTACTCCATCGCCTGCTGCTTCAGCTTCTGCAGCTCCCTAGTGGCCCCACAGCTGCGG 
CCGCCGCCTTCTTCCTCCATGATGAAAACATCTGAGACTCAGGGCTAAGCACCTTGCCCA 
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AGGCCACACAACAAGTAGGTGATGGAGAGTGTTGGCGTGTACCTGCCCG 
Sequence 1455 

CCGCGGTGGCGGCCGAGGTACCANNGACCTCTAACTCCCCCTGACACAGAGCAATTAGAC 

TCCCATAACAATGGTATCAATTATACCACTCCATTGGAGGGACTTCCTTTATGTGTCACC 

CAGGATACATTGCTCAACTGCAGTTGCCTTGCANTTTGATCCCAAGCATGGNTGAGTTAC 

CATAAAAAAATTATGTACCT 

Sequence 1456 

CCGGGCAGGTACNAGACAGAGTCACCATCACTTGCCGGTCAAGTCAGACGATATCTATGC 

ACAGTCTGCGAGCTGAAAACGGACCCTCACTTTTGGCCGGGGGACCAAGCTGAAGATCAA 

CNCGNAACCTGTNGTGCTGCNCCMTCTGGTTCTTTAATCATTCCCCGCCCANTCTTGAT 

GNAGNCACGNTATGAAAAATCCTGGGAAACTGGCCCTACTGGTTTGTTGTTGNCCATGTC 

TTGAAAATTAAACCTTTCTTATCCCCAANNAAGNAGNGCCTCANAANGNATNNCGCNTTC 

TGTGTCACCGNCTTTCCTTATGGAAAACCTTAATGTTNGTGTANTACNCCCCCCACTGTG 

GGTCCTTN GTC N AATG NGGTAAAATTTTTC C 

Sequence 1457 

CGGGCAGGTACTTCCTGTGGAACTCCCTGGAGATGAAGCCATGTTCGTCCTGGCGCTCTT 

CGTGCTTGCCGTGCACCTCAATCACGTTTCCCAGAACCTTGACTTTGAGTTCCTCTGGAG 

AGMGTGCTTCACCGTCCAGNTTCACAGGAGAACCCTGTNCTTTTCCATACGCATNTTCT 

GAATAGCCCAGTGTTCAATTCCAACTAGGGTGCCCCNC 

Sequence 1458 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAGATGTCACCGGCAGCATAGATATCAGGAAGG 

GATGTGTGCATATGATCATCCACTTTCAGGCCACCATCTTCCCCTAGATCAAAACTGTTA 

CCATGGAGAAAAGGTTCTACATTTGGTGTAACTCCTGTANCACTGACAATGAAAGCGCAG 

CCATATATCTTTTCATTGGTCAATTCCACATAGACAGGCCACATCTCTGTATCGGCTGTA 

ACTGACTTATGGTCTCTTGGAAAAGTGAAGGACTTTTrCTTCAAMTTCTAAACTCATC 

TGMGGTAGATTTTCTTTACTTCACACATAGTTTCAAGGTGAATCTTATGAGAAAACTCT 

TTTGTTCCTTTAAGATTCAAGCCTTCATGCCAAATCTTGG 

Sequence 1459 

TCGAGGCCGAGGTCTTGCGGATCTGGCGGACCTGCTGGTGCTGCGCATAGGATGTCTTTC 
TAATCTGGTTGTTACGTTTCTTAGTGAAACCAACACAGAAAAGTCGGAGCAAAATA 
Sequence 1460 

CCGGGCAGGTACATAAAATGGCAAGAGTGCATCCTTTAAAGCTTGCACATGAGAGACTTG 

GAAACAATCTTATTAGAATTGTGAAAATTCTAAGTAGTCAAAAAAMTAAAAAAAAATAA 

AAAAGAAAGTCAAGTACGTGCCAAGCATNCTNGTGCGACCGCGAGAGCCCGGGGAGCGGG 

GGCTTGC 

Sequence 1461 

TAGGGCGNTTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTATGAATGTAATTCATAATT 

TAAAAGGAAAACTAAAAACTATTTTGATTTGGGAAAATGAGCCTTAATTTGTTAAACCT^ 

TACACTGAGAACTAGCCTCAGGCTTAATATTCTCATTGCATTTGCAAGATCTGAGCAAAT 

AAGATTAAGTAAMCAAATCAATTGTATATATAATTGACCTTTrTGTGGAACATGTAGTT 

TATAGAAAGTATACTCTAAAGGGAATTTGCCGAAGACCTTTTACTGATTGAACAGTTGTG 

CTACAATCAACTTTTCATAGTACCTCGGCCCGCTCTAGAAACTAGTGGATCCCCCCGGGC 

TGCAGGAATTCCGATATCAAGCTTTATCGATACCGGTCGACCTCGAGG 

Sequence 1462 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTCGGATTTTG 
AATTTCGTGATTITG GTTTG GTATAAACGGTAAAAGTGTGTGTGCCCTCTTTACCTGTTC 
TTTGTCTTGTGGTATGTGTGGTGTGAGCGTGGTATTTTGTCTGGAAAAAAAAAGAAAAAA 
AAAAGTA 
Sequence 1463 
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CGGCCGCCCGGGCAGGTACTCAGTGGATGACGAGTGCTTGGTTGAAATTGTTGAAAGGCC 

TGTG TTCTGAAATACCTGGGCCGTGTCCAGTGAGGCCNAGGAGMTTTTTATNGAGCATC 

TTTTGCCAATGAANMGAANATTAMTATTGACCACTTACCTTGATCCCAAACGCTCCTC 

ACTGGAAGANTGGCACCATGNCGTGTTTTTATTNGAAGCTAAAGGCNANTAAAANNGTAA 

AGTAGTGCTCATTCANMNNTATNTTNGGNAAANTTTGTCNCNNANTCCAAANAAACTTT 

ACCANANGCAANATTTNACCTTTCCCAA 

Sequence 1464 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTATAGCTGTAAGGAGA 
AGCTGAGAAATGATACCCAGGAGCAGCAGGCTTTACGTCTTCAGCCTAAAACCTAAAAAA 
AAAAAAAAAAAAAAAAAAAATTTAAAAC AG CTATTAAACTGAAAGCATCTGTAN 7TAAAA 
AAAAAAAAAAAAAAAAAAAAANGTNCCTN 
Sequence 1465 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTGAGATCACTTGCACTTTTCCTCATTTCTTAA 

GAAAATCCAGTTTGCTTGCCATGACTTGTTTAGCGTTAGGTTTACAAAGCTTGTTGTTG 

ATGTTGAATATTTGTCCAAAGGAGTTTGACAAAATAGAAGCTGCTTGGTCCCTGTTCTGC 

CTGCTAGGTCATAAGGATGAAGGCCATGTGGCTGGGGTACCT 

Sequence 1466 

CCGCGGTGGCGGCCGAGGTACTTTTCTTGAACTCAGCATTGCTGCCCTGAACCCTCCCCC 

AGCTGCCTTCCCTACTTTGGGAGCCTCTATTTCTAGAACAGGAAAACGCTAGGCTAAATA 

TATGTCCTAATGGTCTCTTCCATCGGTGTGAAGGAAAGAAAATGGGCCCAATGTTGTTGC 

AAGGATTAGGGAAGGTCCTGCCTTGCCACCCAGTGGGTTTAGAAATCCATCTTTACGGCC 

TCCAGAGAAGCTGGTGATCAGGTTCCCAGACATCAGCAATCCTGTTTCCTGAGGAGCTTG 

TGGGGGCTCTCTCTGGGGATGAATCTGGGTCCTAGACAAAGAGCCACAACTGATGGGGGT 

CACTGGGATTGCAGGGGGCCATGGGCTGGC I I I I I GTCCTGG 

Sequence 1467 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCACGGCTAATGTTGCTG 

CTAACACTCTGACnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnCGTCATCACTATTTCTCCAGTGGTATCTGGGAGCCAGAGTAGCAGG 

AGGAAGAGAAGCTTGCGCTGGGGCTTCCATGGTTCCGTCTGGGTCCTAACTGAGCAGTTC 

CTCCC 

Sequence 1468 

NCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGGAGGAGTGAGTCCC 

TATGCTGACCCCAATACTTGCAGAGGTGATTCTGGCGGCCCCTTGATAGTTCACAAGAGA 

AGTCGTTTCATTCAAGTTGGTGTAATCAGCTGGGGAGTAGTGGATGTCTGCAAAAACCAG 

AAGCGGCAAAAGCAGGTACCCTAGCCATGCTGAGCTGCAGAGGCCACAGGACAGCTCTCC 

CCGCGTACCTGCCCGGN 

Sequence 1469 

CCGCGGTGGCGGCCGAGGTACTGCGCAGCAAAACAATTCTATCTTCTTAATAGTTCTTCT 
ATGTCTTTCCCAGTGTTTACTGAAGAAGCTTCACCCTTGAAACGTCCTTTCTCCACCTCC 
ACCTCTACCTAACAAAGTCCCCCGCGTACCTGCCCGGGCGGCCGCTCTAGAACTA 
Sequence 1470 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAAATGGTTTTGAAA 
GTTAGGGAGCTGCTTTCTCTCTTCATTCCTGTGGGTGCCCCAATTCTCCATMTATTTGC 
AGCTGAGAAGTATGTCTTTTTATTCATTTTTAAATTTTCATTTAAGT 
GAGCAAATTAAAATCCTCCCTTAAGGTGAAAAAAAAAAAAACAAACAAAGTCCT 
Sequence 1471 

CCGGGCAGGTACCACCATGGACACCTATAAGGAACTGATTGAGAGGATTCCTGAGTTGAA 
CAAAGTGGCACGAGCAGCTGCAGAGGTCGCTGGTCAGTTCGTGATTAAACTAACCCATGA 
TGTTGAATTGAACCTGGACTATGAGAGGTACTC I ITTTTTTTTTTNTTTTTI IGAAAACT 
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TTTTATTGCTCTTTTGGATAGAMCGGGAATTTATTTGCCAGGAAGGATGATCCCATCAT 

ACTTCTGCTGGAACCAGCGCATGGCCTCCTCTTTGCTGATTCTGTGTTTGGCCCCAATGC 

AGCCTGTCCTGCGCTTCTTGTCTGCGATGCTGAAACCTGGCCTACCCAGCACCACATNNN 

AAGTCCAGGGCCGTTAGATACCAATGCTT 

Sequence 1472 

CGGGCAGGTACTTACAAANTGCCCCATTCTNTTCATCTGAAMTGTCCCCGAGCCTTTTG 

GAAACAGGNGCAGGGCATNCANANTNTTCCCANTTTTTATNGAACTTGANGNACGGTTNC 

AACNANMNNNNCTTMTTTCTACANACATCTTTNTTGGGTNATTTGTTACCGAAGNTCC 

TCCTTATTTGAGACCCCACCTAGACACCGACCTTGNANATTGGAACAAGTAAGCCCCTNT 

NCNTTCNTACAATTGNAAATGGTANCGTGGGAAGAAAGCCTTGGTTNGMTTTTANAACC 

CCCTGANTTNATNGTTGNCCCATGGNACCCAGGGGAACCAA7TGGG7TTTTTTTAANAAA 

AAA 

Sequence 1473 

AATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGCAACCTCTTGGTCAAACATGGCATG 
CCACCATCTTTTTCAAAGCCCMTATTGAAGGTGCCCCAGGGGCTCCTATTGGAAATACC 
TTCCAGCATGTGCAGTCTTTGCCAAAAAAAAAAAAAAAAAGTACCTGCCCGGGCGGC 
Sequence 1474 

AGGTGTCGACCCACGCGTCCGCAAATAATGTTAATCCACACACCAAAAGGACGAACCTGA 

GCCCTAATAATTGTATCCCTAATTATATNTAATCGGGCTCAACCAACCTTCTAGGGCTTC 

TTCCCCATACATTTACCCCTACCACTCANCTATCTATAAACCTAAGCATAGCCATCCCCC 

TATGAGCAGGAGCCGTAATTCTAGGCTTCCGACACAMCTAAAAAATTCTTTAGCCCACT 

TCTTACCGCAAGGAACCCCCATCTCACTAATTCCCATACTAATCATCATCGAAACTATTC 

AAGCCTATTTATTCAACCGATAGCACTAGCCA 

Sequence 1475 

AGGTACCGCTTCCCTGAAGAGCTCACCCAGACCTTCATGAGTTGTAACCTCATCAGTGGA 

ATGTTCCAGCGAATGGACAAACTGAGAAAAGAATGCCTTTGCTAGTGTTATCCTGTTTAG 

AAACAACAACAGCAGCTCCATTTCTGGGGTTTGGGTCTTCCGAGGCCAAGATCTTGCCTT 

TCTGCTGAGTCCAGATTGGCAGGTGGACTATTGAGTCATATACATGTCGGAAACTGGGAT 

CCTGGTAGCCGAGGAGACGTAGACCCTCGTTCGAGACCTGCCCGGGCCGGCCCGCCCCCG 

CGGGTGGGAGCTCCAGCTTTTTGTTC 

Sequence 1476 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC ITTTTTTTTTTTTTTTI 
TTTTTTTTTTGGGCAN CTAAAATNTAGGATGGNGN N AAGCTTCTTCATTTTGN CAATTAT 
ATATAAAAAATN ACAAACAACATG AATCN GCATTTCTTGG ATG AG ATAN NTAATAACAAA 
CTATTTCTCAATATTTGTATACTAAAAACTAGTAAAGGTGNTATGTGTTTCAGTATCTTA 
TCTNTTATTTGMCATGGGTTTCTGAAAGGAGCCTATNTAATAATATAAATGGTATGTAG 
TAAANGAGGCNCTGCCTTGGCTGGGACTGCTNTAAAAAAATTACCATAGACCATTGACTA 
ANCCACAAACNTATACTTCTCACANNTCTGGAA 
Sequence 1477 

CGGGCAGGTGTACAAGCTAGCTTGAGTCGACTCAAGCTTAGGGATCTAGACCACGAGCGG 

CCGCNnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnGCGNAATCATGGTCATAnnnnnnnnnnnnnnnnnnnnnnnnNATCCGCTCACAAATTC 

CANACAACATACGAANCCGGAAGCCATAAAGTGTNAAAGCCTGGGGTG 

Sequence 1478 

AGGTACTTCAGGGAAATACCACCTCCACAAACGTTGAGCCAATTCAGGGGCTGCCAGTAG 

AATCCTAAAGAGTGATGTCTCCGGTTCATGTGAACACCATCTGCANTCCACACCCCGCTT 

TNAGGGACAAATTTNANNGCCATTAAGNGNNAAGAACCATTGCTTCAAGCCAGAAGCTNG 

TCCCCCTTTTNATCCCANTCCCCNAGTTTCACTCACCCCCTTTAAGNCCTCTGGAATATG 

GGNGACCNAGTGCCGGAATGGGCCTTCNAATTAAGNCTCNACGGGTAAANGGGGCNTGGC 
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TCCTNNNCCCNTTCAAGNGGCNTTATNCTCCCNAAGATANANCNCCNAAANTTCTATANG 
GGGGGG GCNM TTTMCCGGNCCGNGGCCTTTCNANNTCCCCCAACCMCCNTTTTTTGG 
NCMNCCTTTTCAATTAAATTTTTCCCCCCGGTCTAATTGGGCCTTTTCCCCCGNCCT^ 
TAAAAN AATTGGTTAAN CC CCTT 
Sequence 1479 

AGGTAGTTAATNTTCGGATGGTTCTGAAGCTTGGCTTGAGACCCCCCTCTCCAACACACA 
CACACACACACACACACACACACACACACACACACACACGCACACATTATCCAAGCGGAC 
GCGTGGGTCGACTCAAGCACCTGCCCGG 
Sequence 1480 

CCGCGGTGGCGGCCGAGGTACTGTGGAGCTGTTGGGACTGGGCAGGCGGCAAAGATCTGC 

AACAACATGCTGTTAGCTATTAGTATGATTGGAACTGCTGAAGCTATGAATCTtGGAATC 

AGGTTAGGGCTTGACCCAAAACTACTGGCTAAAATCCTAAATATGAGCTCAGGACGGTGT 

TGGTCAAGTGACACTTATAATCCTGTACCTGCCCGGNATNTTAATGAACCGNTGCATGCA 

AATCTTNTTTACTTCATNTCCTGTCAGGGCATACTTAANTNTGTTCCTCAGGAAAATGAT 

GAGGGGGAGACACTNTCTNAACTTGNGGGGACCNGGTGGATGGACG 

Sequence 1481 

CCGCGGTGGCGGCCGGGATCGCGGAAAGTGATGGCTGTCGTCCCGGCGTCTCTCTCAGGA 

CAGGACGTGGGATCATTTGCATATCTTACAATTAAAGACAGAATACCACAGATCTTAACT 

MGGTTATTGATACATTGCATCGACATAAAAGTGAATTTTTTGAGAAACACGGAGAGGAA 

GGCGTGGAAGCTGAAAAGAAAGCTATCTCTCTCCTTTCTAAATTACGGAATGAATTGCAA 

ACAGATAAACCATTTATCCCCTTGGTTGAGAAATTTGTTGATACTGATATATGGAATCAG 

TACCTCGGCCCGCTCTAGAACTAGTGGGATCCCCCGGGCCTGCAGGGAATTCGATATCAA 

GCCTTATCGGATACCCGTCGACCTTCGAGGGGGGGGCCCCGGGTACCCAGCT 

Sequence 1482 

TACCAGTGGCCCTGGTGGATGCACCATAAATGAGGAGCCTGGGAGCCTGGCCAGGTTTCT 

GCTGGTACTGTTAACAACCGGTATTAGAAATTGAGATGTTTTTATTTTGACTATTTGAG^ 

GTAACATTGTAGATMTGTTCCAAACCATCATCAAAAATGCAGAGCAAACTTTTCTGTAT 

ATAAACTGTACCTGCCCG 

Sequence 1483 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCATNCCTTGTTTTCTGC 

TCATTTCTTNCAGCCTGAGGAACAGACGTCAGAAAAAAAAACAACCACCCAGACAACTTC 

CCCTTTTAGCTGAGCTGTCTTCTGACCAATCAACAGGGACCAGCCAAAAAAAAAAAAAAA 

AAAAAAAAGTNCCTGCCCG 

Sequence 1484 

AGGTACGCAGGCGGCAGCCTCACTCTGAATGGACCAAGGCGCCAGCTCCTTGTCCAAAGC 

CAAAACCTTTGGGCCCAAAGTTTTTGGCATAGCATCCTTTGCAATAGATCTCACCATCCT 

TGTCTGCCAGGGTGGTCGACTCGAGGCCTTTGCCACATTTGGCACATCGGAAGCAGGACT 

TATGCCAGGACTTCCCGGCACNCGATCACCTTCTCCGCTGCATAGACTGCCTGGCTTGCA 

TCGCGGGACAGCGCTCAGAGCCGCCCGATCTTCTGAAGCAAACTTGGATGCCATTGGGGT 

TGGGTGGGTAGCGGGACGCGTTGGGTCCG 

Sequence 1485 

CCGCGGTGGCGGCCGAGGTACGCCTTCGCAGTTTCCTGCCTGATCCTTCCTCCAGCAAGT 
AGGTCACATCAATGGCATTTTTTCCAGGGATTTTTTCAAGGAGATTGAGCAAGATCCGGT 
TGAGTCGTTCCCGTTGGTTATGCCGTCGTTCCTCAGGGGTACCTGCCCG 
Sequence 1486 

CGGGCAGGTCGACTCCTATAGGGTCTAGATCACGAGCGGCCGGCCGCACTTTTTTTTTTT 
I I H M I I I I I I II M I M I t I I I I I I U I I I I M 1 AAAN AC ACACACACACACACACAC 
ACACACNCNCACACACACACACACACACACAC 
Sequence 1487 
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CCGCGGTGGCGGCCGAGGTACTGGGAAAAGGCCACAGGGAGAAGGAAGGCTCCAGCTGAA 

CTTTATAGCAA7TTGAACTCAACCAGAAGCCTCCTGACCAGAACTCGATGGGAGGGGTGA 

AGCTGGCTTGCAGATTCCACAGGTGGGGAMGAAGGAAGGCTTTATTTGCTTTGCAGCTG 

GGAGGTGGGTACCTGCCCG 

Sequence 1488 

AGGTACAGTGGTTAAATAATTGACTCAATGACAAAAGTGGGATTGGAGTCCCAACAGTGT 
GGACATGCTTGCTTTGGATTGAATGTTTGTGTCTTGCCCCATATCCCAATGTTACAGCCC 
TAACCACCAATGTGATGGTATTGAGATTAGGGCCTTGAGTACCTGCCCG 
Sequence 1489 

AGGTACGGGGGCCAGTTATTATACTGCTGACAGTAATAMCTGCAAAATCTTTAAAATTG 

AAGGCTTGTTGMTGGTGAAGMGTGAMCTTTGTCCCAGAACCCACTGCCCACTTGAAA 

CCCTGGCTTGGGATACCAGTGGGCCCTGGTGGATGCACCATAGATGAGGAGCCTGGGAAG 

CCTGGCCAGGTTTTCTGCTGGTACCTTGCCCGGGCGGCCGCTnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnGCTTATCGATACCCGTCGACCTCGNAGGGGGGG 

GCCCCGGTTACCCAGCTTTTTTGTTCCCTTTTAGTGGAGGGTTAATTGGCGCCGCTTGGG 

CGTMTCATGGTCATAANCTGTTTCCTTGTGGTGAAAATTGTTTATCCGCCTCACAAAAT 

TTTCCACACCAAACATACNA 

Sequence 1490 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATTTTGTTCCACCAAG 
TTATCCAAAACACCAGTGAGGAAATCTTTGCCCAGGGATTCCAACACCTTAAAGTGGGCT 
TTTTTTTGTGGGTTGGCCTTCTGCCATAGGGAACAGCCTCCGTCCI UN IACAGCGTTG 
GAAAACCCCGCGTACCTGCCCGGGCGGCCGCTCGAATCA 
Sequence 1491 

ATACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAATAAGCATATTGC 

TTTGGCAATGCATCTCCAGAGCAGGTGACCCTGGCCGTCTGTCCTGGGGACACTGACACC 

GAGGGTGGCTGTGCCAGTTCATAGGAGGCCTCAGAGCCTGTGCAGAGAGTGAGGAGGGGG 

AGAAGTAGAGGGATCCAGGCCATGGTGAGACACCCAGAGTTCTGCCTCCTGGACCCACCC 

CCGCGTACGTGCCAAGCATCCTCGTGCGACCGCGAGAGCCCGGGGAGCGGGGGCTTGC 

Sequence 1492 

GCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGAGCTGGAGACGGCCTGG 

GTGCTGGCGAAGCGGAGGCCGGANTAAGAAAGACTGTTAGAAATGCCCCTCGGGTAACAC 

AGAAGGCTTGAGAGATCCTGACATAAATCCTTGTTTGTCGGAATCTGATGCTTCCACCAG 

ATGTCTGGATGAAAATAACTATGACAGGGAAAGGTGTTCCACTTACTTCTTGAGGTACCT 

GCCCG 

Sequence 1493 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTTCTTCACTTGGG 

CCACCCACCGTGATTTCTGTCCTGGATTCCCTGAGCCAAATACCCATGATGGGATGGTTC 

AGTGATGTGGATGGATGGCTATTTGTGTAAGGGCCGAAAACTAGTTTTCTAAGATGGGAG 

AGCTGGGCAGTTACTTATACTCTGAAGCCCTTCTCTTTCTCTGCTGGGGCTTTCCCAGCA 

GTTCCTAATAGTTCCTTGTATTTCAGATAAGTCATCATGATTGTTAGGAAAAAAAGAAAA 

AAGAAAGTTAGTTTTTAGCCTCCTGACTC 

AMTAATAGCTCATTATTTTTATGGTAGCTATTTGATTATm/^ 

GAAGAAATTCAGAGAAGTTCAATAGAGAGTAAT 

Sequence 1494 

CTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGAGTGTAAG 

AAGGCCATCTACAATCAAGATTGAGAGTGGCTCTAACAAGTGCCATTTTTCCTTGTTAGC 

TTTCATTTCTCAGCCCTTTACMGATTAAAATAGTCTGCAGTTTAATCTCTCCAAAGCTT 

TACGGACAGTGATTCTGTCCTAAACAAGACAGTGACTCCAGGATTTCTGAAGACTATTGT 

GGAAGAAGCATCCATTAAGGCCAAGCTATAACATCAGAAATGGTGAATGAA 
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Sequence 1495 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGGAGGAAGTGAAGTCC 

CTATGCTGACCCCAATACTTGCAGAAGGTGATTCTGGCGGCCCCTTGATAGTTCACAAGA 

GAAGTCGTTTCATTCAAGTTGGTGTAATCAGCTGGGGAGTAGTGGATGTCTGCAAAAACC 

AGAAGCGGCAAAAGCAGGTACCTGCCCG 

Sequence 1496 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAACTGGCCTCAT 

CCTATATTCACTTTCGGCCCTGGGACCAAAGTGGATATCAAACGAACTGTGGCTGCACCA 

TCTGTCTTCATCTTCCCGCCATCTGATGAGCAGTTGAAATCTGGAACTGCCTCTGTTGTG 

TGCCTGCTGAATAACTTCTATCCCAGAGAGGCCAAAGTACCTGCCCG 

Sequence 1497 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCGCGGGAGCTTCTCCTTGCCAGT 
TTCTCCCAGCANGGACCCTCTTCTTGTTTTGAMGATGGTCGGCTGCTTTTGGTAGGCAC 

GCTCAGTCTGAATGTCCGCCATCTTCCCGGCCGCTCTAGAACTAGTGGGATCCCCCCGGG 

CTGCAGGAAT 

Sequence 1498 

ACTACTTAGGGCG AATTG GAGCTCCCCGCGGTGGCGGCCGAGGTACCTGATTGCATGCAC 

TTAAATGCAGATTATTTTGGAGTTTGAAMGGGACTATTAATGAMTCTT^ 

CCTTTCTCi I I I 1CCCTTCCCCGCCACTGATTCAGTGAGCTGGAGATTGGATCACAGGTA 

TAATTCAAGC I I I I ICATGTAGTCATGTAGATCACTAGACTCCTTGGTGTACCTGCCCG 

Sequence 1499 

ATCATACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGCCATT 
ACTGCAGGAAAAGGTCCCGGAGAGCTAAGCAGTCAAGATGTGTGACTTCACCGAAGACCA 
GACCGCAGAGTTCAAGGAGGCCTTCCAGCTGTTTGACCGAACAGGTGATGGCAAGATCCT 
GTACGTGCCAAAGCATCCTCGTGCGACCGCGAGAGCCCGGGGAGCGGGGGCTTGC 
Sequence 1500 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTATATCAACCAGAGCA 

ACATACATGAATAAGCCAGCAGTAAGTGCAMTATCCACATAGAAACATTTTCAGCATAA 

TGACCAATGAAAATTCCTGTTGCCATTCCAAGATACGCCAGCATGGCTGACAATGCATTA 

TAAAGGACAGCCTGCTTAACGGTCATGCCAGCCTTTAGTAGAACAGCAAAGTCACCTAAT 

TCATGAGGCAACTCATGACAGAACACAGCAACAGAAGTACCTGCCCGGGCGGCCGCCCGG 

G CAGGTACAAGGGATTG ATTTAAAC C AAATTCGAGGACTTGG G TTTGATG CC AC C GTGTT 

CTCTGGGTTGTTTTGGATAAAGCAGTTTCACCCATTACCAGTCAACCAGGAAGGGGGATT 

CCCATCGAAACGTCATCATGTGGCTGGACCATCGA 

Sequence 1501 

CCGCCCGGGCAGGTACCAACAGAAACCTGGCCAGGGCTCCCAAGGCTTCTTCMTCTTAT 

GATGCATCCCGACANGGGCCCACTTGGCATCCCCAGCCCAGGTTCAGTGGGCAGTGGGGT 

CTGGGACAGACTTCAnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnn n nAGCAGCGTAGC AAACTCGTACACTTTTG.GCC AGGGGACCAAGCTGGGAGATCAA 

AACGAACCTGTGGCTGCACCATCTGTCTTCATCTTCCCGCCATCTGATGAGGCAGTTGAA 

ATCTGGAATTGCCCTCTGTTGTGTGCCTGCTTGAATTAACTTTCTTATCCCAGAAGAGGC 

CCAAAAGGTACCCTTCGGCCCGCTTCTAGGAACTAGGTGGGATTCCCCCCGGGCTTGCAG 

GGAATTTCGATAATCAAGGCTTATTCGNATACCCGTCGACCCTTCGAGGGGGGGGGCC 

Sequence 1502 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGGGGGCATTCGTATTGCGCCGCTA 
GAGGTGAAATTCTTGGACCGGCGCAAGACGGACCGGAGCGAAGAAAAAAAAAAAAAAAAA 
GTACCT 
Sequence 1503 

CCGCGGTGGCGGCCGAGGTACGCGGGGGTCACTTCCGGCTTCCTTCAGTCCGCTGGTCCC 
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GAGCACGAGCTGTGAGGGGATTCACTTGTGTGCGGAACTCCTCGGAACCATGGCGTCCCT 

TTCCCTTGCACCTGTTAACATCTTTAAGGCAGGAGCTGATGAAGAGAGAGCAGAGACAGC 

TCGTCTGACTTTCTTTTTATTGGGTGCCATCGCCATTGGAGACTTGGTAAAGAAGCACCT 

TGGGACCCAAAGGCATGGACAAAATTCTTCTAAGCAGTGGACGAGATGCCTCTCTTATGG 

TATCGAGCGGCCGCCCGGGCAGGTACCCTTAAACTGGCAGGACATTTTTGAAATCACAAA 

TTTGCACATAAAGAATGTCACGAACAGCCATGTATCCATATACAGCAATCAAATAAGGAA 

CTTATGACCTAAAGCAAAGGTAAACTTTCTTGAAACT 

Sequence 1504 

NAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCTGTGGTGATGCAGTCAAA 
GCCCAGGACAGTCGTATAAAGGAAAGCAAGACCCATGCCAGCCAGAAACACAGGCTGGTT 
GTAGTAGGAGACCCATCCATCTCGGAAGGTACCTGCCCG 
Sequence 1505 

AGTCGGGGAAACCTTGTTCTTGCCCAGCTGCATTAATGAAATCGGCCCAACCGCGCCGGG 

GAAGAAGGCGGTTTGCGTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnCAGAATCAGGGGATAACCGCAGGAAAAGAACATGTGAGCAAAAAGGCCANCNAAAA 

AGGCCAGGGAACCGTAAAAAGGCCGCGTTTGCTTGG 

Sequence 1506 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAGCAGAAACCTGG 
CCAGGCTCCCAGGCTCCTCATCTATGGTGCATCCACCAGGGCCACTGGTAnnnnnnnnnn 
nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 
nnnnTCTGCAGTTTTACTACTGTCAGCAGTATAATAACTGGCCTCCGTACCTGCCCG 
Sequence 1507 

CTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACATTCAGGACAA 

GGATGGCGCTCCAGTGGCCACCAATGCCTTCCATAGTCCCCGCTGGGGTGGCATTATGGT 

ATATAATGTTGACTCCAAAACCTATAATGCCTCAGTGCTGCCAGTGAGAGTCGAGGTGGA 

CATGGTGCGAGCGATGGAGGTGTTCCTGGCACANGTTGCGGTTGCTCTTTGGGATTGCTC 

AAGCCCCAGCTGCCTCCAAAATGCCTGCTTTCAGGGGCCTACGAAGTTGAAGGGGCTAAT 

GACCTGGGAGCTAGACCCGGCTGCTCTGGGCTCGGTCAGTGGGAGAACCTGGCCACAGCC 

ACCACCACCCTTACCTCCCTGGCGCAGGCTTCCGGGCAAGATCAAGCAACATTTGTCATT 

AAGGACGACGTGGCATCTGAGGTGTACCTCGGCCGCTCTAGAACTAGT 

Sequence 1508 

NCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTTCCGAGATGGATGGG 

TCTCCTACTACAACCAGCCTGTGTTTCTGGCTGGCATGGGTCTTGCTTTCCTTTATATGA 

CTGTCCTGGGCCTTGACTGCATCACCACAGGGTACATCTTTCAATTCTGGAATATCTGTG 

TCTMGCACAATATCTTCACACTGTGCTGTATTGCTGCTGAACTAAATGCACTTTTCCCC 

ACATATGGGGCACTGGCTTCAAACAATTCAGTTCAGTATCATTACTTTTAATCTCATCTT 

TCCTTTCTTGGTAGTTGTTAATACAGTTATGGAAAAGAGGCACATTGCATAGAAGCCATT 

GGGGAGTTCAGTGGAAGTTCTGTAAGAATGTGCATGTCCTGCCCG 

Sequence 1509 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAGCAGAAACCTGGCCAG 

GCTCCCAGGCTCCTCATCTATGGTGCATCCACCAGGGCCACTGGTA 

Sequence 1510 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGATGGGGGTCTCACTACAT 

TGCCCAAGCTGTGCCAGAATTCCTGGATGCAAGCAATCCTTCTGCTTTGGCCTCCCAAAG 

TGCTGGGATTACAAGCATGAGCCACCATGCCCAGACAAAGCCCTCAATTTCTTACATATC 

GATTTAAGGGCCTGAATAAACCAACACATTAAAAAGGAAAAAGAATAATTCACATACCAA 

GCCGTGAGTAAATTCAAGACTGAGATGATATAATCGGTACCTGCCCG 

Sequence 1511 
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GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTTTTGTTTG 1 rTTTTTTTTTI 
TCACCTTAAGGGAGGATTTAATTTGCTCCCAACTGATCGTCACTTAAATGAAAATTTAAA 
AATGAATAAAMGACATACTTCTCAGCTGCAGATATTATGGAGAATTGGGGCACCCACAG 
GAATGAAGAGAGAAAGCAGCTCCCTACTTCAAAACCATTTTGGGTACCTGCCCG 
Sequence 1512 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTCCTCTTTCACTATGA 
AAGTGTGCATGAGTCCCAAGCATCGGATGTCAAACAAGAAGCAAATCACCTGCAAGGATC 
GGATGGGCAGCANTCTGTCMGGAAAGCAAAAAAAAAAAAAAAAAAAAAAAAAGTTCCT 
Sequence 1513 

AATTGGAGCTCCCCGCGGTGGCGGTCGAGGTACATCATTTCCAGAGCAGGCACTGGCAGC 

GAGATAGGGTTGGAGGAGAAGTAGCGCCGGGACTTCCGGATGGCAAACTTCTCTGTGGGT 

AGAGATTTCCCAGCAATCTTGAGCTTCAGGCCTGGCACAGCTCGAAATAATTCCACTTCG 

TCGTCCCCGAACGGCTTGTGGTCCTCCTTCCCAAACATGCTGAGGTAGGCGGCCTTCATG 

TAMTGTAGGTGGCCTTTTTAAGTCAGATCATGTCAGTTCCTTCTGGAAATCTGGTTATA 

TTCCATCACACTCAGGAGACATCTCCTACAATTTCCTTGACACCTGCAGCACTCCAGCCA 

CACGACGGCCCCCGCGTACCTGCCCG 

Sequence 1514 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACCGGGGGATTCTTNAGAG 

GTCAGCCAAGTCGGTAACATTCTGTGAAGCAGATTAAATACAGTAAAATCAAACAAACGG 

AGGTCTTTGGAAGTGATGCTGGGCCCANGCAGTGAAAATCAAGAAAAGAGGAGAAAGACC 

AAGACCTGNATGTCCTGGACGACCTGACCCAAAAANGGGGCACAAAAGCAGCAGAGCTTC 

ACCAGGTGACTGGGGTGGGCTTCTTTTGTCCTCATTTAAATGTACCTTGCCCGGGCGGCC 

CGCTCTAAGAACTAAGTG 

Sequence 1515 

CCGCGGTGGCGGCCGNCGNACTCATTTACCCGGGGACAGGGAGAGGCTCTTCTGCNTGTA 

GTGGNTGTGCACAGCCTCATGCATCACGGAGCATGAGAAGACGTTCCCCTGCTGCCACCT 

GCTNTTGTNCACGGTGAGCTTGCTATAGAGGAAGAAGGANCCGTNGGAGTCCAGCACGGG 

AGGCCNTGGTTCTTTCCGCGTACCTCGGCCGCTCTAGGAACTANTGGATCCCCCGGGCTG 

CACGGAATTNGAATATCAAACCTTATCGATACCTTNCGACCTCCNNN 

Sequence 1516 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGNCNGGCCAGGTACAGCCCACANANG 
GCTGA TNGGCTTTTGNTGNTTNTTCTNGGAANATANTGATGAGTTCCTCNAAACNCNTTA 
ATTTTACCTCTTCCGGGACATCATCCTT 
Sequence 1517 

CCGCGGTGGCGGCCGAGGTACACTNTTNGTCAGGGGACCAAGCTGGAGATCAAACNAACT 

GNGGCTGCACCATCTGNCTTNATCTTCCCGCCATCTNANGAGCAGTTGAAANCTGNAACT 

GCCTCTGTTGTGTGCCTGCTGAATAACTATCTATCCCAAAAAGGCCNAAGTACCTGCCCG 

GGCGGGCGGCTCTATANCTANTGGATC 

Sequence 1518 

CCGCGGTGGCGGCCGAGGTACGCGGGGGGAGGAACTGCTCAGTTAGGACCCAGACGGAAC 
CATGGAAGCCCCAGCGCAGCTTCTCTTCCTCCTGCTACTCTGGCTCCCAGATACCACTGG 
AGAAATAGTGATGACG 
Sequence 1519 

CCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGCCCGGGCAGGTACCTCTGGAG 

AAAATAAGTGTGTTAGATAAGACTTGTTCATGCATGAGTGATAGTGCTGTTAGTGTGAAT 

TCAATGTTAATGMTCAATAAAATATATGAAATAAAGTGTCTTTAAACAGTMCACAAGG 

TTGTATGTTCTCAGTTGACAAAAACATTGTGACCAGAGGCTCACAGGAGTCTTAAGTCTG 

CATTTCCCCAGGAGCAATGGCTTAGCATTTACTAATCGAGTATTCCTGGCAACTTTATAG 

AACATAACTACTGTAAATAATGAGAATCAACTGGATATTATAAATCCCCAAATATATATT 
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TTGCTTTACATMTTGAGTGTTTTTCAAAGGCATATAGAAAAGAAAAAATAGGCTTTTAT 

ATGTACCTCGGC 

Sequence 1520 

ACTATAGGGCGAATTGGAGCTCACCGCGGTGGCGGCCCGCCCGGGCAGGTACGCGGGGAG 

AGCCACCCGGGCTGGGGACCGGGGAGCGGAGGCTGAAATCCCCAGACGCCGGTTTTCTGG 

GCTGGGCTTTCTGCCTTACTCACTCCTTCTCCCTCTTTCAGGATTTTACCGACCGCTCGC 

GAAACAGCTTTCACACAATGGAGCTTCATGTCCTCGTGCAGGAAGTACCT 

Sequence 1521 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACACAGAAATTC 

CTAGACTACAAGATGTCATCCTTCTAGAAGAAGACCTAAAGCATACACTGGGAGGAAAGT 

TTTACAGGGGAATCTAGACTACTTMTTTTTTCTTCACAGTMCAAATGACAAAGC^ 

ATATCTAMGGTTCCCAACACACAGGATAAACAATAAGGAAGTTCTCCCAGTGTTTCCCA 

TTTTAGGCAAAAAAAAAAAAAAGGATTTCAGCTAGCACAGTGCATCCAAATGACCAAAAA 

TACTACMTGCCGATGTTTTTTAAAAATAAATATAAAAGATT^ 

TGGAATAATAAAATTAATGGTGAATGGGTACCTN 

Sequence 1522 

GCCGGGGAGAGGCGGTTTTGCGGTATTGGGCCGCTCTTTCCGCTTCCTTCGCTCACTTGA 

CTTNGCTGCGCTCGGTCGTTCGGGCTTGCNGGCGAAGCGGGT 

Sequence 1523 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACGGGGGCCAGTTATTATA 
CTGCTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 
nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTACCAGTGGCCCTGGTGGATGC 
ACCATAGATGAGGAGCCTGGGAGCCTGGCCAGGTTTCTGCTGGTACCTGCCCG 
Sequence 1 524 

CCGCGGTGGGCGGCCCGAGGTACGCGGGGATCCAGAATACATTTCCAACAAGAAGCACTG 

GCCAAGTCAGCTTCTTCTGAGAGAGTCTCTAGAAGACATGATGCTACACTCAGCTTTGGG 

TCCCTGCCTCTTACTCGTCACGGTTTCTTCAACCTTGCCATTGCAATAAAAAAGGAAAAG 

AGGCCTCCTCAGACACTCTCAAGAGGATGGGGAGATGACATCACTTGGGTACCTGCCCGG 

GCGGCCGCTCGACCGCCTGGACCAACTTGTCGGAAACCACCTTTCTGATGCCGCCCACGG 

CAGCG 

Sequence 1525 

AGGTACGGGGGCCAGTTATTATACTGCTGACAGTAATAAACTGCAAAATCTTCAGACTGC 

AAGGCTGCTGATGGTGAGAGTGAACTCTGTCCAAACCCACTGCCACTGAACCTGGCTGGA 

TACCAGTGCCCTGGTGGATCACCATAATAAGGAGCCTGGAANCCTGGCCAGGTTTCTGCT 

GGTNCCTGCCCGGCnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnCGACCTCGAGGGGGGCCCNGTACCCANCTTTTGTTCCTTTA 

AGTGANGGGTTAATTGCGCNCTGGCGTAA 

Sequence 1526 

AGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCACCATGTCCGCCTCGGCTGTCTTCA 

TTCTGGACGTTAAGGGGCAAGCCATTGATCAGCCCGCAACTACAAGGGCGATGTGGCCAT 

GAGCAAGATTGAGCACTTTCATGCCTTTGCTGGTACCTnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnAGCTTATCGATACCGnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnCCTGTGTGAAATTGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnATTGnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnACCTGCGTNGCCAACTGCATTNAATGAAT 

CGGCCAACGCGCnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnNTTCA 

nnnnnnnnnnnnnnnnnnnnnnnnnCGGCTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnGATAACGCANGAAAAGAACATGTGAACC 
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AAAAAGGCCAGCAAAAAGGGCCAGGAACCCGTA 
Sequence 1527 

GGGCGAATTGGAGCTCNCCGCGGTGGCGGCCCGAGGTCAATATAGGGCAGACAGTTTGCC 

TTCAGAAATTCAGAAATGCAGCTTTTGAGGGAGGTCAGCATCATTGGTCTCAGCTACCAT 

TTTCCTGCAGGATGTTNATAAATAGTTCCTGGCAAAGGAGGTTGCGTATAGGGTCCTACA 
AG 

Sequence 1528 

CCGCGGTGGCGGCCCGCCCGGGCAGGTACCAAAATGGTTTTGAAGTTAGGGAGCTGCTTT 
CTCTC TTCA TTCCTG TGGGTGCCCCAATTCTCCATAATATTTGCAGCTGAGAAGTATGTC 
MM IATTCA1 I I I I AAATTTTCATTTAAGTGACMTC AG TTGGG AC AAATTAAATCCTC 
CCTTAAGGTGAAAAAAAAAAACAAACAAAAAAAGTACCT 
Sequence 1529 

TATA NGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTCTTN iTTTTTTTTTTTTTI 
TTTTGATTTGCAGTTAGTTTATTGAAATGGAAATGA^ 

AGCAGTTTCCACATAAAAAGACMCACAAGAAAGAAMGTGGTTTCTAGCAATCAACAGA 

TGAGTTCTGNCATAGTCTGTAGCCCTCTGAGCCAAACGCCTTGGGGTAAAGTTAATTACT 

CTTCATATTACACCAAATGCTGTCGTACCTGCCCGGGCGGCCGTCTAGAACTANGTGGAT 

nnnnnnnnhnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCGACCTCGAGGGGGGGCCC 

GGTCCCAGCTTTTGTTCCCTTTAG 

Sequence 1530 

CTACTATAGGGCGAATTGGAGCTCACCGCGGTGGGCGGCCCGACATCGCCGCCGAGGGAT 

CGTTGGCAAGCCCGGGTGGGAGCGTGGCTGTGTTCTCGTCCCTGGCTGACAACGAAGGGG 

AGCTAGTCACTTTTCCTGCCACGACGACTCGCGTCCGCCTCTCGCCTGGAGTACCTCGGC 
GCTCTAGAACTAGT 

Sequence 1531 

CCGCGGTGGC GGCCGC CCGGGCAGGTACATAACAAACATGGCGAAAAAGGAGATGTTTGA 

AACCATCTGCA! Mill ICTGTGATCGGTCTTTAAGCTCACTGTAAATTGGCAGGACTGA 

CGGGTGGCA AACA AATGCAAATGCAATGGTGGGTAMGCATACACCGGTCTTTGAATTGA 

AGGTAAC ATA \ \ l iGGCGTACCTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnTAATnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnTCACAATTCCACACAAACATACGAGCCGGGAGCATA 

AAAGTGTAAAAGCCTGGGGGT 

Sequence 1532 

CTATAG GGCGAATTGGAGCTCCCCGCGGTGGCGGCC^^ 

TTTTTTGAGATGGAGTCTTGTTTTGTCGCCCAGGCTCTGGAGTGCAGTGGCGCTCACTGC 

AACCTCTGCCTCCCGGGTTCAGGTTATTCCTCTGCCTCGGCCACCCTAGTAGCTGGGATT 

ATAGGCTTGAGCCACTTGCGCCCGGCCCAATTTTTATTTTTTAATGGTTGTT^ 

AATGTATCCCTCTAAAATTCATATATTGAATCTCTAATCCCCAATGTGATGACACTAGGA 

AGTAAAGCCCTTGGGAGATAATTAGGTTTANATGAGGACATGAGGATGGGATCCTCATGG 

TGAAATTAGTACCTGCCCGGGCGGNCGCTCTANAACTAGTGGGATCCCCCGGGCTGCAGG 

GAATTCNATATCAANGCTTTATCGATACNCGTTCGACCTTCAAGGGGGGG 

Sequence 1533 

CCGGGCA GGTA C I I nTTTTTTTTTi IT! Ml 1 1 IN I CTTCTCATACTGNGAATNGTTC 
TTGACTCCTTTTNTTGACATTNAGTTTTCANAATTT^ 

GCTGTTCTCTTGACTGCCTGNTGGGCCANCATCCGATTGCCANGNCCANAAACGTAAANN 

CTTGNCCAANATGGCCAAGNTACCTNGGC 

Sequence 1534 

GAGCTCCACCGCGGNGGCGGNTCGTGGCACTCATTTACCCGGAGGCAGGGAGAGGCTCTT 
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CTGCGTGTAAGTGGTTGTGCAGAGCCTCATGCATCACGGAGCATGAGAAGACGTTCCCCT 

GCTGCCACCTGCTCTTGTCCACGGTGAGCTTGCTGTAAAGGAAGAANGANCCGTNNNGGA 

NTTNCAANCACCGGGGAAGGGCGTTGGGTCTTGTTAGATTGAATNCTCCCGGCGTGCCNC 

ATTTGGCTTCTTCCCCACCTCCCACCGTGCGNATGGTCCTCNTGGGGAATAAGAAAGNNC 

CTTNTGGACNCAAGAGCCAAGGGTCAANGGCCNTGMCCCCATGGGCTTTTCTTTTGGGT 

TCCAAGTCCTTCAANTCCNCNNGGGGGGAATNGNGGGNGNGGCTAGGGNGGNTTGGTTAC 

CTCCTTCCGNGTCTCCGGCCTTTNTTNAGGGAAAACCTTANGGTNTGGGGGGNATTCCNC 

CCNNCCCCGGGNGNACTTTGGNCCAGTGGGAANAATTTCCCNAATTATTTNCAAAAAGGC 

CTTTTAATTCCGAAATANCCCCGGTTCGGNAANNCNTTTTGTAAGNGGGGGGGG 

Sequence 1535 

TCATACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGGGGTTGGGGAG 

AACCCGTTTTGA1TACAAGCAGATAATTATCTCAGTGAGATGGGGGTTAGTTCAAGGAAG 

TAAGGAGGGGGGAGGATGTGAGGAAGTTAGAACAACCCAATGCTTATTTGATGGGCTGAA 

TAAACTATTCCAGGACTGAACTATTTTTGAGCACTGTGAGGTGGCACAGTAATTACCTGC 

TTCAAAATCAACTGATACCAACATTTTTATCTTTGTATCTTATCTCTGTACCTGCCCG 

Sequence 1536 

CGACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGAAGTT 

GATGCCGAGCGCAGATCGCCTGCAGCTTGCTAGCTGTGTGGGCTGGGAGGTCTGGTAGGG 

CTGAGCTTGCAAGAGGATCMCATGCCTTTGGCTAGAGATTTACTACATCCGTCCTTGGA 

AGAGGAAAAGAAAAAACATAAAAAAGAAACGCCTAGTACCTGCCCG 

Sequence 1537 

ATCTTTTMCTAMNGTAATNGTCACCATAATCTTATAGACAAAGCATTGAGGTTTATTG 

AACTAATGCTGAAGGTAGTAAGTGGAGGAGCCAGGATGAGGTCAGAATCTGAGATTNTAA 

CCATGCCTATGCTGTCACTTCTTACACTNTAAAATACCTCCATGCTCNTGTGGACACCTA 

NGAACAANTGAATATTTCTATTCTTGTCCCAGAATNTCAAAACATTAAACATGTTANAGT 

GTATTTTNGNTTACCATAANCCTTTCCAGGAGGAACAAGCACTAAACACAGTCTCTGGCT 

TAGGATTTGGATGAACATATTCAAAAGCCATCTGCTTCCCAGCAATCATAATCATACCCT 

TTGCTTTTGGCCACTATCACCAAGATCTCCANTAGTACCTCGGCCGCTCTACAACTAGTG 

GATCC 

Sequence 1538 

CCGCGGTGGCGGCCCGAGGTACCTATGATAATCATGATGGAGATCTGGGGAGGGGAGAAC 

GTGGAAATGTCCTTCCGGGTGTGGCAGTGTGGGGGCCAGCTGGAGATCATCCCCTGCTCT 

GTCGTAGGCCACGTGTTCCGGACCAAGAGCCCCCACACCTTCCCCAAGGGCACTAGTGTC 

ATTGCTCGCAATCAANTGCGCCTGGCAAAGGTCTGGATGGACAGCTACAAGAAGATTTTC 

TATAGGANAAATCTGCAGGCAGCAAAGATGGCCCAAGAGAAATCCTTCGGTGACATTTAN 

GAACGACTGCANCTGAGGGAACAACTGCACTGTCACAACTTTTCCTGGTACCTGCCCG 

Sequence 1539 

CGAGGTACGCGGGGCAGGTGATGTTTGTTTTCACGATGGTCTTCAGATGCCCACGTGGGC 

ACTGCTGAGAAAGCCACTTGGTAAAACTGATGCCGGAAATGGGGCTTTTTGGGATCCCTG 

CTCAGCTGCTTCTGAGTCCCAGCATGCCCTGGTTACCTATGGCCCTCTTTCCCATGGGAC 

CTGACCTATGATCGGCCCCCGCTCCTAGGCGTGGTGACCAAGATGAAGATGCAGAGGACC 

ATTGTCATCCGCCGAGACTATCTGCACTACATCCGCAAGTACCTGCCCG 

Sequence 1540 

GGCCGCCCGNGCAGGTACAGCATCNCTGGTGGTTTCAAAGAACGTAGTCATTCTTCTCAC 

TGCAACANTGTANGATAAGCANGGNAGATCTGTTATTTCCAAATTAAAGGTGATTAANAT 

ATATGGAGAGAGANCATGGCATGTGAGGTTTATAGGGCTAGANACTGNACANCCATGTAC 

AACCCANATT 

Sequence 1541 

TTAACTGGACAGGTCAATTATCTGAGCTGTGTAAATCATTGTCTGGCACTACAGTTGAAT 



Page 210 of 635 



WO 01/51628 



Table I 



PCT/US01/00798 



CACCTTGACACTGAGCTCAGGAGCAGCCCATTTAAGAAGCATCTTGTAAATAACAAAGTG 

ACACCTTCGAGATTACAGTTGCAGCTATGCAAGTCATTCACAGAGTAGCTGAGGGCTCTC 

TTCTGAAGGAGGACCCATTACATGAAGGTAGACCTTTCATGTCTTGTTGCCGACGTCCAA 

GTCTCTTGCGGTCCAGAAATGCCAGGATGGTTGTTTCAAATGCATTGGGTATGAACTGCA 

CCTTGAATCTGCCCTTCCCTTTTTCGATTAMCCTTTTCACATAGCTGCTGCTGCTCCT 

Sequence 1542 

CCGGGCAGGTACGCGGGGGAGACCAAGGGCTAAAGCTGGGAGGTGAGTCTGTCACCTTGA 
GCCGGGCGAGCGCTGTGGGCCAAGCAGGGGTTGCAGGGCAGTAGGAGTGCAGACTGAAAA 
AATGCAGACCGCCGGGGCATTATTCATTTCTCCAGCTCTGATCCGCTGTTGTACCT 
Sequence 1543 

CCGCGGTGGCGGCCGAGGTACAGAAGGGCCATGCTGTTATTACTCTTACACAAGGAGGCA 

GCCCTCGAGCCACAGGGTCCAGCTGTTGGCTATAATAGCCTACCGGTCTCTGATGATCAC 

CATGTTTCTGGAATTCMGCCAGGAAGAAGCAGCAATCTGTCTTCTGGATTAAAACTGAA 

GATCAACCTAC7TTCAACTTACTAAGAAAGGGGATCATGGACATTGAAGCATATCTTGAA 

AGAATTGGCTATAAGAAGTCTAGGMCAAATTGGACTTGGAAACATTAACTGACATTCTT 

CAACACCAGATCCGAGCTGTTCCCTTTGAGAACCTTAACATCCATTGTGGGGATGCCATG 

GACTTAGGCTTAGAGGCCATTTTTGATCAAGTTGTGAGAAGAAATCGGGGTGGATGGTGT 

CTCCAGGTCAAATCATCTTTCGGTACCTGCCC 

Sequence 1 544 

ATACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCACGCTGGTTTTGCATCTTC 

AGGAGACGCTCGTAGCCCTCGCGCTTCTCCTCGGCCAATTCGCGGAAGAAGTGGCTCACG 

CCTTCCAGAGCCACATCATCGCGGTCGAAATAGAAGCCCAGAGAGAGGTAGGTGTAGGAG 

GCCTGCAGGTACCAGCTGCCATCTGTCAGTTAAACTTCAGAACACACATACCATAGGGAT 

GAAGATAAGAAAGGAATGATTGTGACTCTGGATCAAAATGCTNCCAGTTCCTGAAGGTGC 

ATG ATTTAAC TGTG A 

Sequence 1545 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCACCTAACACATTATTA 

TGTTTATTAACTTTGAGACCCAGAAATAAATTCTTTTCTTTTCTTGA 

AAATACAAAAAAAAAAATGTTTTGTTTTGTG 

MM lAATTGTCAGGATTATGATCTTGCTGTTTTTCTTCAATATGTATACAAGGTGATGT 

GAAAAGATGACTTTGGGCAGAGGAGTAAGAACAAAGTAGGCTTGNTTCTTCTACTTTGCT 

TCAGAACTCAGTTTMTGCCCAAAAGCCGGGAGAATCAAAGCCCATGTTTGACCGTTCTC 

GTTGCTNCACCTGCATTTTTCCCAGGAGAGTGTGACNACTCATGCNAGTCCCCTGAGAAA 

AMTAAAATTNAGGGACCATTACTTTCTCCTrTTTAGTCTTTT^ 

AACGTTTTAGTCNCAAAGGGGGAAC I (kill) I ATGCCTATTCAGGGAAAAGGGT 

Sequence 1546 

NCCGCGGTGGCGGCCGCCCGGGCAGGTACTGCTTCTATAAGAACGTGGTCCTGTATATTA 

TTGAGCTTTGGTTCGCCTTTGTTAATGGATTTTC^ 

GCATCGGCCTGTACCT 

Sequence 1547 

CACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACCTCATTGAGG 

ATCTAATGGACCCCTATCTTCAGCATGGGGAACAGGGTATAATGTTCACCACCTTGAAGG 

CATGTTACTACCAGATTCAGCGTGAGAAGCTTAACTAGGCTGCATAACAGCTTTGAAAAC 

TGGATTATCTACTACAGAGTGTTATAACACCATCTGGAGTCTTCCTGTAGTNGGCAAAAA 

AAGAACAAGTGTTGAAAmGGAAGGGGACTTTGTGTTATTTNAGGTTGTTAGAATG 

CTTACCAATAATAAGMGCCCTTGAGCCCAGAAAAAAGGACTGTTTAGTTTAAAGGGAGG 

ATTG AAAGG NAGNGTAAAAAAATCN NATTAGACCAGTTNTTGGCCGCTCTAGAAACTAGG 

TGGGATCCCCCGGGCCTGCAGGGAATTTCGATNTCCAAGCTTATTCGATTCCGNTCTGAC 

CTNCNAGGGGGGGGGGCC 
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Sequence 1548 

CCGGGCAGGTACCACAAGCTGTGTGTCTTTAAGCAACAGAAATTTCTTCCCTCACAGTTG 

CGGAGGTCAGAGGTCAGAAAACAAGGTGTCTGCAGGACCAACCTCTCCTCTGGATGCTCT 

AGGTGAGAATCTTTTCCATGCCTTTCTCTTAGCTTCTGATGTTGCCGTCAGAACTTCAGA 

TGGTGTTCCTTGGCTTCTGTCAATATTAATACATAAATCCTTTTCAGTCTCAGCTTCTCT 

CTTCACATGGTCCTCTCCACCCCCGCGTACCT 

Sequence 1549 

t 1 1 1 1 n u n 1 1 1 1 1 1 1 n 1 1 n nmtattttaaagtattgcactgcagtggcatttgt 

CTTGATGGCTGAGTGTTTGGGCACCTCTTGAGATTTCCACCCTAGTGCCAGCTCTGTCTG 

GACGTCCATGGAGGACCTGCCCG 

Sequence 1550 

AGGTACCAATTCTTCTTTGAATGTCTAATTGMTTCAGCTGTGAATCCTTCTGGTTTTAG 
CCCTTTTTTGTTGTTGGCMCTTTCTTTTTATAACCM 

GGGCTGTTCATGGTTTCTATTTCTTACTGATGTAAGACGGTTGTATATTTTGAAGGATCT 

ATCAATTTTCTCTAGATTTCCTAGTCTGTATGTGTAAAGGTGTTCACAGTAGCCTTAAAT 

AATTTTTTATAATTTTTGTGGTATTGGTTGTAATATGTCCAGTTT^ 

TTATTTAGATTTCTCTTTTCCTTTTCT 

CTTTCAAAAAAATAGCTTTTGGTTMTC^ 

CGTACCTGCCCGGGCGGCCGCCCGGGCAGGACTCAAGGCCCTAATCTCAATACCATCACA 

TTGGTGGTTAGGGCTGAACATTGGGAAATNGGGGCAAGACACAAACATTCAATCCAAAGC 

AAGCATGTCCCCACTGGTNGGGACTCCAATCCCCTTTTGGCATTGAGTCATTAATTAACC 

ACTGNACCTCGGNCCGTCTANAACTAGTGGANCCCCGGGCTGCAGGAATCNATATCAGCC 

TAATCGATNCCGNCNACCTCNANGGGGGGGCCCGNNCCCACCTTTTGG 

Sequence 1551 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGGAGTCAGTCTCAGTCAGGACACAGC 

ATGGACATGAGGGTCCCCGCTCAGCTCCTGGGGCTCCTGCTACTCTGGCTCCGAGGTGCC 

AGATGTGACATCCAGATGACCCATTCTCCGTCCTCCCTGTCTGCATCTGTAGGAGACAAG 

AAGTCACCATCACTTGCCGGGGCAAAGTCAAGAGCATTAGCAAACTATTTGAATTGGTAT 

CAAACAGAATTCTAGGNAAAAGCCCCTTAAGCTCCTTGATCTATGCTNCATCCAGGTTTG 

CAAGGTGGGGTCCCA 

Sequence 1552 

CCGCGGTGGCGGCCGAGGTAC I I I I I I I I IGTTTGI I II I I II I II I CACCTTAAGGGAG 
GATTTAATTTG CTCCCAACTGATTG TCACTTAAATG AAAATTTAAAAATGAATAAAAAGA 
CATACTTCTCAGCTGCAAATATTATGGAGAATTGGGGCACCCACAGGAATGAAGAGAAGA 
AAAGCAGCTCCCTMCTTCAAAACCATTTTGGTACCTGCCCGnnnnnnnnnnnnnnnnnn 
nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 
nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCGCGCTTGGGCG 
TAATCATGGTTCATAGCNTGTTTCCTGTGTGAAAATTGTTATCCCGCTCACAATTTC 
Sequence 1553 

NACCTGTCCGTGCCAGCTGCATTAATGGAATCGGCCCAACCGCCGCGGGGNAGAGGCCGG 
TTTTTGCCGTATTTGGGCCGCTTCTTCCGCCTTCCTTCGCTCACTTGACTTCGCTT 
Sequence 1554 

CACTACTTAGGGGGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATTGGT 

GATCGGAGTATAGTTGGAGCGCTTTGTCATGATTTCCAGGTTGGCTTTGTCCACAGCTAT 

GTTGGCCAATGCACCTTGAGCCTCAAAGCTGGCAAATCGTCCAAATTCTTCAAGCCGCCA 

GACCGTCTCCTTCTTTGCCATATCCACATGGAAAATCTCATCACCCCGCGTACCTnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nCGCTTGGCGTAATTCATTGGTCATAGnnnnnnnnnnnnnnnnnnnnnnnTATCCGCTCA 
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CAATTTCCACACAAACATTACCGAGCCGGGAGCATTAAA 
Sequence 1555 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGCTGAAAAGTGCG 
CCCTACGAAAGGACGGTCGACTCCGGCGGGGCCCAGGTGCCAATGGGTCATGTTTGGTTA 
GAAGGTGACAATCTACAGAATTCTACAGATTCCAGGTGCTATGGACCTATTCCATATGGA 
CTAATAAGAGGACGAATCTTCTTTMGATTTTGGCTTC^ 

GTGCCAGCCCTAATGGCCACAGATTTTCTGATGATTAGTAAGCATTTATTCTTTTGACTT 
GATTATTTGTCTCCTTTTCATGTGMTTTTATT^ 

TTGCCCGGGGCGGGNCCGCTTCTANGAACTTAGGTGGGAATTCCCCCCCCCGGGGCCTGC 
CAANNGAAAl l l l I NGNATTATTCCAAAGGNCTTTTATTNCGAATTANCCCCGNTCCTNA 
ANCCCTTCTGAAAGGGGGGGGGG 
Sequence 1556 

CCGCGGTGGCGGCCGAGGTACTTTTTGTTTG I ITTTTTTTTl I CACCTTAAGGGAGGATT 

TMmGCTCCCAACTGATTGTCACTTAAATGAAAATTTAAAAATGAATAAAAAGACATA 

CTTCTCAGCTGCAAATATTATGGAGAATTGGGGCACCCACAGGAATGAAGAGAGAAAGCA 

GCTCCCCTAACTTCAAAAACCATTTTGGGTACCTGCCCGGGCGGCnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnGTTAAATTGCCGCCGCCTTGGG 

CGGTAAATCATGGGTCATAGGCTGTTTTCCTTGTGTGAAAATTGTTTATCCCGCTCACAA 

TTT 

Sequence 1557 

CCGCGGTGGCGGCCGCCCGGGCAGGTAC ITTTTTTTTTTTTTTTTT1 1 CTTTTCAATGTT 

CACAGCTACTTCTCCTTTAAGAGAAGCAGAAAGGCAGTTTTAGGTGTGGTCTGTCTTTAT 

TCCCCCGAACTTGATCAATGTATACAATGCTTACACATTCAAATATGTGCAAGATTCAAC 

AAGGAACACTCTCCACTGCTGTGCTTGAAGTACCTCGGCCCGCTCTAGAACTAGTGnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCGCGCTTGGCGTAATCATG 

GGTCATAGCTTGTTTCCTGTGTGAAAATTGTTAATCCCGCTTCACMTTTNCACCACAAA 

CATACCGAG 

Sequence 1 558 

CTATCACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGTCC 

CAGCTCCTGGCTGACTTCTAGTCTTCTGGTTGAAGCTGCGCCTTTAGATGACACGACCCT 

ACCCACCCCTGTTTCCAGCGGATGCCCGGGCCTGGAGCCCACAGAATTCTTCCAGTCCCT 

GGGTGGGGACGGAGAAAGGAACCGTTTCAGATTGAGATGGCCCATGGGCACCACCACGCT 

CGCCTTNAAGTTCCAGCATGGAGTGATTGCAGCAGTGGATTCTCGGGCCTCAGCTGGGTC 

CTACATTANTGCCTTACGGGTGMCAAGGTGATTGAAGATTAACCCTTACCTGCTTGGCA 

CCATGTCTGGCCTGTGCAGCAGACTGTTCAAGTACCTGCCCGGGCCGGCCCGCTCTAGAA 

CTAGTGGGATCCCCCGGGCTGCAAGGAATTCGAT 

Sequence 1559 

CCGCGGTGGCGGCCGAGGTACCCATTCCCTTGATGTCTACAATATCACCTTTCTTATAGA 

TTCGCATATATGTGGCCAAAGGAACAACTCCATGTTTTCTAAAGGCCTAGAGAACATATA 

TCGGGTGCCTCTCCTCTTTCCCTTTGTGTTCGTCATTTTGGCGAATTACTGGAAAGATGG 

CGGTTCCCGGCCCGCTCTAGAAACTAAGTGGGATCCCCCGGGCTGCAGnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnAGTGAGGGTTAATTGCGCCGCTTGGCGTAAATCATGGTCATTAGCTTGTTTCCCTGTG 

GTGAAAATTNGTTATCCCGCTTCACAAATTCCACACCAACCATTACCGAAGCCCGGGGAG 

CATTAAAAGTNGTTAAAAGCCCTGGGGGTGCCC 

Sequence 1560 

TANGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTCAATGATAAAACCATTGA 
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TGAGGAACTAGAACGGGACAAGAGGGTCACTTGGATTGTGGAGTTCTTTGCCAATTGGTC 

TAATGACTGCCAATCATTTGCCCCTATCTATGCTGACCTCTCCCTTAAATACAACTGTAC 

CTGCCCGGGCGGCCCGCTCTAGAACTAGTGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnhnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnCGCGCTTGGCGTAATCATTGGTCATAGCTGTTTCCTGTGTnnnnn 

nnnnnnnnnnnnnnnnnnnnnCACACAANCATACCGAANCCGGGAGCCATAAAGTGTAAA 

GCCCTGGGGTGCC 

Sequence 1561 

AGGTACAAAAAATATTCATATAAACGTTGTTCACACGTAGGTCCTAGATTACCAGCTTCT 
ATGC AAAAAAAGG AAATGAAGAAAAATAG ATTTG TTAACTAGTATTG GAAAC T AACTTTG 
TGCCTGGCTTAAAACCTCCCTCACGCTCGTCTGTCCCACACAAATGTTTAAGAAGTCACT 
TGCAATGTACCTGCCCGGGCGGCCCGCTCTAGAACTAGTGnnnnnnnnnnnnnnnnnnnn 
nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 
nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 
nnnnGAAATTGTTATCNCGCTCACAATTTCCACACAACATACCGAAGCCGGGGAGCCATA ■ 
A 

Sequence 1562 

CCGCGGTGGCGGCCGAGGTACAAAACTTTTTAAATTTTATATTATTTTGAM 

TGGGTTTGTGGCACCCTGGCCACCCCATCTGGCTGTGACAGCCTCTGCAGTCCGTGGGCT 

GGCAGTTTGTTGATCTTTTMGTTTCCTTCCCTACCCAGTCCCCATTTTCTGGTAAAGGT 

TTCTAGGAGGTCTGTTAAGGTGTACCTGCCCGGGCGGCnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCGTCNACCTCTTAGGGGGGG 

CCCGGTACCCANCTTTTTGTTCCCTTTTAGTGAGGGTTAANTGCCGCCGCCTTGGCCGAT 

AATTCATGGGTCATAGCCTGTTTTCCTNGTGNGGAAAATTGTTTATNCCGCNTTAACAAN 

TTTCCACCACAAA 

Sequence 1563 

CCGCGGTGGCGGCCGAGGTACTATTTACAAAGCATGAATTCACTACAATGCTCAACTGTT 
TGTTTAGCTTTATCTCACTTGGGGAATTTATTCCTGTCTGCTGCATTGTAGGTAGCTGGG 
TAGGATATATTTCCACTTGC I I I I I AAATTAAGTTCTTCACCTCCATTGACACCTCGGTT 
TTTTNGGTTTTCTCCCTATAGTGTTGGGTTGGGTGCTAAGACACCAGTCTGACCCCCAGA 
AATGGGGAGGTTATTTCNATCCATCTTTTCCTCCATCCTTTCCAAAAACCACATATCTAC 
ACAAGGGAAAAATTTAATACATCTAGGAA \ I I I I I I i I I I AATTAC AAGCTATTTAAAG A 
GATGAATGTGGCCAAAGTTTTACA 
Sequence 1564 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAACTGAGCACATAGC 

ACCAGAATATTGGMCAGATGCTTCTCCAAAGCCAAGGGTTGCCTTACATTTAGAGTGGG 

AAAAGAGGAAAATGAGTTAGCAGAGGAGCAGCGGGGAGGGGGGTAAATTACAGGACATAA 

TTTGCTTCTAAAAATTTTGTGTTCGTGTTGM 

TGCCCGGGCGGC 

Sequence 1565 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGGAGCATTATAATATTGCTnnnn 

nnnnnnnnnnnnnnnnnannnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnTCGGTCAGGGACCCCGGATTCCCGGGTAGATGCCCAGTTAAT 

GAGCAGCTTAGGAGACTGTCCTGGTTTCTGCTGGTACCTGCCCG 

Sequence 1566 

CGACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGG 
GGGTTCCCAGGATGGTGATCCGTGTATATATTGCATCTTCCTCTGGCTCTACAGCGGGAA 
GAGATGGATCAAGATGGTGAAATAGAAGACTCCACCAATCATCCCCTTCACACGAACACC 
AATTTAACAACTATCTACATGCACACACACAAAAGCTTCATCAATTTAACAACTATCTAC 
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ATGCACACACACAAAACCTTTGTAAGAACTCAAAATCAGATTAAGAAGAAACAACAAGAT 
GTGCTTGGTTTCCTAGAAGCCMCAAAATAGGATTTGAAGAAAAAGATATTGCAGCCAAT 
G AAGAGAATC GG AAGTGGATGAGAGAAAATGTACCT 
Sequence 1567 

CTATACGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCGTTTTTGTTGGTTTTT 

TTCTCCCTCAAAGTGAAATGNNAGACTATTGTAGGGTTTTGAGCCAAGGAGTGATATTAT 

CTGAATTCTGTTTTAAAAGGNTCATTNTGGCTCCTGTGTGGAGGCAGGGTAGCTACTTAN 

AAACCAAGCCATCTCAGCCTCCCGATTTGTAAGAGATGTGGTCTCACTATGTTGCCCAGA 

CTGGTCTTGAACTCCTGGCCTCATGTGATCCTCCCACCTTGGCCTNCCAAAATACTAAAA 

TGATAGATATGAGCCGCCACACCTGGCCATMGCTTCAGCCTTGAGCAGATACTTTCTAA 

ACACTTTATATAGGGAAAAAAAACTMTTCTTTGCGTAGCCCAAGGTGTTCATTTT^ 

GAGAATAACCGTAGCCATANGTGTTTACAATTACGTGGGTGGTAATTT 

Sequence 1568 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAAGGAGAAATAGAACATGT 

GCAGGCAAAAGAAGAAAGGCGCGGGCTGGGTGGGAAGAGGATTCGGACTCGTCACACTGC 

AGAGCAGCAGAGCGAGAAAGGATGAGAAGAGGCAGAGAAGGCGACGGCAGAAGAAAAAAG 

GAAAAACTGCGGCCCGNGATCANAGCCTGAAAGTGTCCCGCGTACCTnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnTNnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCGCTT 

GGCGTAATCATGGTTCATTAGCTNTTTCCTGTGTGAAATTGTTTATTCCCGCT 

Sequence 1569 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGGAGCGC 

TTTAGGCTGGCCGGCGGCGCTGGGAGGTGGAGTCGTTGCTGTTGCTGTTTGTNNAGCCTG 
TGGCGCGGCTT 

Sequence 1570 

CGAGGTACCATCATGGCTTGAGTGCTCTGAAGCCCATCCGGACTACTTCCAAACACCAGC 

ACCCAGTGGACMTGCTGGGCTTTTTTCCTGTATGACTTTTTCGTGACTTTCTTCTCTGG 

CCCGTGTGGCCACAAGAAGGGGGAGCTCTCAATGGAAGACGTGTGGTCTCTGTCCAAGCA 

CGAGTCTTCTGACGTGAACTGCAGAAGACTAGAGAGACTGTGGCAAGAAGAGCTGAATGA 

AGTTGGGCCAGACGCTGCTTCCCTGCGAAGGGTTGTGTGGACCTTCTGCCGCACCAGGCT 

CATCCTGTCCATCGTGTGCCTGATGATCACGCAGCTGGCTGGCTTCAGTGGACCACCTTC 

CTTTGGAGACTGCTCCATCAGTGCCGAGGTGTGTGGGAACAGGCTTCACTGCACCGCCAT 

CTTACTGAGTTGCTTCACGTGAGGAAAAGGGGGCTTTGGCCCTGTGACTCAGTTCCACAT 

TTTGGATTGCATACTGGAAAAGAACCAATCTTCTTGCTAGTAAACCAGCAACCCGGCTGT 

ATACAGTGGTGACCCAAGCAATGGATATAAACCTAAAAATCTGANGGAAGGGAGAGGTNG 

AATACAGTANTTCTTGGAATCTGMGTCTNCTATTTGATCAAGGTTATTTCCTG 

Sequence 1571 

CGCGGTGGCGGCCGCCCGGGCAGGTACATGATACATGACCCCAGGTTCCAGTGTAGAACC 

TGAGTCCCCCATTCCCCAAAGCCATCCCTGCATTGATATGTCTTGACTCTCCTGTCTACT 

TTTGCACACACCCTTAATTTTTMTTGGTTTTCTT^ 

GTCAGGGGACCAAGCTGGAGATCAAACGAACTGTGGCTGCACCATCTGTCTTCATCTTCC 
CGCCATCTGATGAGCAGTTGAAATCTGGAACTGCCTCTGTTGTGTGCCTGCTGAATAACT 
TCTATCCCAGAGAGAACAAAGTACCTTGGCCGCTCTA 
Sequence 1572 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGAGTTGCTACGCTGCTn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTGCCAGTGGCCCTGTTGGATGCATCATA 

GATGAGGAGCCTGGGAGCCTGGCCAGGTTTCTGTTGGTACCTGCCCGnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 
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nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCGCT 
TGGCGTAAATCATGGGTCATAGCTGTTTCCTGTGGTGAAAAATTTGTTTATC 
Sequence 1573 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCACTCTCCTAGATTAGGAGTTT 

AGGTTGCCCCAATACCCATGCAGAGAACTTGGAGCTGAAAGGTTCCACAGCTTCAAACCT 

AGGCATACTTGTGAATGCGTGGTGGCCACCCATTGATTATCCCTTGGTGCTAGTGCTTGT 

ACCTGCCCG 

Sequence 1574 

GCGAATTGGAGCTCCCCGCGGTGGGGGCCGAGGTACGCGGGGAAGAGAGAGGGAGAGAGA 

GAGAAAGAAGAGCANGAAAGATCCCGAAAGGAGGAAGAGGTGGCGAAAAATCAACTGCCC 

TGCTGGATTTGTCTTTCTCAGCACCTTGGCGAAGCCTTGGGTTTCTTTCTTAAAGGACTG 

ATTTTTAGAACTCCACATTTGAGAA 

Sequence 1575 

CCGCGGTGGCGGCCGAGGTACANGGAGATGTGGAACCTGGTTCCAGGCAAACCAACGCTT 

CCMCTCCAAAGAGTTGGGGGTTGTTAGAGAGCCCTTTCCCAGAAAGCCCGACACTCATG 

TCTTTAGTCTGGCAGCTGCACTAGTCACTTTCAACTGGCTGACAGGTGCCCAGTATTTAG 

CCCCCAAATTCTAAAGGAAAAAGTTNGGACAAGAATAGCAAGTGAAAGGGGATCCGATGG 

TACCTTGCCCGGGCGGCCCGCTCTAGAACTAGTGnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnAGCTTATCGATACCGTNCGACCTCGAGnnnnnnnnnnnnnnnnnnnnnnnnTGG 

TTCCCTTTAAGTGAGGGGTTAMTTTGCCGCGCTTGGCCGTAAATCATGGGTCAATAAGC 

TGTTTTCCTGGTGTGAAATTGT 

Sequence 1 576 

AGGTACTGGCTGTCCACCCCTGAGCCCATGCCCATGTCAATGGCACCCATCACGGGGGAA 

AACATAAGACCATTTATTAGTAGGTGTGCTGTGTGTGAGGCGCCTGCCATGGTGATGGCC 

GTGCACAGCCAGACCATTCAGATCCCACCGTGCCCCAGCGGGTGGGTCCTCGCTTGTGGG 

ATCGGCTACTCTTTTGTGATGCACACCAGCGCTGGTGCAGAAGGCTCTGGCCAAGCCCTG 

GCGTCCCCCGGCTCCTGCCTGGAGGAGTTTAGAAGTGCGCCATTCATCGAGTGTCACGGC 

CCGCTCTAGAAACTAGTGGGATCCCCCGGGCCTGCAGGAAATTTCCGATATCAAGCTTAT 

CGATACCCGTTCGACCTCGAGGGGGGGGGGCCCCGGTTACCCMGCTTTTGTTT 

Sequence 1577 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGGTGCACACCCCCACTCC 

CACGGTGTGGGTAGTGCCGTGAATTGGAAACCGCAGATACCCTGGAGGCCTGGTAGGCAG 

CTGACTGCCCAGCACTGGGCAGACTGATAGTCCGTAAGTCTGATGGCAATGACNACCTAT 

AGTTCTGAGGAAAAGGAAGGTGGAACTCAAAATTTCCTTGTGCTTGGACTTGAGAAAACT 

GNAATCTTGAAGGTTTGGCCGAAGTT 

Sequence 1578 

ATCGACGCTCAAAGTCAGAAGGTGGCCGAAACCCGACAGGGACTATAAAGATACCAGnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnCGCCTTTCTTCCCTTCGGGGAAAGCCGTGGCGCCTTTCTCATTAGCTCACCG 

CTTGTAAGGGTATTCTCAGTTTCGGTGTTANGTTCNGTTCCGCTTCCAAGGCCTGGGCCT 

TGTGTTGCACGAAACCCCCCCGTTC 

Sequence 1579 

TCATACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGCGGCCCCAGAG 

CTCCTGCCAGTCTGGGCGACAGCTAGCTCTCCGCCTCTTCAGCAAAGTTCGGCCCCCTGT 

TGGGGGTATCTCTGTTAAGGAGCATTTTGAGGTAAATGTGGTGCCTCTCACCATCCAGCT 

GACACACCAGTTCTTCCACAGAATGATGGGCTTTTTCTTTCCTGGCCGAAGTGTGGAAGA 

TGATGAAGTTGGTGATGAAGAGGATAAGTCCAAACTGGTGACTACTTGGAATACCAGTGG 

TGAAGCCTCGGCAGCTGATTGCAACAAGATGATGCAGTACCTTGCCCGGGCCGGCCGCTC 

TAGAAACTAGTGGGATCCCCCGGGGCCTGCAGGGAAATTCGATATCAAGCCTTATCGATA 
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CCCGTTCNACCTCGAGG 
Sequence 1580 

TCGCACTCATTTACCCGGAGACAGGGAGGGGCTCTTCTGCGTGTAGTGGTTGTGCAGACC 

CTCATGCATCACGGAGCATGAGAAGACGTTCCCCTGCCGCCACCTGCTCTTGTCCACGGT 

GAGCTTGCTGTAGAGGAAGAAGGAGCTGTCGGAGTCCAGCACGGGAGGCGTGGTCTTGTA 

GTTGTTCTCCGGCTGCCCATTGCTCTCCCACTCCACGGCGCCCGCGTACCTnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnNG 

ACCTCGAGGGGGGGCCCGGTTACCCAGCTTTTTGTTCC^ 

CCGCCGCTTGGCCGTAAATCATGGTCATAGCTGTTTT 

Sequence 1581 

CTATCATACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAGG 

AAAGAGAGAGCCTTCTGGTGAAAATCAAAGTCCTGGAGCAAGACAAGGCAAGGCTGCAGA 

GGCTGGAGGATGAGCTGAATCGTGCAAAATCAACTCTAGAGGCAGAAACCAGGGTGAAAC 

AGCGCCTGGAGTGTGAGAAACAGCANATTTCAGAATGACCTTGANTCAGTTGGAAGACTC 

AATTATTCCCGCAAGGGAGGAAGGCTACTAGGAAGNATAGAATCGGAAAGAGAAAAGAGT 

GAGAGAAGAGAAGAACAGTCTTAGGAG7TGAGATCGAAAAGACTCCAAGCTGGNGATCCA 

AGMGAATTTGTAGNAGAGGTGCAGGCGTTTANGCTTGGAGGGATTTCTTCCCCAGGGAG 

GACCCCCAGACCNCCANTTTTTGNAAAACCAGGAAA 

Sequence 1582 

TNCTATAGGG CGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC l rTTTTTTTTTTTTl 
rTTTTTTTn IGGTTTGAGGGGGAATGCTGGAGATTGTAATGGGTATGGANACATGTCAT 
ATAAGTAATGCTA 
Sequence 1583 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACANGCCTCTGGATTCACTTT 

TGATAATTATGTCATGAATnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTC 

GGCAATCAGTGGTAGTGGCGGTGGCACACACTACGCAGACTCCGTGAAGGGCCGGTCCAC 

CATCTCCAAGAGACAACTCCAAAGAACACACTATATTTGCAAATGAATAGCCTGAGAGCC 

GACGACAnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnr^TTTGNCGCCGCTTTGGCGTTAATCATGGGTCATAAGCNTGTTTTCCTGTG 

TGAAATTTGTTTATCCCGCTCACAAATTTTNCACACCAACATACCG 

Sequence 1584 

NGGAGCTCCCCG N NGTGGCGGCCGCCCGG NCAGGTACACGCCCAATGAGATCTGCGTCAT 

CCAGAGAGGAATGCGGTTCAGCATAGATGTCTTTGAGGAGACCAGGGGCTACATCTTGGA 

GGTCTATGGTGTCCACTTTGAGTTACCTGACCTTGGACCAATTGGGGCCAATGGCTTGGC 

CMTCCTCGTGATTTCTTGATACCCATTGCCTGGGTATGAGGGATCGCCAAGTTACCTTn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnCNACCTCNAGGGGGGGGGCCCGGTACCCCAAGCTTTTTTGTTCCCTTTTAGTTG 

AGGGGTTTAAATTTTGCCGCCGCCTTGNGCCGTNAAATCATGGGTTCATTAAGCCTTGGT 
TTTCCC 

Sequence 1585 

GACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGAGCGGCCGCCCGGGCAGGTACC 

TATGATTGAAMCAGTAGTTGGTCTATGACTTTTGAGGAGAGGGAGAACCGAAGATTACA 

GGAGGCCAGCATGAGGTTGGAACAAGAGAATGATGACCTTGCCCATGAACTAGTAACAAG 

CAAAATTGCTCTACGGAATGACTTGGATCAGGCAGAAGACAAGGCAGATGTGTTGAATAA 

AGAGCTCCTTTTGACCAAACAGAGGCTGGTGGAGACTGAAGAGGAGAAGAGGAAGCAAGA 

GGAAGAGACTGCCCAGCTAAAAAGAAGTCTTCAGGAAACAGCTAGAGAAGGCAGAATATG 

G AAAT AAAG AAAG ACTACAGCTATC ATTG CTTG AAGTTATAAAC AG G ATCTTGTTCCGC A 

GTTTGAAGTACCTCGGCCCGCTTCTAGAAACCTAGTGGGATCC 
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Sequence 1586 

CTACTATAGGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATTTTATTTTTCT 

CAAGAAATACGTTAAGTCCAACTGCTGAACATACATCTTGAATCTCTGTAGCTGTAGGAT 

TTTCAACAGCCTTACTTATGGGGATTCGCCTTCCCTCTGCGATGGTCTTCTTATTATTTA 

AATAAGCAGGATAGATACAAATAAACCTGTCCTGGTCGGCCCGCTCTAGAACTAGTGGGA 

TCCCCCGGGCTGCAnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCGCTTGGCGTAATCA 

TGGGTCATAGCTGTTTTCCTGTGTGGAAATTGTTATCCCGCTCACCAATTTCCANCACCA 

ACCATTACCGAAGCCCGGGAAGCCATTAAAAGTGTAAAAGCCTGGGG 

Sequence 1587 

CCGCGGTGGCGGCCGAGGTACTTTTTTTGTTTG I I I I I I I I I I CACCTTAAGGGAGGATT 

TAATTTGCTCCCMCTGATTGTCACTTAMTGAAAATTTAAAAATGMTAAAMGACAT^ 

CTTCTCAGCTGCAAGTATTATGGAGAATTGGGGCACCCACAGGAATGAAGAGAGAAAGCA 

GCTCCCTAACTTCAAAAACCATTTTGGTACCTGCCCGnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnhGTTAATTGCGCCGCTTTGGCCGTAA 

TCATGGTCATAGNCTGTTTTCCTGTGTGAAAATTGT 

Sequence 1588 

CCGCGGTGGCGGCCGAGGTACTGATTTCAAAACGTGGACGTTCGGCCAAGGGACCAAGGT 

GGAGATCAAACGAACTGTGGCTGCACCATCTGTCTTCATCTTCCCGCCATCTGATGAGCA 

GTTGAAATCTGGAACTGCCTCTGTTGTGTGCCTGCTGAATAACTTCTATCCCAGAGAGGC 

CAAAGTNCCTGCCCGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnCGCTTGGCCGTAATCATGGTCATAGCnnnnnnnnnnn 

nnnnnnnnnnnnnnnCGCTCACAAATTCCACACCAACATTACGAGCCGGGGAGCATTAAA 

GTGTAAAAGCCCTG 

Sequence 1589 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGNCCGGGCAGGTACTATCTGTATCA 
TTCTGCAAACCTTTTNGGNNIAATTNNNNCCCCCCTCAACMTTTCTGGGGGATNCTNTTT 
NCTGGTTAAATACATGTGG 
Sequence 1590 

• AGGTACCTAATGAAACCCGTCTTTGGCCTGTGGCTGAGACGCCATCTGTAGGCGGTGAAA 
TCTTTCCAGCCTATGTGACGAGGGTCATGCCACAGGGTGCGCACCTGAAGGAAAAGGAAA 
AGGAAAGGTTGAAACACAGCTCCTTCAGGTGACAGAACAGATGTCCCATGGTGACCAAGG 
ACCCCAGCCTCCATTGACTGTAAGAAGATGAACCACAGGCAACMTTAAGATCAACTTTC 
AAATCCCTCCCTTGTCATAGACTCTCCTAAGCCCGTTCAAGAAACCTCTGCTTCTCTGAC 
ACACGGAAAGAATGCTGGGGAAGAGGGTGAGTTTGTTTTTCATTAAAAGA 
Sequence 1591 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACACTTTTGGTCAGG 

GGACCAAGCTGGAGATCAAACGAACTGTGGCTGCACCATCTGTCTTCATCTTCCCGCCAT 

CTGATGAGCAGTTGAAATCTGGAACTGCCTCTGTTGTGTGCCTGCTGAATAACTTCTATC 

CCAGAGAGGCCAAAGTACCTCGGCCCGCTCTAGAACTAnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCGTAATCATGGTCATAAGCTGTTTC 

CTGTGTGNAAATTGTTATCCCGCTCACAATTCCACACCAACATTACCGAAGCCCGGGNAG 

CATTAAAAGTGTAAAAGCCTGGGGGTGCCCTAATG 

Sequence 1592 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTACNTGCNCATCAATGT 
ATGATGATGACTATTTGGCTTAAATGGGATACAATATTCGACCACGCTCAAGAACCTTNA 
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G AAGCTTGACTG AAATTTCTAAN ACTGATATTTCTTACTGAAATGTG N CCTTTGGGTTTN 

AANCTGAGGTTNCTATNGTTTGCTTNCNACAGACATATGTTTGACATACAAAATATGGTA 

AAGATCCACTACTTNTGCAGAGGTATATTGGCCATATTATTCCCTTATNGGCCTGGATTN 

CCTCATCAATAGCACAATTCATTCAAA 

Sequence 1593 

CCGGGCAGGTACTACAATGAGACAGTTGGATCCATTTTTCCAGGGTTAAGCCTGCAATAC 

CTGAAGCTCCAGACCTGAAACCCAGAGCTCCTGCCAGCTGCTATGCCTGTGAACCTGTGG 
AAGACCTGAAGTACCT 

Sequence 1594 

TCCTGTGTGAAATTGTTATTCCNGCTCACAAATTCCACACAAACATTACCGNAGCCCGGA 

GGAGGCAATAAAAGNTGTTAAAAGCTCTGGGGGGTGCNCTAATTGAAGTTGAAGCCTAAA 

CATCACCATTTAAATTTGCCGTTTTNCGACTTCACCTTGCCCNGCCTTTTTCNC 

CCGTNGTAAAAACCCNTNNTTCCGTTGGCCCCAGGGCCNTTGCCAATTTTAAAATT 
Sequence 1595 

AGGTACCTCATTACTGCTCCCCATGTGGCCTCCTTACCCAGGAAAATTATGAAAGTCTGA 

ACTCAGTCTCTTTNGACACCATCTCAACAGGAGGGGAAGGGGCACCTCATTATTGCCTGG 

CAAGGGAGGAAGTCTAGGTTCCTCACTCAGTCTTTGCTGGTGACACAGGATTCTTTGGGA 

GGCCACTTCACCAGGCTCCACCCATTCAGCCCAGCAGGCCGTGCTTGGTTTGCGCTACTG 

GCCTGGATCCCATATCCTGGGTAAAATGGTGAAGGGTGTGTGTGCAAGTGGGTGTGAGGT 

CTGACCACAGTGCACAGCCNGNCACACAGGCTGCTTGCAGTTGGGGCTGGGCAGGCAGCT 

CCAGATGCCAGCACAGATGCCTCCTTCATGTTGAAAGCNTGCCNGCTGGACTAGGC 
Sequence 1596 

ATTGGAGCTCCACCGCGGTGGCNGGCCGANGTACAGCGGGGGGGCGGGGGGATAGCTGTC 
CAAGGTCTCCCCCAGCACTGAGGAGCTCGCCTGCTGNCNTCTTGCNCNCNGGAAGCAGCA 
CC AAGTTCAC GGCCAACTGCCTTG GC ACTAN GGTCC AAN AATG GCTA 
Sequence 1597 

CCGGGCAGGTACTGGGCGCAGATGGGCGACCGTCACAACACTACGGTCCACTGGACATGT 

CTGTTGCTGAGAGAACCACTGGGTGATGCAGGCGTTGCANAAAGCATGTTCACAATGAGG 

TGCCTGGTAATCGTGAGCTTCTCCTCATACTTGCCCCAGGATGATACAACTGACTCTGGT 

CTTGGCAAGGNTGAACCT 

Sequence 1598 

CCGGGCAG GTACGCGGGCTACAAGCATAAGGCATCATGCCTGGCTAATTTTTTTACATGT 
TTTTTTTTTGTCGAGATTATGGTATACACTATGTTGCTCTGGCTGATCTCAAATTCCTGA 
CCTCAAGTGGATCTTTCNTGNCACATN NCTTCCTAAAAAG NTGTCC ANGGGATTTANCAT 
TGCNATNAATGGCCAACCCAATTGCCCTTAANCCGATAANATGGTATTTTTATCMNTTT 
AATTGTTA TTC AAAT AN NTTCTTTATANTNTTNTTAAN NGTAAG NCCTCACTTNTTAATN 
TTNGNATC » till I NTGGGCNCCTTAAATAATN ATATC NTTC N AAATTAAATTATAANTA 
ATTTCCTTNTTCGTAAANAACCTTTTTA I ( I I I I I I I I CGAATCCAAAAANAATTATNTT 
TNGTG GGGNGGCTMTTGGAATTTNTTATTTNTACGTAAGNTAAMGGGGGGTGTGGTTT 
TTTTTGGAAAATAACCATATTNTANCTTTATAATTAAAAATGG 
Sequence 1599 

AGGTACATGTGTAGAAAGCAGTGGGAACTGTGATTGGATTTGGCAACATGTCAGCTTTAT 
AGTTGCCGATTAGTGATATGGGTCTGATTTCGATCTCTTCCTGATGTAAACCATGCTCAC 

CCATATCCCACTATACAAATGCAAATGGTTGCCTGGTTCCATTTATGCAAGGGAGCCAGT 

ACCTGCCCG 

Sequence 1600 

TATCACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGGAGGCCAGT 

TATTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnhnnnnnnnnnnnnnnnnnnnnnnnnnnnnGCCTGGGATACCAGTGGCCCTGG 
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TGGATGCACCATAGATGAGGAGCCTGGGAGCCTGGCCAGGTTTCTGCTGGTACCTGCCCG 

0 

Sequence 1601 

AGGTACGCGGGAGATGAAGGTGCGCATCCTCTCCGAGCGTAAGAAGCCTCTGGACATTGA 

CTACATGGGGGAGGAACAGCTCCGGGAGAAAGCCCAGGAGCTGTCGGACTGGATCCACCA 

GACTGGAAGNTCTGANAAAGTTCGACCCTNGATGGGCGAAAGCTTGAAAACAGCNNGNAA 

ATATTGNAANATCAAACGGTNCCTGGTACCCTGCCCCNNGGCCGTGCCCGCTTTTTAANA 

AAACCTTANNNNNGGNANTTCCCCCCNCNCGNGGNCTTTGNCCACTGNGAAANTTNCNNA 

ATTANATTCTNAAAGCCTTTTAAATNCCGAAATTNCCCCGGTNCNTACCCCNCTTCGCTA 

NGNGGGGGGGGGGNGCC 

Sequence 1602 

GACGGAAAGCTTAGATATGCCAACAACAGCAATTACAAAAATGATGTCATGATCAGAAAA 

GAGGCTTATGTGCACAAGAGTGTAATGGAAGAACTGAAGAGAATTATTGATGACAGTGAA 

ATTACAAAAGAAGATGATGCTTTGTGGCCTCCCCCTGGATAAGGGGTTGGGCCCGACAGG 

G AG N CTTG AAATTGTAATTTGG GG G ATG AAGC AC ATATCTTTTTACCC ACCATC AAAAAA 

ATTAGGGTNTCTCTTTATTGGATGGTAAAATCCAGTCCAAAGGGATTCCTGGAAGGCCCT 

TCGNAGGTATTTTTTACTTATTTTGGGTTACCCTCGNGGCNCGGCTTCTTAGAAACCTAG 

GTGGGATTCCCCCCCGGGGCCTGGCAAGGGGAATTTCCGAATANTTNMGGCTTTATCCG 

AATACCCGTTCGMCCCTCNAAGGGGGGGGGGGCCCCCGGGTTACCCCCNAGCCTTTTTT 

GGT 

Sequence 1603 

ACTATAGGGCGAATTGGAGCTTCNACCGCGGGGGCGGCCGCCCGGNCAGGGTACTCCTTC 

GTNATACCCATCTGGGCAGTCCAAGACACCATTGCACAGCTGGGATAAATGAACACATTT 

GTTGGTACNCGGGATGATAATGTCCCGATCACCTTCGGCCAAGGGACACGACTGGAGATT 

AAACGAACTGTGGCTGCACCATCTGTCTTCATCTTCCCGCCATCTGATGAGCAGNTGAAA 

TCTGGAACTGCCTTTGTTGTGTGCCTGCTGAATAACTTNTATCCCAGAGAGGCCAAAGTA 

CCT 

Sequence 1604 

GATCNTATAGGGCGAATTGGAGCTTCANCNCNGGGGNCGGCCGCCGGANGGNGGGTGACG 

TGCGGATCTTNTTCTTTTTGTGGCTGTGGACACCTTTCAACACTGCCTTCTTGGCCTTTA 

AAGCCTTCGCTTTGGCTTCAGCCTTAGGAGGGGCAGGAGCTTCCTTCTTCGCTTTCGGCG 

CCATCTTGTGAAAAAGCCCCGCGTTACCTCGGGCCCGCTCTAAGAACTTAGTTGGGATTC 

CCCCCCCGGGGCCTTGTCATGGAAATTTTCCNAATAATCAAAGCCTTTAATTCNAATTAA 

CCCGGTTCCNNAACNCTTCTAAANNGGGNGGGGGGGCCCNCCGNGTAACCTCCNAAGNCT 

TTTTTTGGTTTNCNCCCTCTTTAAAGATCGAAAGNGGGGTTTTAAAAl^ 

GCCTTTTGGGNCCCGG 

Sequence 1605 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAGATGGACCCGCTACGG 

AACGAGGAGGAGGTTCGAGTGAAGATCAAAGACTTGAATGAACACATTGTTTGCTGCCTA 

TGCGCCGGCTACTTCGTGGATGCCACCACCATCACAGAGTGTCTTCATACTTTCTGCAAG 

AGTTGTATTGTGAAGTACCTGCCCG 

Sequence 1606 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGACTATAGTTACGTTA 

TTTCTCACCAGTCACATACTCAAGAATTCTTTTACTCAGACCCATGACTTGACCACAGTG 

ATAGCAGAAGCTGGGGTGGGCACAAATGCTTATACACAATATTACAAAAGACGACATTCA 

CTGGCTTAGAGAMCTGATGAATTTTCATTACTTTTACATTTTTA 

TTAACAGTTAACTTTAGATAAATCTACTTACACAGTACCTGCCCG 

Sequence 1607 

GGGGGCGGCCGAGGTACCANAACTTCTATGCACACCTCCCTGAGAGTCTGGGAACCTTCA 
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CCGCTGACCTGTGTGAGATGTTCCCAGCAGGCATTTATGACACCAAATATGCTGCTGAGT 

TTCATGCCCGTTTCGTGGCCTCCTACTTAGAATATGCCTTCCGGAAATGTGAACGGGAAA 

ATGGGAAGCAGCGGGCAGCTGGCAGCCCACGCCTTACCCTGGAGTTCTGCAACTATCCTT 

CCAGCATGAGGGACCATATTGATTACCGCTGCTGCCTGCCCCCAGCAACCCACCGTNCTC 

ATCCCACCAGCATCTGTGACAACTTCTCGGCTTATGGCTGGTGCCCCCTGGGACCACAGT 

GTCCTCAGTCTCACGATATTTGACCTTATCATTGACACTGATGAGGCTGCGGCAGAGGAC 

AAGCGGCGACGG 

Sequence 1608 

TGGCCCGAGGTACGCGGGGGAGGAAGGACCTGGTAGTTTTGATGACCGCTGTCCTGTCTA 

GCAGATACTTGCACGGTTTACAGAAATTCGGTCCCTGGGTCGTGTTC 

Sequence 1609 

CGAGGTAC I I I I II i II I I I I M I GAGTTAAGTGTCTTTTTAATTGAAAAGCTAACTGTC 

CAGTTACATTTCTCCCAAAAAACCACAAACTGGGTAGTAACTGAGTCTCTAGGCAATATA 

TTTAAAACTAAGAGGATTAAAMTAMGAAAAAGAAAACAAGTCCTCAGATTGCAATGGA 

AGGGGAGCCCTGCTACATTACAGGCCTAANTCAAATACAGTGGGTTACCTGCCCCGGGGC 

CGGCCCGCTTCTAGAAAACNTAGGTGGGAATCCCCCCCGGGGNCTGCAAGGGAAATTTCG 

GATATTCAAAGCCTTTATTGGGATTACCCCGNTCCGAACCCTNNGGAAGGGGGGGGGGGC 

CCCCGGGGTACCCCCCAANGCI fTTTTI I NGTTTCCCCCCTTTTTAGGGTNGNANGGGGG 

NTTTAA 

Sequence 1610 

CGCTCACTTGCCCGCTTTCCAGTCGGGGAAAACCTGGTCGTGCCAGCTGCATTAAATGAA 
ATCGGCCAACGCCGCGGGGAGAAGGCCGGGTTTTGCGGTATTGGGGCGCTTCTTCCCGCT 
TTCCTCGCTT 
Sequence 1611 

GTGTGCCACCGCCACTACCACTGATATGCCGAGACCCACTCCAGCCCCTTNCNTGGAGTA 

NNGGCCGGNACCCAATTCATAGACATAATTTATCCANATAAGGTGAANTCTCAGTAGGGC 

TTGTTNCNCTCTGAGCCCGCTTCTNAGNNAACCNTAAGNNGGNATTNCCCCCTCCGAGAG 

TCNTGGCAANGGTAAANTTTNCNNATTANNTCCNAAAGCCCTTTTAATTCCGGAATAAAC 

NCAAATTCNGGACNCCTTTNNTAGNGGNGNGAGGGGNGNCTCCCGGGTNTAACACCCAAG 

GACATTTATTTGGATTTCCCCCCTTNTATTAN GGTTGGAAGN GGGGTNTNAAAATTTTN G 

NCCNGCCCGGCCCCTTTNGNGNCCCGTTTAAAAAANCCAAATTGGGGGTCCCAAATTAAG 

CCTTTGNTTTTATCCCCTTTGATNGGTTGNAAAAAAATTA 

Sequence .1612 

AGGCTCCCGCCCCCCTNACNGAGCATTTCACAAAAAAATCGh4AACGCTCCAAAGTTCAAG 
GAAGGTTGGGCGAAAAACCCCCGACNAGGGACCTATTAAAAAAGAATACCCCAAGGCCGT 
TTTTTCC 
Sequence 1613 

CGCTTACNCGGATTACNCTGGTCCCGCCTTNTCTCCCTTTCGGGAAAGNCGTGGGCCGCN 

TTTTCTTCATAAGTCATCAACGGCTTGTAAGGGGTATTCTTCAAGTTTCGGTTGGTTAGN 

GTTCGGTTTCCGGNTTCNCAAAAGACTGGGGGCCTTGNTGTTGGCACCGNAAACCCCCCC 

CCGNTTTCAAGCTCCCGAACNCGCCTGCNGCCCCTTTAATTCCCGGGGTNAAACCTTATT 

NCGTTTCTTTGGAGmCCCAAAACCCCCCGGGTTMNAGGGAANCAACCGNAACTTT^ 

AATTCNGNCCCAACCTTGGGGCCAAGGTCAATGNCCCAACCTTGGGGGTTAAAA 

Sequence 1614 

CCGGGCAGGTACTGGGCTCTGACCACTATTGGTTTTGAGACCACGATGTTGGGAGGGTAT 

GTTTACAGCACTCCAGCCAAAAAATACAGCACTGGCATGATTCACCTTCTCCTGCAGGTG 

ACCATTGATGGCAGGAACTACATTGTCGATGCTGGGTTTGGACCGCTCATACCAAGATGT 

GGCNAGCNCTCTGGMGTTAATTTTCTGGGGNAAAGGGATCANGCCTCAGGTTGCCCTTG 

TGGTCCCTCCCGTTTTGACCGGGAAAAGNANGAANTGGGATTTCCTGGGATATTCCTANN 
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GAACCCAAAAATTCAGNAAAGGGGGNAACCANGGTTNCCCCTTNGNGGACCCCGGCCTCC 

TTTANNANACCTTAAGGTNGGGGAATTNCNCCNCCTCCGGGGGCCCTTGGCCAAGGGGAA 

ATTITTCGGAATTATTTCAAAMNNCTTTTMNTTCNGAANTTATC 

CTCTTNCAAAGGGGGGGGGGGGGCCC 

Sequence 1615 

CGAATTGGAGCTCCACCCGCGGTGGCGGCCCGAGGTACATCCAGAAATAGATCCAAGAAA 
TGGGGTGGTTGAGTGGGTCCGCACGAAATGCTTGATTATGTCAGCAACACCCAACACTGT 
CTGTTTTC C ATTTG NTGGTTTTAATCATAAAATTGTC AAGTG ATNC N N GTT7TTAAAAAN 
AAAAAAAAAAAAAAGTACCTGCCCG 
Sequence 1616 

ATCACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTAT 

TTAAGAAAAGAACAAGGTTMCTMCTAAAAGCAGAACTCACTTATTTTrr 

CCAATTAAAMTMGTTCATTAAAAGCACTTGAAATTATAT^ 

TGCTAAAATTCCAATATAAATGTAMTATCTTTAACTTGCTTAACCCAGCTATCCCCAAA 
ACAGTGTAGTGGGGCAAAATGTTCAAAAGAAAAATCATCCAGTGCACGTAGATGGGCACC 
AAGAAGCTAAGCTTCCCTGAGCGCGTACCT 
Sequence 1617 

TATCACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCG 
GGGAGGTTAAGGGAGAAGATGGCGGCGCTAGGGGAACCCGTGCGGCTGGAGAGAGATATT 
TGTAGAGCAATTGAATTATTGGAAAAACTACAAAGGAGTGGAGAAGTACCT 
Sequence 1618 

CCGCGGTGGCGGCCGAGGTACAGGTTTGTAGCCTAGGAGCAATAGGCAATATCATGTAGC 

CCAGGTGTGTAGTAGACTGTCTCATCTAGGTTTGCGTTAGTGTGTCTATGATGTTGGCAC 

AAGGACAAAATCACCTMCAATGMTTTCTTGGCAGGTGTCTCTGTTAAGAAGACATGAC 

TCCAGTACCTGCCCG 

Sequence 1619 

ACTACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTAC rTTTTTTTTTTTTi 

H I I I I N I I I I I I I I J I I I I I AGGC MM I GGANACCTTGCCCTTGCTGGCTAANTTAT 

CCMGCAGGTTTTGATATANCTTTTATAGTAATCCACCTGCTCCCCATAAAAGGNGGCCT 

TANAGTTCAAANCANCGTATGTCTGTTGCAGTTTCACTANTTCGGCCTTTCTTTCTNTGT 

CGGTACCTTGCCCGGGCGGCCGCTTN 

Sequence 1620 

TCACTACTTAGGGCAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGGGGGCCAGTTAT 

TACACTGCTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTACCAGTGGCCCTGGTGG 

ATGCACCATAGATGAGGAGCCTGGGAGCCTGCCAGGGTTTTCTGCTGGTACCTGCCCGGG 

CGGCCCGCTCTAGAACTAGTG 

Sequence 1621 

GACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACI I I M M I M M I 
M I M M I M M { M CTGGGACATCATTCANATGTTACCAAATCAGTCACCTTAG M M I 
AATCCTCACTATCATGCTAACATNTAACACTTGGATNGCATTTTGTGACGTGTAAAGNGT 
TTCCNCCTNTCCCAGGATATTTTAATTT 
Sequence 1622 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACANGACTCCAAGTCC 

AMGCAAATACCCATGGCATGMCAAATAGGATGCCTAAGATTCTAACATTTGCTTCGGC 

ATGGCTTTCCTCCTCACTTCCCTACCTCACAACAAGAAAAAATTTCCTAGATGGGACAAT 

GGCAAGATATTGTGCTAATACATTTCATAGGATTTCTAGATGCTGGGCTATAACATCCAG 

GTTGCAATTTGGTGACGGA I MM! I ATTTCTATTGCAAATGGACTACACTCCAAACACA 

AAAAGATTCTTGCCATAGCATCCAAACATCCCTATGTAGACCCACACTTCCTCTCAATAC 
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AGAAAGAGAATCACTCAGAAAGGACAAAGCTGGCAGACCAAGAAAGGAGGCAAGGGTAAA 

CTCTGTCACTGGCCAATTNTCTAAAACAAAAAT 

Sequence 1623 

ACTAGTGGATCCCCCGGGCTTGCAGGAATTTNGATATCAAGCTTATCGATACCNnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnGCGCGCT 

TTG GCNG TAATCATGGACATTAGCTTGTTTTTCTTGTGTGAAM 

CMTTTTNCANACTATCCATAACGAGNCCCNGGGTAGGCATTAAAAGGTGGTAANTGGCC 

CT NGGG GGGTGCNCCTTAAATGTAGTTGGAGCNTAMCCTTCAMCAATTTAAATTTGGC 

CGTTTTGGCCGGCCTTCCANCTTGNCTNCCGTCNTTTTTCTCCNAGGTACNGGGGGAAAA 
AA 

Sequence 1624 

AGGTACTGCTTGTGATCCAGCTACTGAGGTCTGAACTCGAACAGACCTGGGAAGTGNCTC 
AGAGCTGTCTNTCAGGCTCTGAGAGAAAGTAGCCAACATTACAGTAGACAGAGAGAGACA 
GAGATCAAGCACAGAGACAGAGACAGCATAAGGCAAAAATACCTTTATGAAAATTGCCAA 
GTCCTCAAAGCAAATCTTANNTTTAAACTTCTTNTNAGGCNAAACCTGTNTGGAACACAN 
GTC NTTTTAATCCTTCAAGGGTTTATATAAN N ATANTAAAATAATN AAGTTACNCTGCTC 

CGGGAGTCGGGTCTCGACTTCTAAGAAACCTANGATGNGAATCACCTCNCGGNGCTTGCC 

AGTGGAATTTTCCGT 

Sequence 1625 

ACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACCAACATGTCCCGTGGT 

TCCAGCGCCGGTTTTGACCGCCACATTACCATTNTTTCACCCGAGGGTCGGNTCTACCAA 

GTAGAATATGCTTTNAAGGCTATTAACCANGGTGGCNTTACATCAGTATGCTGTCACCAG 

GGAAAGACTGTGCAGATNGTTGTCACACNNTAATTAAAGTACCTGCCCGGNCTNGACGAN 

CTATAANANGTGCAATNCGGAGCACNTTCACTGNNATGTTNCGCCGGAAGGCCTTCCTCC 

ACTGGTACCTGCCCGGGCGGCCGCTCG 

Sequence 1626 

ACTACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACCCATTCCCTTGAT 

GTCTACAATATCACCTTTCTTATAGATGCGCATATATGTGGCCAGAGGAACAACTATCAT 

GTTTTCTAAAAGGCCTANAGAACATATATCGGGTGCCTCTCCTNTTTCCNTNNGTGTNCG 

TCA I I I I GGCGAATTACTGGAAGATGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnCCCTTATNCGNTACCGGGGNNNTCCNGGGGGGGCCC 

GGTGCCCACTTTTAATGAGGGCTTAATNGNGGCGCTTNGGNGNAACGCTTNGGNCNTAAT 

CANGGTCNTGGGNTGAAATTTNGTTNTCCNATTTACAATTTCCNNACAAACATACCNAAN 

CCCGGGTGCCATTAAAAGTGTAAAAAGCCTGGGGGTGGCCTAATGAAGTGGAGCCTAACT 

TACCATTTAATTTGCGTTGGCGCTCNACTGGCCCCGCTTTTCCAAGTCCGGGAAAACCC 

Sequence 1627 

GGGCCTGAGGTTTTACTCCNNAAAAGCANAGGAGGTGGCAACCTTGGCTTTGGGGTTTTG 

GCAGGCCCANGANAGGCAGNGGAGGANAAGCTCAAAGCCGGGTTTCATGTTTCACACCAA 

AGGTCTAATTTGTGGGGNAGAAGGACCAAATCCCAGATNCCCCTTGTTTAGCACANCAAT 

TTAGG NTTCAC N AAAATTGTNNTNTTTTGGGCCAAAAAAACC ATGTTGGACC AC NCTCAT 

CCCATGGANTANATTTTGTACTTTAGATTCTCAAAGGAAAAAGA 

Sequence 1628 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACGCCTTGTTTCAAACT 

GTTCACTGTGATCTGTGGGTCTTTGAGTTTCAGTGAGTTTGCTGAAATGTCGAAGAAGTA 

GTTCCAMCTTCAATGTTCAATGAAATTTTTGTTCAAGTTTGAM 

TAAAAGGTACCTGCCCG 

Sequence 1629 

TCTAAAGGAATAAGCTTGCGGCCGCTTAATTAATTTTTAGGCACATGTAGGGTGATCCAA 
ATTGGTCTGGGGTCAGGATTTGGCCCACAAGTCTATCAGTTTTGCGTCATCTAGTTTTAG 
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TTTTGACTTTAGGTTTCTTGAAGGAAGGGACCAAGCCTGCl I I I I GTCATCATTATATCC 

TCACGGCTTAAGGTGTTAAGCCCCTGNTNCAAAAGAAGCCTTGGTTACCAGGGNAAAANG 

GCCACCNTTTGTNTTTGTCAAAAAATATGAATTGTGGTTGGGGAATTGGAAAGGGGGGAA 

GNGGGTTAAGGGCCAANAANTGGGGAATGGAAGTTG 

Sequence 1630 

CTACTAAGGGAATAGCTTGCGGCCGCTTAATTAAAGATC I ! I I I I II I I C I I I i 1 1 I AAC 

AAAATTCCTATAGAATATMCMTATTAAAGAGGGMCTTTAAGTGMCCCCCACTTTTG 

GCCATAACTTCTTTTTGTGTCAGCCAGCAAGAGCCAAGAGAACATAACCAGGGTGGTAGG 

ATGTAGCTAGATAAGCCTCTTTAAAGGCACAAAGAGGCCACTTCTTCAACCAGATTTGCC 

AAAGCCCTTGGTTCAATGACCAGATAGGCCAACACGGGTAGTAGTGGGGTCATCAAGCCC 

TAAATGCAGCTTTCTGGGCTACCTTCTAGGATGCTCTGGAGCTTGGAACTGTCCACCTGA 

AAGAGGGTTGAGGGAGGCTGGGATGAATACMTATTTGGGTCCACTGGATTTTCCTCATA 

ACCCTTAGCCCC . 

Sequence 1631 

GCCTTTTAAAGGGAATNAGCTTGCGGCCGCTTAATTAAAGATC I I I I I I I ! I M 1 1 I I I I 
TTAAGAAGCTGTAATGATAGTMCAAAGAAGTCCAACCTTTATATTGCACAAATNTTTAC 
TAACCATGTAGCATCTNTGTGCCTGACCAGCCCTGACAGATGAG 
Sequence 1632 

CCTCCTAAAGGGAATANGCTTGCGGCCGCTTAATTAAAGATCTTN I I I I I I I I 1 1 1 1 I I I 
TAAACGCACAGMCAACCTTTTGAAGGTGCCAMTTTAAAGATGGTAAAGTAACAGAGAA 
GTAAACTGTAACCTAATTGCTATGGCTTCAGAATAACCCAGTCAAAANACTTTCAACATN 
NAAAGGAAAAAGGAATTTTAAMTTMGTTGCCCCTTTGAA 

NGGGTTCTTGNAA I I III I I II ICCCTTACCl I I I I AGCNANAGGGGGNAANCAAAATTN 
TTAAGNTCCCTAAGAAATTTG NNTACCCCAGGGGGGTTGTTATTCNTNCCCCCC I HUN 
TTTCNAAAAAGATAAAATTTTTTGNCNAAGGGTTTTGGGAACCNT^ 
Sequence 1633 

CCTCCTAAGGGAATAAGCTTGCGGCCGCTTAATTAAAGATCI U I I M I I I I I I I II IGG 
AGATGGAGTTTCAAACAAAAAAAAAGAAAGCAGAATCCACCATCTTCCTCCCTGCTGCAC 
ATGCTCTTCTG 
Sequence 1634 

CCGCTTAATTAAAGATC I rTTTTTTTTTTTTTTTI I GGTTTAATATGAGGTTNCATATTT 
ATTTTGGTCATAAACATATGAGTNTTTNATATATAACCAGGTGTCATGTGCTNTTTGGCC 
CNGTNTTTAANANCAAAAACCTAAAACAANATCAACCCATTCGCNCNAAAAAACCCCAA 
Sequence 1635 

GCTAAATAACCCTCACTAAAGGGMTAAGCTTGCGGCCGCTTAATTAAAGATCTTTTTTT 

mrmTi iatactacttatcactatttaaaattatatttaggtacttatttgttatct 

GCTTCCCCTCTAGGCTTTCTGCTCTGCAAGGCCAGAGACGTTGTCTCTCTTCTTCATAGC 

TCGACCCCCACTGCGGAGAACAAGTGCTCAGCACTTGCTAACATGGCAGAAACAAAGCTG 

TCTCCCTCAGTTCTCGTGGCAGCCCTCATTCTAGTTTGACTTCCTATGAAATTACTTTTC 

CTAGTTGTCTTGAAGGTCATGTTTTGCTTAAGATGTAGCTCCACCAAGCAGACAGCCCTA 

TGTGGGGTTTGGAGGTGAGCAGTGAGAGGTGGTACCCATAAGGAGGAACATCANATCTGC 

GGGGGGCAT 

Sequence 1636 

TACTAAAGGGAATAAGCTTGCGGCCGCTTAATTAAAGATC l I I I I l I I 1 1 1 I I I I I I I I I 

ACAACTAGTGTATTGTACACTAGCTCAGCTCCACCAAACTACCTGTTCGTTCTCTTTTAT 

TTGACATTGTTCACAGACTAGTACATATTACAATAAGAGTGCTGGATAAAAACATGAGGT 

AC NG AAAGTG GTTCAAAGGATTATAGGGTC ATTGC AG N ATCATGCTN GACAGTAAAAGAA 

AATTGTGACTGAGAAAACACTAANTAAAAATACATAAAGAATGTGCATTANTAAAAAAAA 

AAAAAGGACATTNNAGATNAAGNACNNCAAAAAAAAAAACCAAACTCCATTTNCANCAGC 
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CTTGTG GCTTTNTTTGGGTNACCTAAAGTAAGGGNTTGGAACCAANTTTATCACCAAGTT 

TTTTTTAATTTCCCCCACCCCCGGAAANAAAAAAA 

Sequence 1637 

TAAAATAACCCTCACTAAAGGGAATAAGC^ 

rTTrnrm ICATTTTAGTCTTTAGTTCCTTTTTCAACTTCTTCTTTACCTCCACGGAG 

GAAGGCAGCTTGGTTTTCTCCTCATACACCTTGTGGAGAGGTTCGGGAGTTGGGAGGAGA 

CACTGNCGGNAGCAANGAG NTTATT AAGGGAAAAGTTTGGGGGGGGGCATTATTTTTGGA 

AGGGGTGGTCTCCCCA AGNTT TTTTGGCTTTTACNGTACCAAACNCTTNATTCANATNGG 

NANGGGC CAAT TCCCNATTTTGGTTCACATTGGAANTTTTAATTTTGGGNAMNC 

GGAGGNGTTTTNCCNANCCCGNGCAAAAAANNNAAGNGNCCATATAAGNGGAAGTTTNGG 

GAAATNCNCCTTGGAACC 

Sequence 1638 

CTATAGGGCG AATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTAC l rTTTTTTTT 

rTTTTTTTn 1GTAACAGGATCTTGTTTTGTCATCCAGGCTGGAGTGCAGNGGCACAGTC 

ACAACTCACTGCAGCTTCMCCACATTGGCTCAAGTGATCCTCCCACCATCAGCCTTTTT 

GGTAGGCGGGACTATAAGCACATGCCACCATGCCCAGTCAAGTTTTTAATTTCTTTGTAG 

AGACAGAGATCTCACTATGTTGCCCAAGCTGATCTCAAACTCATGAGCTCAAGTGATCCT 

CCCAT CTTG GCCCCCAAAGTGTTGGGATTACAGGCGTGAGCCACTGCTCCTGGCCTTGCT 

ACTCCTTTTATTATTCACAACCTCTAGAAATAGTATAGGAGATCCAAGCACTAGTCTCCC 

GTTGTCATGGATACAGAAAGTGGAGAAGAGAGCAGGATAAGTGGCTTGTTTGGGACTAGA 

CAAGGCTGTCACTAAATGACTCACTACTTCAGCCCTTTGCTCCTCTTGGCACC 

Sequence 1639 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGAATCATTTTGCTTCCAACCCCATTTA 

GCCTGCCATTGAAATGCAAAAGTCTGTTCCAAATAAAGCCTTGGAATTGAAGAATGAACA 

AACATTGAGAGCAGATGAGATACTCCCATCANAATCCAAACAAAAGGACTATGAAGAAAG 

TTCTTGGGATTCTGAGAGTCTCTGTGAGACTGTTTCACAGAAGGATGTGTGTTTACCCAA 

GGCTA CACAT CAAAAAGAAATAGATAAAATNAATGGAAAATTAGAAGAGTCTCCTGATAA 

TGATAGTTTTCTGAAGGCTCCCTGCAGAATGAAAGTTTC 

Sequence 1640 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGAGGAACTGCTC 

AGTTAGGACCCAGACGGAACCATGGAAGCCCCAGCGCAGCTTCTCTTCCTCCTGCTACTC 

TGGCTCCCAGA 

Sequence 1641 

CGAGCATTNACAAAAAATTCNGACGCTCAAAGTCAGGANGGTTGGCCGANAACCCCGACC 

TGGGACCTATTAAMGGATTACCCANGGGCCGTTTTCCCCCCCTTNGGAAAAGCCTTNCC 

CTTCGTTGCCGCCTTCTTCCCTTGGTTTCCCGAACCCCTTGGCCCGGCCTTTAACCCGGG 

MTTACCCCTTTGGTCCCCCGCNCCTTTTTCNTTC 

Sequence 1642 

AGGTACGCGGGGGGAAGAGGAGGAAGGAGGGCGAGCGAGGAGGATGGCGGAGTCGGGGCT 

CCTGACGGAACTCTAATGAATCATTGATTGACCAGCACTATTTTACCAGTTGGAATGAAT 

G ATCAGAAATGGGC NTAGCTGCTTTTAAATAC N ACCNTG NGN ANATGG ATGTTACTCC AT 

GCTGATTACTNCTTNNAGCCAGCTCCTNNCTNGGCGGTCCCNCTNGNNGTAGGNANnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTCATGGTC 

ATAACTTGTTTCCTTGTGTGAAAATTTGCTTNTNCCNTTCACAAATTTCACACNAACATA 

CCNANCCCGGGAAGCCATTAAAAGTTGNTAAAAGCCCTGG 

Sequence 1643 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGTCTTCATCTTC 
CCGCCATCTGATGAGCAGTTGAAATCTGGAACTGCCTCTGTTGTGTGCCTGTTGAATAAC 
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TTCTATCCCAGAGAGGCCAAAGTACCTGCCCG 
Sequence 1644 

TCACTACTTAGGGCGAATTGGAGCTCACCGCGGTGGCGGCCGAGGTACGGAAAACTCTCC 

TCATCTGGACTCGTGACGTCCTTGCAGCCCGAGTTGGCCATATCCCACTACGCCCCTGCA 

CTGGAGCCTGAAGCAGAGTGTAAGGAACGGCCAGAGAGCGCAACACTGGGGCCCACTACC 

CCGGCGCAGTGACCCGCCGCGCCCCCGCGTACCTTGCCCGnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnACCCTCGAA 

GGGGGGGGGCCCCGGTACCCAAGC I I I I I IGNT 

Sequence 1645 

AGGTACACTTTNGCACCTGCAAATCTGTTTATtMTTAGACCATAGMTCTGTGTTTCAT 

TGGTGGTCAGCTGGCCMTCTGATGGTCTTGTCATACACAGGTAAGAAGAGACAAAGAGC 

CAAGAGTTTGTTTGGAGGCCGGAGTGAGCTTGACAGAACTAGATGAAATTGGCCACTGGG 

CACTGGGACCCAGAAAGGCCTCATTGCCACAGAGTTGCAACAGAGTGTGGGGTCCCATTA 

TCAGAATCCTTTCTATGGTGCATCCAATTGGCTATGACTCTCCTAAGAACAGCGCTGCAG 

GAAAATGCAAGTTCACAGGATAATGAATGCACTGTTGCTTCTCT 

Sequence 1646 

AGGTACGCGGGATGCTCTGGTTTMTGTTTTTCAATAGTGTGATACMTTGTTCAACTTA 

TTGGCAGCTTTAGTGGTATTTGAACAACTTTGCTATGTTTGGGTAATTTTCCACCCTGTG 

AATCAGCTCCCCAAAGCAGAAGTCATAAAATCTGTTTTCCAGCCAGGGTGTGGAAACCTG 

GGCCCCTGGTATCAGGTCCACCCATGTGCAAGCTATGGCATTTGTGCCCAGTGGCAACAC 

AAAGCGATCTTCACCAGAACATTCCTCTCCTGAGGTTTCAGCACTGCTCTTGGCTACGTA 

GCCACCGCTAAGCTTCTCTCTCTGGCCTTCCTAGGAA 

Sequence 1647 

TACTCACTATAGGGCGAATTGGTAGCTCCACCTGCGGTGGCGGCCGAGGTACACGGGGAC 
CTCANAGCTGAGCTGGGCATGAGNAGATGCTCAGTAAGTGGTAGCACAGGGTTGGTCCCT 
ACGGTGGAGGCCCCCTAACACCG 
Sequence 1648 

CCGGGCAGGTACCAAAAGAATGTTCATANGCAGCATTATTTGTCATAGTCAAGAGCTGGA 

AGCAACCCAGTTGTGCATCAATGGTAGAATGGATGACTAAAtTGTAGTGTATTCTTACAA 

TAAAATACTATAGAGCAATGAAAATAAATAAACTATGGCTATNTGCAACAACATGGATAA 

ATCTTACCAACAATGTTAGACAAATGTAGCCAAACCAAAAAATACACATGTTCTATGAAT 

CCATATATGCAAGGTTCACAATAGGTGATAGTAGCTACTCTTTTGGGAAGGAGTGAGGAA 

AGGACTGAGAGAGTATTGCAGAGGGGGCTCCGGGTGTTTGTAATTTCTATTTCCCAGTGC 

AGGTGTGGTTTGCATGANCATGTTCACTTTGTGGAAATTCATTGAGCTGTAA 

Sequence 1649 

CCGCGGTGGCGGCCGAGGTACTGGGGCCTTACTTCCCTAAAGACCCAGGGAGGGGAGAGA 

AAGTGCTCTGCTTGTATCCTAAGCACAAATGCAATACAGCAACATCAGTAATGGTAACTG 

AGTGCTTACTGCATGCCAGACCCGTGCTAAGCTCTCTCCATGCACTATCTTAATGTTCAC 

MCCTCACACAGTAAGCATTATTAACAACAGCACTTTACAMTAGAAATTTGTTGAGGCT 

CAGAGTTGCCATAGCTGGTGAGCATCCACAGCCTACATTCAAACCCAAGTTTTCATTTCC 

AAAGCTTTGCTTTCCTGCTGCTGGATGTCTTTTTGCGTCATCTTCGCATATTTAACCTCC 

TTNGGCTCAACTGGGGAACC 

Sequence 1650 

CCGCGGTGGCGGCCGAGGTACTGGGGCCTTACTTCCCTAAAGACCCAGGGAGGGGAGAGA 

AAGTGCTCTGCTTGTATCCTAAGCACAAATGCAATACAGCAACATCAGTAATGGTAACTG 

AGTGCTTACTGCATGCCAGACCCGTGCTAAGCTCTCTCCATGCACTATCTTAATGTTCAC 

AACCTCACACAGTAAGCATTATTAACAACAGCACTTTACAAATAGAAATTTGTTGAGGCT 

CAGAGTTGCCATAGCTGGTGAGCATCCACAGCCTACATTCAAACCCAAGTTTCATTCCAA 

AGCTTTGCTTTCCTGCTGCTGTATGTCTTTTGCGTCATCTTCGCATATTTAACCTCCTTG 
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GCTCAACTGGGMCTCTTTGMGGCTAGGGCCATCMTTACATTTTTrGTTTCTTCCA 
Sequence 1651 

TCGCGGTGGCGGCCGAGGTACCGTTCCTCCAGTGCCCAGAGATGCTCTCCGCACCAAGCC 
ACAGATGTGGAGGAGGCAGGTCTGGACTTCAGGCAGACAAGGAACTTCAGAAAATCCTGG 
GGAGAGACAGCACANGGTAAGAGGNTAGGTCAGNAGAAANTCTNAAGGCTCCCTTTTTTC 
AACTTTAGNCACTGTACCATGCCCCGGGGCGGGCNCGTNTCTAGTAACCTAAGTGGGAAT 
CCCCCGGGCTTGCAAGG NAANTTCNGATTATTCAAAGC NTTATCTGTATTACCCGTTCGT 

ACACTTACNAAGGGGGGGGTGCCCCTGGNTACCCNCAANCCTTTNTTGTTTTCNCCCTTN 
TTAGATGGAGGGGGTTTAAATTTTGNCTGCNGGCCTTTGGGTCCGTTAAAATTNCAAATG 
NGGTTNCAATTAAGCCTTGGTTTTTCCACTTGNTGGTTGGAAAAA 
Sequence 1652 

CCGGGCAGGTACCAGCCAATCCCTTCATTAAATGTATACAGATTTAGTTAAGTAGCATTA 

AATAGGATTCTTAGAAGTATGTCCTCATAGMCTTTTMTACTTMGGCTTTGTAAAAAC 

TATCCATGAAGGGAAAGCTCCTCAGCATAACTGCTCAGGGAAATAGGGCTAAATAACTGA 

ACATTAAATAATTGGTTAAAGGTGCTGTTAGCCGAGCCTCAATGCTTGCTACAAGGATGT 
ATGTACCT 

Sequence 1653 

CCGGGCAGGTACGGGGGTCTGCTTAAAACTCTTCAGAGGTTCACCATCGCTTCCAGAAAA 

AGATCAATACTGCTGACGTGGCAACCTGTCGAGACTCAGGACCGCTAGGAAATCAGCTCT 

TGTTTACCCCTCTATCCATrACACTACGATCCAGCTGGGnTAGGCATCACTTCCTCTGA 

GCMTCCTCCCTGATGTCATGAGGCAGTGCTGAGTGCCTTCTCTGGACCCTCTGTTTACT 

TGCCTGTTAAATCCTGAACATCTGGAAGGGAGTCGTGTTCAGCAACATATCTACGCCTTG 

CAGCACGGTGCCTGGCCTGCAGCTGGTGTTCCGTAAACAGCTGCTGACT 
Sequence 1654 

aggtacgcgggagtaaatgcagtacaaattgttctactgttttaaaaAgttttccgcaga 

ACAGTGCATTTATGGCAATGCTATGTTTAATGAGTTAGGGACATCAAATATATAGTAGTT 
CCTTATTTTCAGTTGTGAAAATGAAATGGCTAAAGCAGAAGAGACGTCTATTTTAGTCTT 

ttaaaaatgtgtgtgggtggtcttttttcctcagmgcccaaagcacatgtatattttgt 

TATTTCTCCTTGCTATATTCCTGAGACTATACTAAAAACTTrAAGAAAAGGAACAAGAAA 

aaggtaaattcatgtgttccccactgctgtgtctagaaccaagatcacatta 

Sequence 1655 

ccgggcaggtaccagccaatcccttcattaaatgtatacagatttagttaagtagcatta 
aataggattcttagaagtatgtcctcatagaacttttaatacttaaggctttgtaaaaac 
tatccatgaagggaaagctcctcagcataactgctcagggaaatagggctaaataactga 
acattaaataattggttaaaggtgctgttagccgagcctcaatgcttgctacaaggatgt 

ATGTACCT 
Sequence 1656 

CCGGGCAGGTACCTGGTAATTTACAGMCGTTCGTTATCAGATTTGATAATTATATATTA 

aggttgtaaagaatgtgtatgccagaattataggtagtagatcctagattcttaggaaaa 

ATAGTTTCTTTATAATCTTTTGAGTAGTGAAATGGTTACTTTTACAATGGTTATGAACTG 

ggtcaaggcaaaagggccactatatgtcttcagtcatctttctatgcctgaaatccagga 

AACAGTGAAAATGGATGTTCCCTGGAACAGCCATTGCGATGCCATATGTTGGTCATTGGT 

GTCCTTAAAGTGTATCTCAGATAATTGTGTGTCTCCCT 

Sequence 1657 

TNACCGCGGNGGCGGCCGAGGTACAATTGATATAACAGCTTTGTTGCCACCCCAGGCTTT 

TCTGTGATGATGCCATGGGCCACATTCTGATCAAACTGCACACCCAGAAGGTGAAGTGTT 

GGCTCCAAGCGAGAAAAATTATTMGTTTGGCACTTGAAACCCTGCTGTCCAAAAATTCT 

GAAAAATCATCCTGAAGTTCAAACTTGTGTAGAACTTCTCCAAGTAGATAGCCACTGGAA 

AATGCCTTTGCAAATGACTTGGGACTCACGGTCCGGGACACCTTCAACTCCTTGTTGAGC 
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CACTGGCACAGGGATCTCCGACATGGTTCAGCCCGGTGCCTCATACCGCANGCGGGGGCT 

TAGAAA 

Sequence 1658 

TCCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACTTATAGTTGAGAGC 

CAAGTCTCCCTTATCATTGGTGAATGAGAATGAGCTACTGAAAACAAAAAGAGGGTCTTC 

TACTCAGCCTCTACCCCTAATATTTATATCAGAAGCAGAGATTAACTGTCCTTACTCATT 

CACACGTTAATGGAAGAGAAGGAAGTTTCCTAGAAAAATCCTCCCGCTCCACCCTGCAAA 

CTTTATGCTTTTCTGTTACATAATCAGGCAGGGGCAAGACCTAMCTATTTTGAATTGGT 

GGTG7TGAGGCTAAA7TCTCTGCTATTGACAGAATTGAGAATGTGATCAA7TTCAGAGTA 

GCATGCTACAAATTTTGTCCCAATTTCAATGGGGAGAA 

Sequence 1659 

CTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCCCTNTGGTNGTNGCTCAGGAAGCTTTANA 

NATATATGGCTATTTMTTTGCAMTAATACTTAGAACTGCAAAMCACGTTCACCTGGT 

CTTGTTGAGAAAGAMTCATAAGTCCCCTGACTTAAACTCTCTCAATTGATGAAGTGACC 

AGGATTACTCAGAGTGCAGTTATTGGCACCCCCGAAGCACCAGAGTCCAGGAAAGATGCT 

TATCAATCTTTTGCCATTGGCTTCTACTTGCAGGTGCCTTCTCTACTATTCAATCATATA 

CGATCATTGGAAAGACAAATGAAGTCAAGAAAAGCAGCGAAACAGTACCTCGGCCGCTCT 

AGAACTAGGTGGGATCCCCCCGG 

Sequence 1660 

CCGCGGTGGCGGCCGAGGTNCCCGGGGGATGCTGCCACCTAGGTTACTTGTAGGACCCTA 

TACGGCAACCTCCTTTGCCAGGMCTATTTATAAACATCCTGCAGGAAAATGCAGTGAAG 

TAGAAGAGACAGGGATATCCCAGAAGGTTATGCAAAACATCAAGAGAAGATGAGAGGAGT 

CTATATGTCAGAATACACATTTCCCACCTTGCCCAACAGTAGAAAAACATAAGAAGAGAA 

AAACATTAAAAAATGACAAGGAAGTTAATGGAAGTCAGCAATGTGATGGTGTTTGGGAGG 

TGGAGCCTTCAGAAGGTAATTAATGCCCTTGTAAGAAGAGGCCAGAGAGCTTGCGCACC 

Sequence 1661 

AGGTACAGTTACAACTGACTTGCATAACACTAAAGAAAACATTGTAATTCCTACGCCAGT 

AGTGGGGGTAAACTGAGMCCAACCTAATTTTCACCACAGGGCCCTCTGGAAATAGAGGA 

ACTAAAATGATGACTGTTTGAAAGTGAAGAGGCAGATTCCTAGAACATCACTCCTACTGC 

ATTCCTTACCTCCACTCCATACCCTTTCCTTCTTCACTCAGGTAGGACTGTCTCTGGGAA 

TGCATGGACAGCATGCCACAGTTCAGNAGCTTGGGATGTGATCCTGGGAAATGGGGCCTT 

AAGATGACCCTTTATGGAAACACCTGGAAT 

Sequence 1662 

CTTAGGGCGAATTGGAGCTCACCGCGGTGGCGGCCGCCAGGGTCAGGTCCGTCATCGCCA 
CCCCAAGCCCGCGGGCGGCGGCATCCAGTGCCAGTTCGTCGAGGTTGAACGGGATATGCC 
GGTAGTCGTCCAGCGACCGCGCACCATTGGCCGCCAGCCAGGTCATCCAGTCTTGCTTGT 

0 

Sequence 1663 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCTGGAAAGATG 

AACAGGATTGCAGCAGATAGAGACAGGTAGAAGGAGAGAGATTCCAAGTGGAAGGATAAA 

GGAGAGGGAAGAGAAGCAAAGAGTGCAACTTCCAAGAAGATTTTCTGGTTTCTGGCACTG 

GTCATAGATAGCTTGGGAGGAAACAGGCAGAGAGGGTTTTGGAGAAACAGAGCTGCAAAG 

CCMGTTAGATGTTAGGTGAGTTGAAAAAATAACTTAGGGCAAAAGCAGAGGGAGGGAGG 

TATTTACTCTGAACAGAGATGACAGGGTGGGGGAGGACAGATTGTTTTGAGTAGGCAAAG 

GCCANGAGCATGAAGCAAGCTAGGAATTG 

Sequence 1664 

CCGG GCAGGTACTATTAAATTTAATG AG AGGTAACCC ATG C ACCGGCAGG AAGTTCAAG G 
GGACAGAAGGGGCTTGCTCCACTCTGGAGCCCTATCTCCTCTCACCTCCTTCCTTCTAGT 
TTGACATGCGATCTTGTCTCTCTCTTTTATAAAGTGCTTCCAGGCCCTTTTGATTAGGAT 
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GCTGTCCTAAATCCTTCCTCCCGCGTACCTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnCTTTATCGATACCCGTCGACCCTCGAGGGGGGGGGCC 

CCCGGTACCCCAGNCTTTTTGTTTCCCTTTTAAGTGAAGGGGTTAAAATTGCCGCCGCTT 

GGGCCGTAAATCAATGGGTCATAAG 

Sequence 1665 

CGCCCGGGCAGGTACACCAGAACTTAAAGTATAATAATAAAAAATAAAGTATTACAGAAC 

ATGGGTCAATTATATTAAAAAGTAGATTMTTAATGTCCATATATACATTTmGGTGGA 

AATCAAACATAATGAAAACTATTTCACATMTTAATATTTTC 

ATTTATTGCAGAGATGGGAGTCTTGCTAGGTTGCCCAGGCTAGTCTCAAACTCCTGGCCT 

CAAGTAATCCTCCCACCTCAATCCCCCAAAATGCTGGGGGATTTCAGGGAACATGCCCAC 

TGG^CTTTATTAGGTGTTACCCTGGTNCATAAMGGTTTAAATTACCTCATT 

A I I I I I CAGGTATATTATATTATACCAC 

Sequence 1666 

ATACCAGGC CGTT TCCCCCTGGNAAGGCTNCCTCGTGGCGCCTCTCCCTGTTNCCGACCC 
CTTGCCCGNTTTTACCGGGAAANACCCCTGTTCCCGCCC I I M It II I CCCCTTTTCCGG 
GGGA 

Sequence 1667 

AGGTACTCTTGTTTAACCATCAGAGGTGATCCCATCACCTTCACAGCCCCAGCCTCTGCT 

CCAGTCCCTCCCCAGCGAAAAGGGCCGCCCATGCCATCCTGCTGCTGGTGATTTGCTTTG 

TGGTCATGGACTCAGTGGACATCATTATTTTATTAACCGTGTGGTAGATTTTTAACTCAG 

TTATCCTGGATATCCAAAGGTTTGTGGTCCATCTTTAGGCTTCCGTTTGTTCTTTGGTAC 

CTGCCCGGGCCGGCCCGCTCTAGAACTAGTGGGATCCCCCCGGGCCTGCAGGGAATTTCG 

ATATCAAGCTTATTCGATACCCGGTCGACCCTCGAGGGGGGGGGGCCCCGGGTACCCCAA 
GCTTTTT 

Sequence 1668 

GGANCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGTAGATGGAAGGAAGAACTTGTGTG 

CTTAGACCTGACGCTGGGAGGAGATGCTGCCACCTAGGTTACTTGTAGGACCCTATACGG 

CAACCTCCTTTGCCAGGAACTATTTATAAACATCCTGCAGGAAAATGAGTCTATATGTCA 

GMTACACATTTCCCACCTTGCCCAACAGTAGAAAAACATAAGAAGAGAAAAACATTAAA 

AAATGACAAGGAAGTTAATGGAAGTCAGCCAATGTGATGNGTGTTTTGGAGGTGNGAGCC 

CTTC AG N AAGGGTAATTTAATGCNCCTTGTTAAGN AAGAAGGCCCAN N)*AG AGCTTTGCG 

CCACCTTTCTTCCCTGCCCATTGTGGAGGGAAGCCCAAGNAAAGCCCGGC 

Sequence 1669 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCCGGGCG 

TGTGAGGTGTCAGTCTGCCCCTACTTGGGGGTGCCTCCCAGTTAGGCTACTCGAGGGTCT 

GGGACCCACTTGAGGAGGCAGTCTGTCCGTTCTCAGATCTCCAGCTGCGTGCTGGGTGCT 

GGGAGAACCACTACTCCCGCGTACCTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnAGGGTTAAATNTGCGCCGCTTGGCCGTAAATCATGGGTCA 

TTAGCCTGTTTCCCTGTGTGGAAAATTGTTTATTCCCGCTCAACMTTTCCCACACCAAC 

ATTACCGAAG 

Sequence 1670 

CCGCGGTGGCGGCCGAGGTACACACTAGTTTTGTCATACCGTGAGGCAGTGTATTAAAGA 

GGAAATGTTCTGTCTTTGGGACCAGACAGACTTGTTTCCATAACCCTGATTGTCTAGCAC 

TTTATAGCTCTGAGTCCATGGACCACCACTTAGCCATCCCTACCATCAGTTTTTTGTAAA 

GAGCTATGTCAGGAATGTGGAAGGATTCAAGAGAGGGCTTTTAAAGTGTCCATCACGACC 

TTTAACCCAAGGGGAATTAATGCAGGAACAGAAAACCCAAATACCACATGGTTCTCACTT 

TATCTGTAGGGAGCCTAAAACATTTGGAGGCCACCACCATGGGACCACCGAAGGGGGAGG 

CCAATTAGGACCCTACCTATTTGTAGGG 
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Sequence 1671 , 

GCGTTGGCGCTCACCTGCCCGCTTTCCAGTGCGGGGAAACCTGGTCGTGCCCAGCCTGCA 
TTTAATGAAATCCGGGCCAACCGCCGCCGGGGGGAGAAGGCCGGTTTTGCCGTTATTGGG 
GCCGCCTCTTCCCGC 
Sequence 1672 

CTACTTAGGGCAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCTGCTGAAAGATTAT 

TTCTAACAGGCTTGTAGAGAAACGTCGGTTCATGTAAATTAGAAATTATGGGGCCACTTT 

GCCATTCTTCACACCTGCAATGAACAGGTGTTTATCTGCAGTTCTGACTTATCTCTTGAA 

CTCCATTTGCATGTTATAGTGGGATGCAGCTGATGCCCTGTCCAGATCTTCTTCAGGCCA 

CTACATCTATATATGCATTCATATTCCAGTGGCTGTGAAGTGTTGGGCTGTTGGTTGACA 

GAAGGGAGCTGCATCCTTCTGGGAGGGAAACTGAACTTCAGCTGGATGAAAAGCCCACCC 

TGGGTCCTTGGGAGGGTGAAAGCATCTTTCCAAAATGGACAGGCCTGGCAGGTCAATGGA 

CTGGAT 

Sequence 1673 

CCGCGGTGGCGGCCGAGGTACGCGGGGTAGATGGAAGGAAGAACTTGTGTGCTTAGACCT 
GACGCTGGGAGGAGATGCTGCCACCTAGGTTACTTGTAGGACCCTATACGGCAACCTCCT 
TTGCCAGGAACTATTTATAAACATCCTGCAGGAAAATGAGTCTATATGTCAGAATACACA 
TTTC 

Sequence 1674 

ACGACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAAATATTTA 

TAGATATGAGGCTTCACTTTGTTGAATTCATCATACACACACACTTCTTTTTAGCAGAAA 

TTAATAGAAAGCCAATTCTAAATTAATGTAGGTAATACATAATCTACTCAACTGGATGAA 

TACCTTGGTTGCAAGTCTTGTCCTGAGCGAGCTAAGTGACAACACTCTCAATCTACTGAG 

CTTCCAGCATCTAGGCAGAGCTCCCAGGACAAGCAGAAGTCACAGGGAACAGGAAGCAGG 

AAAAGAGCAGGAAAAGATCCTTTAGTTTCCATGTGTAGAAACAAATGCTCAACTGNCTTN 

TGTATTGAANCCTTCTGGATGCCTGNCNNTTTTATTTGGTCANTTGGNGGNATAGGAAAA 

TCCI I i I I I AGGTTGGGCAGNCCNATTTAN AAAAATTTGGGAAACCCCNCCC I It I M IN 

AAAAAA 

Sequence 1675 

GTACCAGGCTAGGCAGCTCTGGAGAAAGCAGAAGTGGATAAATAAGGTGTGGACTCACCA 
AAGACAGTTCCAAAGTCAATTTCACTCTGACACACTCTCTGTGATCTTCCACAGTCAGCA 
C AATGCCTGCCCCCTG CTAGGCCTG ATGG ATGATCTGATAGAAAACTGGCTTCAGC 
Sequence 1676 

CCGCGGTGGCGGCCGAGGTACTGTTCGCTGCTTTTCTTGAGTTCATTTGTCTTTCCAATG 

ATCGTATATGATTGAATAGTAGAGAAGGCACCTGCAAGTAGAAGCCAATGGCAAAAGATT 

GATAAGCATCTTTCCTGGACTCTGGTGCTTCGGGGGTGCCAATAACTGCACTCTGAGTAA 

TCCTGGTCACTTCATCMTTGAGAGAGTTTAAGTCAGGGGACTTATGATTTCTTTCTCAA 

CAAGACCAGGTGAACGTGTTTTTGCAGTTCTAAGTATTATTTGCAAATTAAATAGCCATA 

TATCTCTAAAGCTTCCTGAGCAACAACCAAGAGGTACCTGCCCGGGCGGCCGCTCTAGAA 

CTAG 

Sequence 1677 

AGGTACGCGGGACATTGATGGGAGAGATGTGGAGGAAGAGGGAAGAGGTCATTTTCCGTG 

AATACAGCTTTTC I \ I I I ACCCTTTAAACA I 1 I I 1 GCTCCCTTTGTTCTTTTCCCAGATT 

ACTTCTGTTTGCTTGGGTTCATCTGGCTCTCCATGAATGTCCTGCTTTTCTGGAAAACCT 

TCTTGCTGTATAACCAAGGGCCAGAGTATCACTACCTCCACCAGATGTTGGGGCTAGGAT 

TGTGTCTAAGCAGAGCCTCAGCATCTGTTCTTAACCTCAACTGCAGCCTTATCCTTTTAC 

CCATGTGCCCGAACACTCTTGGCTTACCTCCGAGGGATCACAGNAAGGTTCCAAGCAGGG 

AGAACCAGGGAGATTGTTGGATAAAAGCAGA 

Sequence 1678 
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CTACTTAGGGCAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACGTAGGGGCC 

CTTTGCTGCAAGCTTAGGCCAGTGTCAGCTCCTGAGCCACTCCTGACTCAGTTGGAGGAG 

ACTCAGTGTCCTCTGCCCCCTGCCTGGTGGAGCTTGGCAGCAAGTCCCAAGTTTTACAGT 

GTTGAGACCAAGTGGGCAGCCATCACCATTGAGGGGTGCCCTTTTGCTCCCCCTCGAGTC 

CMGTTATAGTATTGCTTACTTCCTACCTGAAATAGTTTATGGGTCATGGGTCTGGCTTA 

CATCAAGCCCCATAGCCAGTCTGGTTGCCCCACCTAAGGTCTTGGTGCTCTTTGATAATA 

GCCACAGCTAAATGTATTAGTCCTGTCCTAGGGAAATGCATTGATCTTCTNCCCAAAAAT 
ATATA 

Sequence 1679 

TTC TC TTACTGATAGTAGGATATTTC TGCTTTAGTTATTGTC ACCTTAAATATATTTTCA 

ATGTTGAAATCCTCACAGCATGTTTGATGAAATCTAGTTTTCAAATTTTCTTAGGTAM 

TTCTG TCAC GTTGGCATGATAACAAATGCAATAACCCAAAAGACCCCAAAAGCTAGTGTA 

ATCCCTTTTGCAATCCAAGCATGAGGATTCATCTTCATGTTGACAGTGCGTGAATGTTCG 

GTAGGCTTTGTCAAGCTTGCATACAATAAATTATATATGTCCCTTTTCTTTTAGGGTC 
Sequence 1680 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTTACAGGGGACCGCCAGGGGCCTCGAGAATCG 

GTATCCTGAGTCCTCTTGAAGAGCAGTAGAGGTTGTTTCATTAAGTGCAAACACATTGTT 

CTTAATTTGAAAACTGTGGGCAGAAACAGMGCCCGAGACTAATTTTTCCATTGCTAACT 

CTAGATTCTCGGCCACTGGAGTCTGAAGATACTCTCTTTGAGAATGCATATTATTTTGCT 

CACA GCTAAMCATTTMGTATCATAGCTGATCAGTGGAGTGAGATTAAAMGGTTTCTT 

TTTTGAATCATCAGCTAGAAGATGTACCCTCGGCCGCTCTAGAACTAGGTGGGATC 
Sequence 1681 

CCGCGGTGGCGGCCGAGGTACAGCAATGGCACACTCTATCCTAATAATGACATCCTGTAT 

AATTTAGTATGCT CTGT ATCTCACTTTAAATGTAGCATTAAGGATGAAGCAGTAGAAAAT 

GAACGCCGGGCCTTTTAATAATGAGGCTCAGGTTTGCTATCTTAATTTCAGGAATTAGGA 

ATGGGAGGGATGCTAAGGAGACTGTTTAAATTTTTAAAAAGTTCTCTCTATGCCTGATAC 

AGATGCAAATTGCTTGCTCATTACATTCTTTCAGCCTTTTTAAACAAGTTTGCTAAA^ 

ATCATTCTCTGGCATTAAGACAGCTTACCTGCCAAGCTTGGACATTTTCTATAACCCAA 
Sequence 1682 

CACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGTGATTTTAAG 

AGAATAAATGTCCAGATTCCCMCTCCTATATGCACTTTTTACTAAAATGATTCTGCTTG 

AGAATGTGCCTGCAGTGTGAGGGAATGCCGTCTCTTTTCTCTTTTCCCATTTTCCATTCA 

CCCTTTAAAAAGGCAAAGCACACCTCTTCCTCATCCTGAGTCTATTCTGGCATATCACCT 

TGTGGTGTAAAAACCTCCAGGCACCAAACAAAGGGACACTTTGAAAATATTAGTGTCATA 

AACTTGTGATAGGGCTCCAMTCTTTCCTTGCTGTTATATATGTATTTCTGTTAAGGAAA 

GAAGTGGGATAGAAATACAAAGTTGAGGAGAAAAAGGAAGGGATGTAAAGCTTTCTGGAT 
TGTCT 

Sequence 1683 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGGGCTCTTGATA 

TATCCTGGCCTTAAGTCTCAGGTTCTCCAACAAAGCTGGCAGCCTGTGGTATGCCTTCTG 

GGGCCAGAGAGTGTGGCATGATCCGAGGTCCTGAGTCTCTCTCTCAATGAGCCCATCATA 

TTGGCTTAACCAAAGCCAGTGGTCTTCAGGA 

Sequence 1684 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACAGATGAAGAGGC 

AAGAAAATTATGCCAAATTACCCTGTTTCCCTTTGTCACACTGCCTCCTCGCTGACACCT 

TCATTCTCCTGCAACCTTACACTTTGATGATGACTTGCTCTGCTTCCATGGAACTACTTC 

ACTTG GCCTATGGCACAAGGCACTTCCTTCCGGCAACATTCGCACCrrGGGAGGGGGCTA 

( Ml 1GC ATGCACCTAGTCCATTGCAATCCATTTGTTCTTGAAGGGCAAGATAGAAACTT 

TGTTTTATTATCTACACTCACGAAAAACAAATTTTGTGCACCATTCCCAACACTATTCTT 
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TTTTCl Mill I ATTTt AGTAAAAGG 
Sequence 1685 

GTGGGCCGGGCCCGAAGGTTAACGCCGGGGGAAGCCTACCGGGCNAAATCCCTGGAAACT 

TTCCCTGGAMGAATGGTTCCTTTGGANTGGTGGCAGGCTTGGGGCCATTTTCCCTTTCC 

GAACTTTCTTCTTCCCANCCCCTAAGCCTTTCCCCAAGAAACCAATTCACCATTAATCAA 

CTTGCCAAAAAAAATTAAGGCCAATTTGCCANTAACCATTGGGNNATTCCAAGGGCCCCC 

AGGTTGGGGAAAAAATTGGTTAAAAAGGAAAANGGCCCCCTTGGNAANGCCTTNNATTGG 

GGGGGGTTCCAAAAAATTGAAMGGGGTTGGAAAT7TTCCAANANGGCCTTGGANAANGG 

GAAAAAATTATGN NCAAA AAATTTTCCAACCCCTTACCAACCMGGTTTTTNTT^ 

MGGGAATTGGGGTTTTNGCCAACCCGGAAAAAACCMCCAACNTTTGGGGGGGGGAAAA 

AmGNGGGMGGCCAANAAAAANCCAANGTTTCrTTTnTGGAAAAA 

N AAAA CCAACCCTCCCAAAAGGGGGCCTTTGGNTNGGAAAGNAACCCTTAACCCCCTTAA 

ATTTTGGGTTAAGNAAATTAANTTTTNGGCCCAC^ 

TGGGGNTTTNGGGTTTCCCCTTGGNAAATTTCCAAAAGGGA 

TTGGGGGGMNCCGGTTTTTGGGGCCCCCCCTTNGTTTTTTTGGCC I I It I I I I I IAATT 
TAAAA 

Sequence 1686 

TAMGATACCAGGGCCGTTTTCCCCCTGGAAAGCTCCCTTCGTGCCGCTTCTCCTGGTTC 

CGACCCCTTGCCCGCTTTAACCCGGAATACCTGGTTCCGCCTTT7TCTCCCCTTTCGGGG 

AAAGCCGTTGGGCGGCTTTTTCTTNAATAAGNCTTNMCGCCTGNTAGGGTATTCTTCAA 

GTTTNNGGTGGTAAGGGTTCGGTTTCGGCTTCCCAMGCCTTGGGGCTTGGTTGTTGGCA 

ACGGAAAACCCCCCCCGGTTTCAAGACCNCNGAACCCGGCTTTGGNCGGCCCNTTNATTN 

CCCGGGTTAAACCTTATTCCGGTCTTTTTGGAAGGTTCCCAAACCCCCGGGGTTNAAGGA 

ACAACCGGMCCTTTAATTCCGGCCCCACCTTGGGGNCNAGGCNAAGGCCCCAAACTTGG 

GGNTAAMCCAAGGGGAATTTAAGNCAAGTANNCCGGAAGGGGTTAANTGGTTNAGGGGC 

CCGGGTTNGNCTTANCAANGAAAGTTTTCCTTTGGA 

Sequence 1687 

ATTG GAGCTC CACCGCGGTGGCAAAGCAGACTTT1TATTTTMCAAGGAAAGTCATAAAG 

TGAAl Mil I CATGAAAGAAATAGTCTTCATTTAAGAAAGTGTTGCCGTTAAAGGGTTAT 

CTTGGCACTMTGATTGGCTTTGGGTGATTTTAATTCTGAGATGTCTGTCATAAAGGAAA 

TTATGAGACCATAAAATTGAGCTTCTAATTTTCTGTAAGAATACAGAGGAGAAGGAAGGT 

AAMCTGTTMGGGTGAGCTTAAGTAATACACAGATCCTAGATAATCAGTTTGCATTTTG 

TGCTAATGTMGATCACAATGCTTGTGTTTGAATTAATGTTT^ 

TTTGAAATGGACAGTTGGAATCAATCAACAATATTTCTTGGGGGTGCATTTCCACTGGAT 
GC 

Sequence 1688 

TGAATCTGCATTTCTTGGATGAGATAGTTAATAACAAACTATTTCTCAATATTTGTATAC 

TAAAAACTAGTGAAGGTGTTATGTGTTTCAGTATCTTATCTCTTATTTGAACATGGGTTT 

CTGAAAGGAGCCTATATAATAATATAAATGGTATGTAGTAAATGAGGCACTGTCTTGGCT 

G GG ACTGCTATAAAAAAATT ACCAT AG ACC ATTG ACTAAACC AC AAAC ATAT ACTTCTCA 

CAGTTCTGGAAGTTGGAAAGCCAAGATGAGGATGCCAGTATGGCTAGCTTCTAGTGAGGG 

GTCCTCTTGCAGGTTGCAGACAGCCGGCTTCTTGGCTCCTCACATGGCAGGAAGAAGGTG 

CCGCAAGCCTCTCTGGCCTCTTCTTACCAAGGGGCATTAATTACCTTC 

Sequence 1689 

GACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGCATGGATAAT 

TTAGATGTGACTCTGGACAGAGGGATGGATCAGATGACTTC7TAAGTTATCTTCCAGTTT 

AGGAGTTCGTAAACTATACTTTCTCCTTTCCAGACTATCCTAGTAAGAAMTTCTCTTTT 

AAGACAGAGTAGAACTCTGGAATTCATCAGTTTTGATGTTTCTTAAAGTGTAATCTAAGA 

TAGTGCTCCTGTATTAAGTTCTGATGTCTGACCATTGTTCAAATAAAGAGTAAAATGCAA 
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ATGACAGGAMTTGGCTGTGTTCTGAATCCTATTTTTATTTGGGATAACAATAAGCCTGT 
ATGGTCACTGGTGACCTTTTGATTTGCTGTTTCTGCAACCTCACACTTGCCTCAGGATTC 
TTCTTCCACTTCTTGCACTTTATATTG 
Sequence 1690 

GAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAATATAGGCAGACAGGTTTGCCTTCA 

GAAATTCAGAAATGCAGCTTTTGAGGGAGGTCAGCATCATTGGTCTCAGCTACCATTTTC 

CTGCAGGATGTTTATAAATAGTTCCTGGCAAAGGAGGTTGCCCGTATAGGGTCCTACAAG 

TAACCTAGGTGGCCCCGCGTACCTGCCCG 

Sequence 1691 

GAGMGGCGGGTTTGCNGTTATTTGGGCCGCTCTTTCCGNCTTCCTCGCTCACTTGACTC 

GCNTGCGCCTCGATTCAATTCGGGCNTGATGGCAGAAGCCGGCTATNCAGCNTCACCTTC 

ANAAGGGGCGGNTAAATTACCGGGTNTTATTCCCACCAAGGAAATTCAGNGGGGNATNAA 

ACCCCCAGGGGAAAAGTAACCATTGGTGGAGTCCAATAAAAGGGCNCACNCCAAAAATAG 

GGGCCCATGTNNAANACCCCGGGGNAATAAAAAAGNGGCCCCGGCCNGTTNTNGGCCTTG 
GGGGCCGGTTTTTTTTTT 

Sequence 1692 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACACATGATGAAGAG 

GCAAGAAAATTATGCCAAATTACCCTGTTTCCCTTTGTCACACTGCCTCCTCGCTGACAC 

CTTCATTCTCCTGCAACCTTACACTTTGATGATGACTTGCTCTGCTTCCATGGAACTACT 

TCACTTGGCCTATGGCACAAGGCACTTCCTTCCGGCAACATTCGCACCTTGGGAGGGGGC 

TATTTTTGCATGCACCTAGTCCATTGCAATCCATTTGTTCTTGAAGGGCAAGATAGAAAC 

TTTGTTrTATTATC TACACTCACGAAAAACAAATTTTGTGCACCATTCCCAACACNATTC 

1 1 1 1 I IC 1 1 I I I I I ATTTTAGTNAAGGNGGAGAAAGCTTTT 

Sequence 1693 

GGGGCCATTGAGACTGCCATGGAAGACTTGAAAGGTCACGTAGCTGAGACTTCTGGAGAG 

ACCATTCAAGGCTTCTGGCTCTTGACAAAGATAGACCACTGGAACAATGAGAAGGAGAGA 

ATTCTACTGGTCACAGACAAGACTCTCTTGATCTGCAAATACGACTTCATCATGCTGAGT 

TGTGTGCAGCTGCAGCGGATTCCTCTGAGCGCTGTCTATCGCATCTGCCTGGGCAAGTTC 

ACCTTCCCTGGGATGTCCCTGGACAAGAGACAAGGAGAGGGCCTTAGGATCTACTGGGGG 

AGTCCGGAGGAGCAGTCTCTTCTGTCCCGCTGGAACCCATG 

Sequence 1694 

CGGCCGCCCGGGCAGGTACAAGGGAGACAGCATGCAGGGTGTGTTCAGAAGAGCTTGCTG 

AGGTGCTCGGCTCTTAGCATTAAAAATGTGATGTTGGTATATCATCCTGATAGAAAACAC 

TGCTTTCCAAATCCTAGTCACTTGGATGGGAGGAAAGTAAGAACAGATTCTTCCAACCAC 

TACTGATTTGTTATAATTCTCCCCATTGAAATTGGGACAAAATTTGTAACATGCTACTCT 

GAAATTGATCACATTCTCAATTCTGTCAATAGCAGAGAATTTAGCCTCAACACCACCAAT 

TCAAAATAGTTTAGGTCTTGCCCCTGCCTGATTATGTAACAGAAAAGCAT 

Sequence 1695 

GGGGCCATTGAGACTGCCATGGAAGACTTGAAAGGTCACGTAGCTGAGACTTCTGGAGAG 

ACCATTCAAGGCTTCTGGCTCTTGACAAAGATAGACCACTGGAACAATGAGAAGGAGAGA 

ATTCTACTGGTCACAGACAAGACTCTCTTGATCTGCAAATACGACTCCATCATGCTGAGT 

TGTGTGCAGCTGCAGCGGATTCCCTCTGAGCGCTTGTCTATCGCATCTGCCTGGGCAAGT 

TCACCTTCCCTGGGATGTCCCTGGACAAGAGACAAGGAGAAGGCCTTANGATCTACTGGG 

GGAGTCCGGAGGAGCAGTCTCTTCTGTCCCGCTGGAACCCATGGTCCACTTGAA 
Sequence 1696 

CCGGGCAGGTACGCGGGACAGTTCTCTTTCCTCCACTAGACTGAGCTCTTCAGAGGAAGA 
CTCACTTGGCTGAAACCATGATTTTACTTTAAACACATTGAAAACCTCTACTGGAGTGCA 
TTGTGTCTGGTGGGCTTCAACCTTAATTCTTAAGTATGTGAAAACACATCACCTATCTGG 
AGGTTTACACTTTCTGCTAATGACTTTATTTTTAAGCCCACCACCCTAACACAACAAATA 
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CTTAAAACTTGTCTTCATTTCCTTTAGGTCTGGCCCTCATGCATGCATATAATTTATAGA 
GTCACTGTTTTGCTCGGTTGTCCTCATGC CTCTATATTATTGG GAGGTT 
Sequence 1697 

CCGGGCAGGTAC I 1 I 1 U f i I I I 1 I I M M I 1 1 11 I G AGATGGAGTCCCATTCCGTCACC 

CAGGCTGGAGTGCAGTGGTGTGATTTTGGCCTACTGCAACCTCCGTCTCCTGAGTTCAAG 

CAATTCTCCTGCCTCAGGCTCCCAAATAGCTGGGATTACAGGTGGGCACAACTACACCCA 

GCTMTTTTTGTATCCTTAGTAGAGACGAGGTTATACCATGTTGGTCAGGCTGGTCTCAA 

ACTCCTGATCTCAGGTGATCCACCCGCCTCGGCCTCCCAAAATGCTGGGATTACAGGTGT 

GAGCCACCATGGCCAGCCGGTTGTTTACTTTTGATAAAGGATAGCCAATGTAGGATTAAA 

GAGGATTCCAATCGCAGTCCCCAGGTTGGGAGTTTATCCATCAATAAAGCAAATTACGAT 

AACTCATAGTGATAMTATTAMCCCAAGCAACAGGAAAATATTTTATTTTCCAGTGTTT 

C ATTG GGCTTCTG AATCATTCTGG ACAC ATGTATATTTCCCTGTCTATATTAAGTAG N AT 

GTTATTTTCTC ACTTTAAAATTATCCCGTTAA I I i I IGGATTTATTTGNTATTTTGGA 

Sequence 1698 

CCGCGGTGGCGGCCGTAGCTTGCAGGGGTGCTATGTGAAGATGGCGGAATTCCCAACGTG 

CTTCTGCAAAATTTTATGTATTCATCCCCCTCTGCTCTCAGTAGGGTGTTACCACCTGCG 

GATCCTTTGCTTTCCCCGCTGGATACTAACAGCCTTTTTCATAATTGCTAGCACCTATGG 

AATTGAGTGTTGAACTGAACAMCAATAGTCTTTCTCTGTMTGTCAGAGTTATTTATAA 

GACGAGGCAGTGTTAGACATAAGAAAGAGCACTGGGCAAAGAATCAGACAACCAGGATTT 

GGTTCCATTCTTTTATTATCAAAGAATAAAAAAAAAACAATATTATTTTAAGATGCAGTC 

CCMGTCTTATAAGGAATTTGAAGGCTTCTGGGATCTAATCTAGGTCATGCCCATTCTGC 

CTTCCCTTGCTGTGTGCCCCTGACACAAGCATGGAAACATGTATTCACTTACCTTTGCCC 

TTGGGAACACTTTGGTGGGGAAAAATGTTGATATAGGTTAGCATTTTTATAAACTGTCAA 

GTTGTACCT 

Sequence 1699 

TCTGAGAACTTGGGCTGCGCAGGGAATGCTGGTGTGGGAGAAGCAAGCCAACTGAGCTGA 
GCTGCGGCCCTGTTTGTCCTTAGCTCTCCAAAACTTGCTGTCTATGTCACTTGGAAGGAA 
TCTGATTCTTTTCTGCAAGGCACTGACAATACCTTACAACAGAGCTGGGAGATAAAATGC 
AGAGCTTGTGAAAGTCATCAGAAAGTTGAACACAGATGGTCCCCGGCTTCCCGCGTACCT 

0 

Sequence 1700 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCAGCGGCCGCCCGGGCAGGTACTGCACTG 

TGGTTTTGATTTTCCCTGGTCATTAGTGATGTCGAGCA 1 1 I I I l CATATGTTTGTTGGCC 

ATTTGTGTATCTTCTTTTGAGAATTGTCTATTCATATCCTTAGCTCACTTTTGATGA 

TG I 1 1 I 1 1 I CTTTCTAGTTTGTTTGAGTTCATTGTAGATTTTGGATATTAGTCCTGTCAG 

ATGCATAGGTTGTGAAGATTTATATCAAAAGTGTAGAGCCTGAGTTTGGTCATGAGGAAA 

TATTAGAGAGACCCTGGTTAGTGTTAGCCTAAAAAGTGAAATGCCTAAATACTTAGGATT 

GTTAAGGAAATAATAAACCAAAAAACATTGAAAAACAGAGAGCAGCTGCACTAGTGCTGA 

GMTTGAATGAATATATCAGAGTTCTTTTAGTGCCCGTGAGATGCTAACCTTTTGACCTA 

GGCAGAGTAGAGATATTCCACAGAGCTCATTGGACATTCAGTTTAGACCTCAGAAAGGGC 

ATACTGTAGGAAGTAAGGCCACATTCTAATGTAAAGAACTTAACCCTTTGACCAGTGGAC 

AAAAATTGAAACCAGACCTTATAGAGCAAATGGATAAAATCAACCCGGGTGAA 

Sequence 1701 

TGGAGCTCCCCGCGGTGGCGGCCGCCGGGCAGGTACCTCTGGTTGTTGCTCANGAAGCTT 

TAGAGATATATGGNTATTTAATTTGCAAATAATACTTACAACTGCAAAAACACGTTCACC 

TGGTCTTGTTGAGAAAGAAATCATANGTCCCCTGACTTAAACTCTCTCAATTGATGAAGT 

GACCAGGATTACTCATAGTGCAGTTATTGGCACCCCCGAAGCACCAGAGTCCAGGAAAGA 

TGCTTATC N ATCTTTTGCCATTGGCTTCTACTTGCAN GTGCCTTCTCTACTATTCAATCA 

TATACGATCATTGGAAAGACAAATGAACTCAAGAAAAGCAGCGAACAGTACCTNGGCCGC 
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TCTANAACTAGTGGATCCC 
Sequence 1702 

NAGGATACCAGGCGTTTTCCCCCTTGGAAGCTCCCTCGTGGCGCATCTCCTGTTCCGACC 

CTGCACGCTTACCCGGGATTACNCTGTNCCGCACTTTCTTCCCATTACGGGGTAAAGCCG 

ATGGGCCGCTNTTCTCAATAAGTCNTCACCGCCTNGTAAGNGGTANTNCTTCAANGTTTC 

NGGNTGGTAAAGGGTTCCGGTTACGGCCTTCCCAAAGACCTGGGGGCCTTGTTGGTTGGC 

CACCGAAACCCCCCCCCCCGTTTTCAANACNCCCGNAACCCGGCTTGNCCGGCCCCTTTA 

ATTCCCNGGGTTTAMCCTTAATTTTCGGGTCCTT7TGGAAGG7TCCCCAMCCCCCCGG 

Sequence 1703 

CGACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACTCACTGCGTA 

TTTAGAAGTTCTTGGGGCACTGTGTAAGTGTAGGAGAAGATAGATGGAGTATCTAATTAA 

TCTAAAGATAAAAATG AATG AGGTAGAATCCATTCAGTAGACTAATAAATAGTTCTTAAG 

ATCTATAGGATTTCTCTTTTGGAGTGTGGAGACATAGTGTGGAGACATTTGCTAATGTAG 

CCATAAAGATAGACCTATTGTCCATTGCAATGAAAGCAACAAATACTATAACATAAAATA 

TTTCCAAATTGGTTTGGGTGAAAATMGCTTGTTTCCCAATAAATGTCAGATGGCACTTG 

TTGT AAGAG GTMCTTTGMGAAATTTAAATTAATATATTATTTCCAGCCAAAAACGTCA 

TGCATT7TTAGAAGTTCACAAAAGTCTGTAAAATATGCACACAAATGTCCATCAATTGTA 

ATTTATTTCAATAMGAMGGTAGCTTATACCACTTGTGGCCATCAGATTTCCTATAGAC 

TATGCCTCAAGAAATTATCTAAATCTAAGGNGTTAGCAAAATAGTGCTTACATGTATTTT 

ATAGTATGAAATCACCCTTCAATATACATATTCATATAAGTTGTTTAAGCTGGGGCACTG 

CTTTACTGAAAA^AAAAATCTTTCACATTCTCTAT^ 

GAATGGCTCANCTGGTNAATTTCTACNGAGGGTTAAAAGGAAGGAAAGAAAAAAA 
Sequence 1704 

GGAGCTCCCCGCGGTGGCGGCCGGGGGCCATTGAGACTGCCATGGAAGACTTGAAAGGTC 
ACGTAGCTGAGACTTCTGGAGAGACCATTCAAGGCTTCTGGCTCTTGACAAAGATAGACC 
ACTGGAACAATGAGAAGGAGAGAATTCTACTGGTCACAGACAAGACTCTCTTGATCTGCA 
AATACGACTTCATCATGCTGAGTTGTGTGCAGCTGCAGCGGATTCCTCTGAGCGCTGTCT 
ATCGCATCTGCCTGGGCAAGTTCACCTTCCCTGGGATGTCCCTGGACAAGATTTTACTCC 
C ATAACCCAGGCAGAAG TACCTC GGCCGCTCTAAAN CTAGN GGGAATCCCCCGGCCCTGN 
GGNAATTTCNATNNANGC 1 1 I I I ANTNAN NCCCN NC AAACCTTNTNNGGGGGGGGGCCC 
Sequence 1705 

CCGGGCAGGTACAAGCTATCAAAGGCGGCAATTTTCTCCTGCAAGACTACGACAAGGATC 

CGmATTGCMTGGAGATCATAMTAGGTGTCTTAAATTGAATGGACTTGACCATCTCC 

CCAGTACTCTGCGTTGTTACCACTGCTTCCCGCGAACTCTGCGTTGTTACCACTGCTTCC 

CGCGAACTCTGCGTTGTTACCACTGCTTCCCGCGAACTCTGCGTTGTTACCACTGCTTCC 

CGCGAACTCTGCGTTGTTACCACTGCTTCCCGCGAACTCTGCGTTGTTACCACTGCTTCC 
CGCGTACCT 

Sequence 1706 

CGCCCGGGCAGGTACAATGGAACAAGGAGATAAGCAGTGAAAGGCCAAGGGAATGTCTGG 

AGTTAGGACTTCAGGTGATTCACAACTTGGCTGCCAGTCACCCGAGACTGCCCAAGCCCA 

GATTTCTTCCTTCTATMGMTATrGATTCTTGCAAATAAGATGAACCTAAATGTGGTCC 

AGGAGTCAGCATCTTCTACATGGTACCT 

Sequence 1707 

GMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGTTCGCTGCTTTTCTTGAGTTCATT 

TGTCTTTCCAATGATCGTATATGATTGAATAGTAGAGAAGGCACCTGCAAGTAGAAGCCA 

ATGGCAAAAGATTGATMGCATCTTTCCTGGACTCTGGTGCTTCGGGGGTGCCAATAACT 

GCACTCTGAGTAATCCTGGTCACTTCATCAATTGAGAGAGTTTAAGTCAGGGGACTTATG 

ATTTCTTTCTCAACAAGACCAGGTGAACGTGTTTTTGCAGTTCTAAGTATTATTTGCAAA 
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TTAAATAGCCATATATCTCTAAAGCTTCCTGAGCAACAACCAGAGGTACCTGCCCG 
Sequence 1708 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGAGGCATGAGCCACTGTGCTCGGCCAAG 
ATTCTCTTAAATTCATCTTTTGCTT^ 

CACCGTCMTCACAGAATTGGTTGGATAGTTTGCTTTCTTGCTAATTTTGCCTTCTTTCA 
CTTGTTCTCTATAAATTGTCCAGACTATCAGATAAAATCCAAATGTTTCATCCTTTGCAA 
CTGCTCATATACTGTCTAACTTAGTGAAACAMCTGCTTTCATTGTTATTTGTTCTTCAG 
AATACTACCTCC TAAGTAAATGGTACCT 
Sequence 1709 

ACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCGTTCCTCCAGTG 

CCCAGAGATGCTCTCCGCACCAAGCCACAGATGTGGAGGAGGCAGATGGCCTGTGAAAAC 

TGMGGCMTTTGGAAGGCATTTTTAAATTCCGTGGAGCCACTGGAGAAATGTTTACAGG 

AATGATGCCTTAAAATTCCTCAGTGGTAAGATCTGAGTACCTGCCCG 

Sequence 1710 

ACTNCTATAGGGCGAATTGGAGCTCCCQ.GCGGTGGCGGCCGAGGTACACCAGAACTTAAA 

GTATMTMTAAAAAATAA AGTATT ACAGMCATGGTCAATTATATTAAAAAGTAGATTA 

ATTMTGTCCATATATACATm^ 

AATTAATATTTTGTTTA I Mil I ACTTATTAATTTATTGCAGAGATGGGAGTCTTGCTAT 

GTTGCCCAGGCTAGTCTCAAACTCCTGGCCTCAAGTAATCCTCCCACCTCAATCCCCCAA 

AATGCTGGGA TTTCA GGAACATGCCACTGCACTTTATTAGTGTTACCTGTCATAAAGTTA 

ATTACTCATAATTTTAT TTCA GTATATATATATACACACACACACCATACACACCATAGT 

AGMTAAATTMTTATATTTTCTAC AAAAATTATTTTCAATC TTTT 

Sequence 1711 

CCGCGGTGGCGGTGCTGGCGGTCATTCTCGGCTCATTGCTGCAGGTGCTGATCCCCCGAG 

ACTGGCTGTTGCGCCTGTTTGGTCGCGCCGGCCTGGGTTCCACCCTGCGTGGCGGGCTAT 

TCGCCTTGCCGGGGATGATGTGCAGTTGCTGCGCCGCTCCGGTGGCGGCCGGTATGCGTC 

GGCAGAAGGTCTCGGTGGGCGCGGCGTTGGCGTTCTGGATTGCCAACCCGGTACTGAACC 

CGGCGACACTGGTGTTCATGGGCTTTGTGCTGGGCTGGGGCTTTACCGCGCTGCGGTTGG 

TGGCCGGGATCGTGCTGGTAGTGGGTGTTTCGCTAGTGGCGCAACGCATCGC 
Sequence 1712 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGAGATAT 

TTGCTGAACTCATAAGTTTATTTGGGTAACMGAGTCAAAATGATATTGGTTAGCAAATA 

AAMTTCAGGATACATATTCCAAAAATGCTGTTTATGTAATTAATTTGGTGAAGTCTGTC 

AAAAAAACCANCTAGNTGCTTGTGGGCCNNGTTGACTGCTGTGGTAAATTAAGTGATGCC 

CTAGCAAACTAAAGACAGTGTCATGCCTTCCAAAAGGCCCAGAGAGAGCTTCAGGAGAGA 

ATAGGTCCATCCTCATGTAGCCGGGAGAATTGCAGAATGAAATTGCTCCCAGTATTACTA 

GTAAACCTTCAAATTTCATTTACTGGGGCAGAGATAGTTCCGGTTTTAAGGAAATAATAG 

AAAATCATGGTCTGAAAATTCTCAAACCTGAAAG 

Sequence 1713 

ACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCCGGCCGCCCGGGCAGGTACCAAAA 

GMTGTTCATAGCAGCATTATTTGTCATAGTCAAGAGCTGGAAGCAACCCAGTTGTACAT 

CAATGGTAGAATGGATGACTAAATTGTAGTGTATTCTTACAATAAAATACTATAGAGCAA 

TGAAAATAAATAAACTATGGCTTATATGCAAACAACATGGATAAATCTTACCAACAATGT 

TANGACAAATGTAGCCAAACCAAAAAATACACATGTTCTATGAATCCATATATGCAAGGT 

TCACAATAGGTGATAGTAGCTACTCTTTTGGGAAGGAGTGAGGAAAGGACTGAGAGAGTA 

TTGCAGAGGGGGCTNCGGGTGTTTGTAATTTCTATTTCCCAGTGCAGGTGTGGTTTGCAT 

GAGCATGTTCACCTTTGTGGAAATTCA 

Sequence 1714 

TCGACTACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGGCGTCACGCCACCCGAG 
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CCGGACTGGCTGCTGATGGATTATGCCGACCTGGTCAGTTC6GAAGCCTTCCGCGGCACC 

GCCTTTGCCGCCGCCGTGCGCCAGCAGCTCGACACGCTCGAGCGCGAGCACCCGGGGCGC 

GGCGTGGAGCAAGCGATCGACATGCTGCTGNCCGTGTTTGGC(3CGGCGCTGCAAAACCAG 

GGCGGCACGGCCCTGTTCCTCGATGGACGCCTGCTGACGGAATTCCTGGAAACACCGACC 
ACCA 

Sequence 1715 

TGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCACGGTACACCCTAGCTCCAGCTTCCCCTG 

GGAGACTGTGCATCTCCTGGCTCCACTAACACCACCTTCTTCTGACCTTCCAGCCTAGAG 

ATGATGACTCTGCCAGCCTAGATGGGCTCTGGGTTGTCTCCCTATTCCTGNTTGCTTTGN 

AGMTNTCCCATTATGCTGNCACCAACTCCCCAGCCTAAGCCCTCTCTATTTTAAATTCT 

CAAG TGGAT TATGTTCCTGATTAGTCCCTGACTGATATACCACTCTCCTCATGATCTCTG 

ATTAGTTTTCCTGTTAGGTTGTTGCANTCNAAAAAAAAAAAAAAAAAAAAGTC 

Sequence 1716 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCCGAGCGGCCGCCCGGGCAGGTACCTCTCCC 

CTGACCTCCTTTGG CTAAA GTGCCTTTCAGATTTTATTACTATTGTTAACATTTCCAACA 

GCCATTTTAACATTATTTTTACAAATTTGATGTGAATATTCCATATTAGAAA^^ 

CAGAAGACAAAGAATTTGTTTGTGAAAATGCTAAGGACTGAAGCTGACAGAMTAACATT 

CCTGCTTACAATGAACTTTTGCACGTATAGTAGAAAACCAGGAGCGTCCTCAATGTGCTC 

CTTCACCCTATTGGCTGCATCTCCATGAGCCCCTGAGAAGGGAAATTAACACTACCTGCA 

GAGGAGACAAGAGGCACCTGAGGACCCAGGAGGACCCACAATAAGTTGCTAAAATGAAAC 

ATCTGTTACATTTGTGTTTAATTAGTAATATGTCTT 

Sequence 1717 

NATGCTCTCCGCACCAAGCCACAGATGTGGAGGAGGCAGGTAGGGGGTCAAAGAGGGGTG 
GTATCGGTTATTCAGGAC I I I I I l I I I I I I 1 I CTTAAATATCCTGTGCTTCTTTCAATCA 
TT TGAA GGTAAAACCAGGTCCTGTGAGTGGTAAACTGATTTTTGGTTCTTATGAGGGTAT 
TATTTTCTGTGTAGATATTTGTTAACTTGGTGTCCTTGCAGG GTAAACG ATCATC N AANC 
TTTNTGTTCC ACC ATNTTGCTCCTCTCATTTTAGACCTAATATATTGACTATAAN N GTTT 
CCCTCTGGAAGTATTGACTATTNTTTTGCATCAATACTAAAGAAAATAAAATATAATNAT 
AGAGACTNAAACATATTAATATATATATAATGGGTGTTAAACTAMTGGNTAATTTGAA 
Sequence 1718 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTACAGGG 

GACCCGCCAGGGGCCTCGAGAATCGGTATCCTGAGTCCTCCTGAAGAGCAGTAGAGGTTG 

TTTCATTA AGTGC AAACACATTGTTCTTAATTTGAAAACTGTGGGCAGAAACAGAAGCCC 

GAGACTAATTTTTCCATTGCTMCTCTAGATTCTCGGCCACTGGAGTCTGAAGATACTCT 

CTTTGAGAATGCATATTATTTTGCTCACAGCTAAMCATTTAAGTATCATAGCTGATCAG 

TGGAGTGAGATTAAAAGGTTTCTTTTTTGAATCATCAGCTAGAGATGTACCT 

Sequence 1719 

CCGCGGTGGCGGCCGCCCGGGCAGGTACACACTCTGGATAATTAACGGCAGAAAGATAAA 

CTGTATTTCTGCACATCTTCACTGCACTGCGAGCCCCAACCACAGCACACACCTGCACCC 

TCCCTGCCTTCTCACATAAAGCCCAGAGGGAACACCGGGCACCTTGGTGTCTTCTCAGAT 

GCCCCTGGAGCGTITCACACCAATGCAATAACAGAAGTTTTGATTTTCCAGATAAACAAA 

TGGATTGTTTACAGCTACAGAAGCCCCCTTCAGAGACTGCCAAATTTCTGTCCTCATCCA 

CCAACCTGCCTAGAAGAGAGAAGCTAGTGCCCTCTGCAAAACCTCCCACACTAGGGGGCT 

GGTATAAGGGATGAAAACTAGGGGGAGTCTCTTTAAGGGTGATCTTTCTGCCAAAATGAC 

TTTACTTTTCCCTGAACCATAATC 

Sequence 1720 

GAGCTCCCCGCGGNGGCGGCCCGAGGTACACTTTCAAGACCAAAATGTAAATTTACAAAT 
GTGTCTCAGTAAGAAAAGTTACAGAAGCTGGGCCGGGCGCAGTGGCTCATGCCTGTAATC 
CCAGCACTTTGGGAGGCTGAGGTGGGTGGATCACTTGAGGTCAGGAGTTCGAAACCAGCC 
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TGGCCAACGTGGNGAAAACCCTGTCTCTACTAAAAAATACAAAAACTAGCCCGGGAGTGG 

TGGCGCACACCTCTAATCCCAGCTACTCAAGAGGCTGAGGTAGGAGAATCGCTGGAACCC 

GGGAGGCGGAGGCTGCAGTGAGCCAAGATCGTGCCACTGCACTCCAGCCTGGGTGACAGA 

GCAAGACTCTGGTAATACAGTCAAGCTGCTCCTGTCAGATGTATACTTCCTGAATGAGCA 

CTATAATGGATGCTGCTCAGGTTCATCTATTCAGTAAGGTTGGAGCCTACTATATGCCAG 

GCAC 

Sequence 1721 

ACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACCAGAACTTAAA 

GTATMTMTAAAAAATAAAGTATTACAGAACATGGTCAATTATATTAAAAAGTAGATTA 

ATTMTGTCCATATATACATTTTTTGGTGGAAATCAAACATMTGAAMCTATTTCACAT 

AATTMTAITTTGTTTATTTTTTACTTATTAATTNATTGCAGAGATG^ 

TGTTGCCCAGGCTAGTCTCAAACTCCTGGCCTCAAGTAATCCTCCCACCTCAATCCCCCA 

AAATGCTGGGATTTCAGGAACATGCCACTGCACTTTATTAGTGTTACCTGTCATAAAGTT 

AATTACTCATAATTTTATTTCAGNATATATATATACACACACACACCATACACACCATAG 

TAGAATAAATTAATTATATTrrCTACAAAAATTATTTTCAATC II I I I I 

Sequence 1722 

ACTNCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCTGATTAGTTNAA 

TATAAAGACTCCGTAATTTTTACAATTTTAACAATAATTTTATTTCTTC 

NTNGGGATTGNATTAAAACTACAGTGTGTGACTTANAAAATGATAATGCTGCTTTATGGA 

AMTGGATTATAGGTGGGTAAGACTTTCNTTTGCAAAAATTTGGGNAATTACCCATCAGT 

NGTTAGGAACCCAGNTGAAGTCTANANGACAGATGATAGTATCTTATACTAGGTTGGGCT 

0 

Sequence 1723 

CACTACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGTCAGCCATTCGGCGAACTG 

CGCTTCGGTCTCTTGCGAAATGGCATCGTAGCGCACGCACTGGAAATACGTGCCGTCGGC 

CGGCAGCAGGCTGAAGCGGCTGCCCGCCAGGCCGTCGCGGAACAGGTCGCGCTTGCGCTG 

GTAAAACGCGGGCAAAGTCCAAGGTAGGGCTTGCGGCTTGGCCATGTAGCCCGGCAATGC 

CGTGCTGCATCGGCGTGTTGACGGTAAACACGTTGTACTGGTGCACCTTGCGAAATTCCG 

CCGTCAGGGCTGCGGGCGCCGCTACATAGCCAACCTTCCAGCCCGTCTTTGCGCAATGGT 

GAGTAGTTGACCCATGGACGTATGCTAGAAGCCGCATGGATGCAGGCTTTCAGATGTTTG 

ATCAATGTCCATGGGAAAGTTCCCGATTTGACGCTCTT 

Sequence 1724 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCACATTTTCTTGATTCC 

TACAATCTATTCTTCCTACAACAGCAACAGTAATTGTTTGAAATTATACATAAGACTTCA 

TCACTATTTTGTTCAAAATTCTTAGGATTCTTCCCAATACTTTTAGGATAG 

GCTTTATCATTGACTGCAAGGGCCCTGTGTCCTCTGAACCTGCATACCTATCCAAGTTTA 

TCTCCAACACTATCCCACTGTATTACCCTACTCTGTCTACAAAGTCTTTTCTGTTCTTTG 

AACAAACAAAATTTATTCTTACCTGAGGGTCTTGCATCCACACTCAGACCTTACTTGGTT 

TTAGTTTTCAGAAAGGCCACCAAGGGACCATCCTATTTAAAACAGTCCCCATGGTTAGAT 

CCCACCTCCGTCACCTTTTCCTAGTGAGTTTTCACCA 

Sequence 1725 

GTNGGAGCGGCCGCCCGGGCAGGTTCTGAGTCAAGGACTTCTTTAACGTCATNNACGGCT 
CCTAGGACAGCCACAGAAAAAAATGGGGTAAAGGGGTAGGAAGAAAAAAGGAAGGGAAAG 
AGAGCCAGGGAATTGGGGAAGGAGAGGTAGGGGAAGGGGAAGGGG 
Sequence 1726 

CCGCGGTGGCGGCCGCCCGGGCAGGTACATGCCAGGCACAAGCTGGAGTCACAGATGCCA 
CACTGACTATCACATTGTCATCATACTCAGAGAGAGAAAGGTGTATACACCCAGTATTTT 
CTTATGTTTTCTCTATACTTCCATATCCCCACCCCCATTATCCCCCGAGACATTTCCCAG 
TTGGGAAGTCAGTTGCTTATGGGAAACTGGCTGCAGGGGGATCAGCCCACCTGTAACACA 
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CGGAAAGGCAGAGAAACAATGAGTCATGGATTTGAGTGCAAACAAGCCCAGTCCTGCTAC 
AGAGAGTTACGGA AAAAT CTCACTGTTGAAGAGAGACATCTATAAGAGAGGTGAGGAAAA 

TGACTGATAGATCCTTTTACAATATTCTAGGGGTAGTCTGTCTGTGGGAGGGGGCAGGAG 

TCTATGGGGAACTCTCT 

Sequence 1727 

CCGCGGTGGCGGCCGAGGTACGCGGGGGATTTTTAGTAAAGTTTTGCCATGTTGGCCAGG 

CTGATCTTGAACTCCTGCCCTCAAGTGATCTGCCCACCTTGACCTCCCAAAGTGCTGGGA 

TTACAGGTGTGAGCTACTGTGCTCAGCTCAGCCATGGAACAATCTTGATGGATAAATTTA 

ATCTGCAGGTATCTCCATCTCTATTTGCCACTCTTAGT.\AATGTTTTAGTTTGACAGTAA 

ACGAGTTGAGACATTAATGATTGAGAGAGCACCTAAGCAGCTCGAAGTGCTGGGAGTCTC 

TCCTAAAGAATTAGATGCACACAGCATTTTTCCCTTACAGACACTTCACTGAAGCAGTCA 

TGTTGGACTTGGGTGGCGTCTGCTATTAGACACACTATTCACCAATAAAACCCTGAAAAT 

TGCATTGCACAG 

Sequence 1728 

CCGCGGTGGCGGCCG AGGTACTATTAAATTTAATGAG AGGTAACCCATGCACC G GCAGG A 

AGTTCAAGGGGACAGAAGGGGCTTGCTCCACTCTGGAGCCCTATCTCCTCTCACCTCCTT 

CCTTCTAGTTTGACATGCGATCTTGTCTCTCTCTTTTATAAAGTGCTTCCAGGCCCCTTT 

GGATTAGGATGCTGTCCTAAATCCTTCCTAAAGAGAGGTTTTGGAGCCACCAACACTGCT 

TCACTGTGGAATAMCTTTCCTTATTAAGATTCTTCATTCACTCATAATCATCTCCTTTT 

TCCTCACTGCTCATTCATTTATGGAGATAGCACCCATATCATAGCTACTTGGTCAAATCA 

TGGTGATACTCCCCACTTAGGAGAGGAAGATGACTAGGTTATAAGCTCTTGCCTTTAGGA 

AATTAAAAACCCTGTGGGT 

Sequence 1729 

CCGCGGTGGCGGCTGGATTCTTGACATTCATTAATATCTTGACAACTCAGCCCATCAGAG 

GTTCTTCGAAAGCCACTTCCACAACGGACCACACAGCGATAAGATCCAATCGTATTGTCA 

CAGTCCTGACCAGCGTGGCAGGTATGCCTACCCAAAGCACACTCATCAATATCTTNGACA 

AGTTCTTCCATCTGCAGCTATGGTGAGGCCTTTAGGGCAGGAGCAGTAGTAAGTCCCCAT 

GGCATTGTGGCAGCTATGGGAGCAGGGATTCCCTGCTGCACATTCATCCTCATCAGCACA 

AAAAGGTCCAACTGAGTCTAAGGTAAACCCGGAGGGGCACTGATTCTGGCGATCTCCTTT 

GGATATTGAAGCATTGAATTTT 

Sequence 1730 

TTCACTCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAGGTTCAAATA 

GTCAGCAGCTCATCATAATCAATGAGCGAGGACATAAAGTAGGAAAAATGCATCACCATG 

GTGAGCMGGAAAGCAAGTTATTGGAGGCACATGTTAACACATAAAATATAAAATTAATA 

TGATCACACTGGAAAGGCTTGCCTGAGCCCACAGTTTGAATGCCTACAATAAGATGAGAT 

GCACAACANAAAGCAAGAGAACCTGATCAAGTGGGTGACCTGGCCATGGGTGCTCTCATC 

AGTGGGGGGACCCAAATGCTTATGTGGGACTCACCAGGTATCGAATTTANCCATTGATTA 

GGGAGTGGTTTTGGTTGGTGGNTGGGCNAGGAAAACTTTTNTTAATTCCAAATTGGAAAT 

ACCAATTNGAAAACCTTTTAAAAAAAATTAATTTGNTNA 

GGCCCAAATTNGTCAATNTTTTGGCCGTTACCCTTGCCC 

Sequence 1731 

CTTAGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGTTTCTTTAA 

AAAGATTTGTATCTTGCTAGGTAATTAAACAATGTTTACAATTMTATATTTA 

ATAGAGACAGGGTCTTGCTGTATAGCCCAGGCATTCTTGAACTCCTGGCCTCAAGCAATC 

TTCTTGCCTTGGCCTTTCCAAAGTGCTAGGATTACAGGGTGTGAGCCAGTGTGCCCGGCC 

CAATTAATACCATCTAAGTCATTGTCTAAGGAACTTTTCGTTTCTGTTTGTTGTGGGGCC 

AGACTTTGGGAAGAAAGTATGTTTGATAAGAATTTATCAGATAACCC l I II II I I I I I I G 

TAATATCTCTTTGGGATAGCATTGTCTTCCTCTGCTTTGGGGAAGTCTTTTTGGTTACTC 

ATTACTCAAAATTTGCCAAATTTAGGGGTGGGTGGTGAATTTTTGGC 
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Sequence 1732 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGATTTTTA 

GTAAAGTTTTGCCATGTTGGCCAGGCTGATCTTGAACTCCTGCCCTCAAGTGATCTGCCC 

ACCTTGACCTCCCAAAGTGCTGGGATTACAGGTGTGAGCTACTGTGCTCAGCTCAGCCAT 

GGMCMTCTTGATGGNTTMTTMTCTGCAGGTATCTCCATCTCTATTTGCCACTCTTA 

GTAMTGTTTTAGTTTGACAGTAMCGAGTTGAGACATTAATGATTGAGAGAGCACCTAA 

GCAGCTCGAAGTGCTGGGAGTCTCTCCTAAAGMTTMGATGCACACAGCATTTTTCCCT 

TACAGACACTTCACTGAGCAGTCATGTTGGACTTGGGTGGCCGTCTGCTATTAGACACAC 

TATTCACCAATAAAAACCCCTGAAAATTGCATTGCACAGGACTG 

Sequence 1733 

CCGGGCAGGTACATGAAGGAGGGCACTANGGGGAGCTAANCAATTTGTGAGTTAGGTAGC 
CAAGATTCGTTGAGGTTGGAAGAATGAGGAAAGAAAGGGGATCTGTGGCAAAAATATGCT 

Sequence 1734 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTCATACTACTTTAAGT 

TACCTCCTATTGGGGGCATTATAAATGGGTAAGCAGAGAACATCATGGAAAGACATGGAG 

CTTAGTATATGCAMTGTTGAGTTACTCAGTGTMTGTGTGAAAAAGGAGTTTCATAAGT 

TTCGGTCAGGGAAAGAAAGGCAGGGTCAAAAATTTCTGCTTGAGAAGTTTTGGGGAGCTT 

GGGGAGACTTTAAACAGGGAGCAACACAGCGCCTCTGTACCTGCCCG 

Sequence 1735 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTCATACTACTTTAA 

GTTACCTCCTATTGGGGGCATTATAAATGGGTAAGCAGAGAACATCATGGAAAGACATGG 

AGCTTAGTATATGCAMTGTTGAGTTACTCAGTGTAATGTGTGAAAAAGGAGTTTCATAA 

GTTTCGGTCAGGGMGAAGGCAGGGGTCAAAAATTTTCTTGCTTGAGAAGTTTTNGGGAG 

CTTGGGGAAGACTTTAAACAGGGAGCAACACAGCCGCCTCTGTACCTGCCCG 

Sequence 1736 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGACAGTTTTGAGCACCTGCCTT 

GTGCCTCMTGAATACTTGTTGGGTGATGAATGATATAAAATTCATCTTGACATTCAGCT 

TAGGTTTGGAAGACAGTGTATGATTTCTTTACCATGGCCTTGACTCATCTTCTGAATGGT 

CAGGCTAAGTGCTGCCTCCTTCTCTGTGTAGCCTATTCAGGCACCTGCTCACCTACCACA 

ACCTGCCAGGCCCCTGCAGCTCTGGAAGCATGCTATGTGAGAAATAATCCTAATATCCAA 

GTCACAGACAGTGGGTGGGCTTGGAAGTGCAGGGCCTGTGATCCCTTTGTGGAGTAGCTG 

AATTTAAGAAGTTGACAGCAATTGTGTTCTATAATGAGAATGGTCCTGATCATCATGGAG 

GAATAATAGCTGCCTTTTATTGAGAAACTGTCAGGTACCTGCCCGG 

Sequence 1737 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGATCAAACATCTTGT 

TATACTTTGTAAACATTTTCTTTTTTTAAACCCCAGTTCCAGCCGGACGCCCCCA 

CTGAGGTTCGAGGAGGTGGTGGTTTTCATTTGGGGCTTTGCATATTTGGTTGTTAGGTTT 

TGCGAGANCCTTTTTNATTTNGCCCAGACGTCTTNATGCCGGNGGTGAAANTTNCACTTC 

GGGCNCCCCTCCCCTGGAGNTCTTTTGTGTGCCGCNGGAANTTCAGTGGAAGATCCGGTT 

TACTNAGCGATATAGGAGGGATNTATANCTNNNNAN7TCNNNNNTTAANTNTATTTGTTC 

CTTGCCCGGNGNCGNGCACNCNTNNACTGCCCGCTTT 

Sequence 1738 

AATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGATCAAACATCTTGTTATACTTT 

GTAAACATTTTCTTTTTTTAAACCCCAGTTCCAGCCGGACGCCCCCAGACCTCTGAGGTT 

CGAGGAGGT.GGTGGTTTTCATTTGGGGCTTTGCATATTTGGTTGTTAGGTTTTGCGAG 

CCTTCTTTAI I I I IGCCAGACGTCTCAATGCGGGGTGAAGTCCACTCGGCCCCCTCCCCT 

GAGTCTTCCGTGTGCGCGGAATTTCGAGGAGATCCGGTTACTAAGGATATAGAGGAAAAA 

AAAAAANTTAAAAATTAAAAATTAAAGGTACCTGCC 
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Sequence 1739 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGTATGATGGAAG 

GAAGAACTTGTGTGCTTAGACCTGACGCTGGGAGGAGATGCTGCCACCTAGGTTACTTGT 

AGGACCCTATACGGCMCCTCCTTTGCCAGGMCTATTTATAAACATCCTGCAGGAAAAT 

GCAGTGAAGTAGAAGAAGACAGGGGATATCCCAGAAGGTTATGCAAAACATCAAGAGAAG 

ATGAGAGGAGTCTATATGTCAGAATACACATTTCCCAC 

Sequence 1740 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAACATACAGTTAAGCTA 

CTTAAGTTGTAGTGACmGGACTTAATTATCTTATAGGATAAMTAAAAACTTTAATCC 

TAAACATTAGCCATTGCATTAAATGCTTTAAAGGAACMCTGGCTTTGAGCTeTGTTCAT 

CCATGCAAGGATTATACMTCTCAGTGCTCTAGCTAAATTGCTAATGTTTTATTACTAAT 

GCACTGGTMGCCTCTCATAGMGATGGGTCTCTTGGATTTACACTGAMTATMCTTTT 

CTCTTCCATATTATTGGAAGACCTTATTCTTCAGGCAGGAGCTTAGAAAGTCTAAATTAT 

GCACCAGAGCCTTAATTATCTGATGGCAAAAACCACGATACTCTGGTCACTACTACCAAT 

TGACAATNAGAGGACCCCAATTAAAAACTCTAGAAATATTGAGTCCTGAATACCAG 

Sequence 1741 

GGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGAAAGATGATATACATGAAAATGA 

CTTGAAAATTCGTTTCTCTAC I I I I I ) 1 1 I AATGTAACACAAAGTAAATGAGAATCTGAG 

TATTTTATCAAGTTGTTTCAATGMGACAGAGTTGATTCAGAAGGTAATTCCCAGATTGG 

ATGGTATGAGTTTCCGGCAAGATTAGAGAAAGAGTTTTTCTATGTAATAACTGATATTTG 

TTCACATGCTTCCTCTTGATGGCTCATCAGGATTTTAGAGATAGAAAGGACATNTTATCC 

AACCGCTrrCATTTTCTTAAAGAAATTGGGGACCGGGCGCAGTGACTCATGCCTGTATTT 

CCAGCACTTCGGGAGGCCGAGGCAGGTGGATCCTAGGTCAGGAGTTCAAGACCAGCCTGG 

GTCAAGAATGATGAAACCCCCGTCTCTACCAAAAAAA 

Sequence 1742 

GGCGGCCGCCCGGGCAGGTACAGTTACAACTGACTTGCATAACACTAAAGAAAACATTGT 

AATTCCTACGCCAGTAGTGGGGGTAAACTGAGAACCAACCTAATTTTCACCACAGGGCCC 

TCTGGAAATAGAGGAACTAAAATGATGACTGNTTGAAAGTGAAAGAGGGCAGATTCCTAA 

GAACATCACTCCTACTGCATTCCTTACCTCCACTCCATACCCTTTCCTTCTTCACTCAGG 

TAGGACTGTCTCTGGAATGCATGACAGCATGCACAGTTCAGAGCTTGGATGTGATCCTGG 

AAATGGGCTTAGATGACCTTATGAACACTGAATGCTGATTGCAGACACTCTCAACCTTCT 

TCCTCTAGTTGACTTCTATGGCTGTGCCAGCCAGTCCTAGACCCTTCTAGAAGAACACTA 

GGATGATACATTTC 

Sequence 1743 

AGGTACTTACAGTTTGACTTTGAGTCAGCTCAGCATTCTAAATCAAACGCAAACAGCAGA 

ATCATATGGCATANACACTTAGCAAATCCAATGCCTTCCAGGCATCTTGTTTCCTGTTGA 

TGAAATCCTCCCTATGGAGAGCAAACTGGTTCATATCTTCANATANTGTCATTCAACCCC 

TTGGGCAGCCTTrrCTTGGGGCTAACTAAAATATTNCGGCACCCCGGTTTCTTGGGTTGG 

GGACCATTTTTCAACCCCCNAAGGTTTGNTTTCANTTTTTATTTTMG 

GGGCCTTTCCNGTTGGGGAAGNAAAAATTTMTTCCTGGCCNCTTTTCCTTTC 

GTNATTTTGGANTCTTCTNCAATTTTGGGNGGCCNGCCCAATTGGGNTTGGAAAANAAAA 

TAATTTCCGGCCCCCTTGGGTTATNCATTAACCCCCCCCCGGCCGGTTTAACCCCCTTGG 

CCCCCCCGNGGGGNCCGGGGTCCCCGGGCTTTCTTAAGNAAAAACNTTAATGNTTNGNGN 

AATTCCCCCCCCCCGGGGGNCNTTGGCNANGGGAAAATTTTCNNGAANTNATTCCAAAAA 

GGCCTTTTATTTCNGAATANANCCCNGTTCCGNACCCCCTTTCGGNAGGGGGGGGGGGGG 

0 

Sequence 1744 

ACAAGGTCTCACTATGTTGCTCAGGCTGATCTTGAACTCCTGGGCTCAAGGGATCCTCCT 
CGCTCGGCCCCCCCAGAGTGCTGGGATTACAGGTGTAAGCCACCACACCCAGCCAACATT 
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CTTAI I I I rATGCTTCCTTTGTAAATATAGTGGAGAAAAAAAATCACTGTATTAGGATTT 

CTCACCAAGGGCMGATAGTCATATGAMTAGTCCTTAGCTTCTTTAGTTGCTTGAATAT 

TACCTCCCTGGTCTGTGAGGTATCTTTTTGCTTCANATCATTTCCTCAAAGAAACACATT 

ATTGGTTAATATNATTGGTCACCTTTTGAAGTTGGGTCCTATATGCTGATTCCGGGTAGC 
CCCAGACTTTCAA 

Sequence 1745 

AGGTACAGTGGCGCCATCTCTGCTCACTGCAAGCTCCNTTCTCCCAGGTTCATGCCATTC 

TCCTG CCTCAGCCTCCCAAGTAACTGGGACTACAGCCGCCAGCCACCATGCCTGGCTAAT 

TTTTTGTATCTTTGGTAGGGACGGGGTTTCTCCATGTTAGCCAGGATGGTCTAGAGCTCC 

CMCTTCAGGTGAGCCACCATGCCTGGCAGGACTTTTTTTGATGTMGCAGAGCCCCTTT 

TGCCAGCTTCCTTGGGCTTATTCCCCCATAGATGGCAGCTGGGATCCCAGCCTCTTGCCG 

GTTGATATTGTCATTCTTGGAGGGGGCCCAGGTTGGGGCTAATGCTCGTGTGACTAGAGC 

TTGCTGGTGTTGAGGGTGGTTCATCTGMGTTCATGACCACTAGGCTTTGATCTAAAAAA 

A 

Sequence 1746 

ATACTTAGGGCAATTGGAGCTCCCCGCGGTGGCGGCCGGCTTTGCTCCAGGCCTCAGGAG 

TGTAAGTTTAAGGGGCGCGAACCTGGGTGGGACGGGGCAGTTTTTCCAGCGGGCCATACG 

GGAAAAl 1 1 1 \ G GTTCGAGGAAGACAACCACCCTTTTTAAGGAG AAAACTGCATCTTGCC 

CTGCGTTATTCCTACGCGGTGCCCAGGTGGGGTGTGTGTGGACCAAGTCAATGACCGCCC 

AAGCTCTCCGAGTAGAAMCCCAAACATGGTTTTGTGGGGTGTGTGCCTTTGACCCCGGA 

CTTAAGCAAAAAGCGTGGTCTTGGGCGTAGCTACNAGGTGGGTTGGTGGGGCTTCCAAGG 

CCCTGCGTGCCCCTCGACGTTGAACCGATGATTTGGGCGGGTGCCAAGGATAAACAAGAT 

TCGCCCCTTCTTTNGCCCACTTCAATTGGGCAAGGGGGGACCTTTGCTTACTTTTCCCG 

Sequence 1747 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAGCTGGACAGGGATATAAATCAGTGAACTCTG 
AGAACACTTTTAATCCAAGCACCAAATAAGGCAGCATTTCCTGTGACTGCTGGACGCCAC 
GGTTTAAGTGAGATGCCAAACATTCCTCATTTGGGAAAATGCGCAATAGTCCACAGAGAG 
AACAAAATCAAAATGTGAAAAGTTGAATAAAAAACACTC 

CAGCATAAGTTAGCTGACCTCATCTTTGGGACTAGAAAAATAACAGTAAATAGTAATAAT 

AGCTCCCGCGTACCT 

Sequence 1748 

AGGTACATCTCTAGCTGATGATTCAAAAAAGAAACCTTTTAATCTCACTCCACTGATCAG 

CTATGATACTTAAATGTTTTAGCTGTGAGCAAAATAATAtGCATTCTCAAAGAGAGTATC 

TTCAGACTCCAGTGGCCGAGAATCTAGAGTTAGCAATGGAAAAATTAGTCTCGGGCTTCT 

GTTTCTGCCCACAGTTTTCAAATTMGAACAATGTGTTTGCACTTAATGAMCAACCTC^ 

ACTGCTCTTCAAGAGGACTCAGGATACCGATTCTCGAGGCCCCTGGGCGGTCCCCTGTAA 

GTACCTGCCCG 

Sequence 1749 

AGGTACATCTNTAGCTGATGATTCAAAAAANAAACCNTTTAANTTTANTTCNNTNNNCNA 

NTTTTAANCCTAAAATGGTTTAACCTGTGAACCAAAATAATATGCATTCTNAAAGAAGAG 

TATCTTCAGACTCCAGTGGGCCCGAGAATCTAGAGTTAAGCAATGGGAAAAATTAAGTCT 

CGGGCTTCTGTTTCTGCCCACAGTTTTCAAATTAAGAACAATGTGTTTGCACTTAATGAA 

ACAACCTCTACTGCTCTTCAAGAGGACTCAGGATACCGATTCTCGAGGCCCCTGGGCGGT 

CCCCTGTAAGTACCTGCCCGG 

Sequence 1750 

TNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGCAGATAACTTATC 
CAAGGACAAATAGAAAGGGGACTGTGTCTAGAGGGTAATGCTAATGCCTGGCATGCTATA 
AAAATAMTGAAATCTGATTAACACTATTGGAGGGAAGAATAAGTCATCGTTTCTATGGA 
ATATTArTTTCCTGATrGGAACCATTrCCATTTTCGTCAGCTTAAAACATT 
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ATGAGATGACTTTAAAACGGTGACAGCTATCTCAAGCCCTGTGATTATAAATGGTGTTAT 

AGGCTTGTAATCCTATGG GTTT AGACTAAAGTATTTTCTTCATAAATTGAAAGAGAAACA 

CACTAGGMTTCAGAAGATTTTAATTAAGTTGTTCAAAAMTTTMTGGGM 

GCTGGCTCACTAAGTAGTTTTGATAATGAATTTAATTTATAGAAGAAAGTTCTTGGC 
Sequence 1751 

CACTNCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAAAG 

MTGTTCATAGCAGCATTATTTGTCATAGTCAAGAGCTGGAAGCAACCCAGTTGTGCATC 

MTGGTAGAATGGATGACTAAATTGTAGTGTATTCTTACAATAAAATACTATAGAGCAAT 

G AAAATAAATAAAC TATGGC TATATTCC AC CAAC ATG G AT AAATCTTACC AACAATGTTA 

GACAAATGTAGCCAAACCAAAAAATACACATGTTCTATGAATCCATATATGCAAGGTTCA 

CAATAGGTGATAGTAGCTACTCTTTTGGGAAGGAGTGAGGAAAGGACTGAGAGAGTATTG 

CAGAGGGGGCTCCGGGTGTTTGTMTTTCTATTTCCCAGTGCAGGTGTGGTTTGCATGAG 

CATGTTCACTTTGTGGAAATTCATTGAGCTGTAACTAAATATACAGAATTATTTATGAAT 

GAAATG 

Sequence 1752 

ACCACTGCTTTCCCGGACTCTGCGTTGTTACCACTGCTTCCCGGGACTCTGCGTTGTTAC 

CACTGCTTCCCGGGACTCTGCGTTGTTACCACTGCTTCCCGCGTCCT 

Sequence 1753 

CAATCTGTTCTGCTCACCAGCGAGTTGTTCGAGTTCATCGAAGACGGACACGGCGGCCAC 
CTGCTGAGAGTGGGGACCTCTAAATTTCCGGTCGGCACGCTGCGGTTCACCGCCCAT 
Sequence 1754 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCACCACTGTTCCCCAG 

CCTGCCAGATAAAAATAAACTGTCCACTCCAGGAGGGGAGAGGCACAAGCTGCTCTGATA 

GAAACCGTTCCTGAGCATTATCTTGACTGTTTGCAGAGGAGACATGTGTTCCCAGCCAGC 

ACAGAGCTGCAGGATTGCCTTTTGCGATTTGTGAGAATTATGTCATAGTGAAATATAGTT 

GAGAATGCTGGGCAGTACCTGCCCG 

Sequence 1755 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGTTCGCTGCTTTT 

CTTGAGTTCATTTGTCTTTCCAATGATCGTATATGATTGAATAGTAGAGAAGGCACCTGC 

AAGTAGAAGCCAATGGCAAAAGATTGATAAGCATCTTTCCTGGACTCTGGTGCTTCGGGG 

GTGCCAATAACTGCACTCTGAGTMTCCTGGGTCACTTTCATCMTTGAGAGAGTTTAAG 

TCAGGGGACTTATGATTTCTTTCTCAACAAGACCAGGTGAACGTGTTTTTGCAGTTCTAA 

GTATTATTTGCAAATTAAATAGCCATAT ATCTCTAAAGCTTCCTGAGCAACAACCAGAGG 

TACCTGCCCG 

Sequence 1756 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGTTATGGCTGAGTGTTT 

CAATTAAAAAATACATTGCATGACAATTTTCTGTTTCACAAAGAGAMCAGCATTTT^ 

AAAATTGAATTCTCCTGTGTMCCTGGTAGAGTTTTCTATTCCTGAATATACATCCTTTC 

ATAGATATTACTAGCAGGCTAAACATCTGGCATTTAAATGTCATTACTATAAATAAACCA 

CAAAATTGGCTGCCTGATTTGAAGGGTCAATCTTATGGAGTAACTGAAGMGAATCATTT 

TTAMGGAGATGAGCAGTTTTGGATTTATGATACCAATAATTTCTCTCTTTAAGGAACTC 

CMCTCACTT MTTTA ATGTTAAGAATAAAATCAGACTAATTAGTTCTCAGGACAGGTCA 

ATGGCACACCATTTTCCTTTTCACTTTGGCAGAAACCTAGGAGTCACTT 

Sequence 1757 

CTATCACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGC 

GGGTGAGATTCTGCTGGGTGAAGCCTAAGGGCATAGATGAAAAGTTAGGATTTGGAAGGG 

AGCCAGAAGCCAAAAGCAGTTCCCAGAGGCCAGGCAAAAACAGAAGCTGAGTGTGGGACT 

GAAGCCAAAAAGCGGAAGATGATGAATTCCACAGCGAACCGTGGGAGCCGTCCTGGAACA 

ATGCCTGAGATGCGCGCTGGCTTTCTGGGAGCAGTTAGGGCCCCTTTAGGTATGTGAACC 
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CGCCTCACTAAATGGCCATGAGCAGAACTGAACTGCCTACCTGTTTCTCCACCTGTGCAA 
GACCAAGTTTTCTTAAGAAN CCCAGACG AAN GCTTCCTTTGAAAATNATACCGTTGCCAT 

TTGGCTTGGCCCCTTTGGCCTTAMGCCTTMCAGGTTTANGGACTTTTGGGCCCCTGGG 
TTAAATTT 

Sequence 1758 

TACGACTCCTATAGGGCNNATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACTTATAGT 

TGAGAGCCAAGTCTCCCTTATNATTGGTGAATGAGAATGAGCTACTGAAAACAAAAAGAG 

GGTCTTCTACTCAGCCTCTACCCCTAATATTTATATCAGAAGCAGAGATTAACTGTCCTT 

ACTCATTCACACGTTAATGGAAGAAGAAGGGAAGTTTCCTAGAAAAATCCTCCCGCTCCA 

CCCTGCAAACTTTATGCTTTTCTGTTACATAATCAGGCAGGGACMGACCTAAACTATTT 

TGAATTGGTGGTGTTGAGGCTAAATTCTCTGCTATTGACAGAATTGAGAATGTGATCAAT 

TTCAGAGTAGCATGTTACAAATTTTGTCCCAATTTCAATGGGGAGAATTATAACAAATCA 

GTAGTGGTTGGGAAGAATCTGTTCTTACTTTCCTCCCATCCAGTTGACTAGGGATTTGGG 

AAAG 

Sequence 1759 

TGGGCCGC I I I I I CCGCTTTCTCGCTTAACTGACTTCGCTGGCGGCTTCGGGTCGTTCNG 
GCTTGCNGGCGAGCGGGTATCAGCTTCACTCAAAAGGGCGGGTAATACCGGTTATCCACA 
GAATCAN 
Sequence 1760 

CTNCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAGGATGAAATGAGT 

GATTCTGTCCTTAACAAACTCCTAGTCTCGTGGGGTGGACAGCTGGGAACTCATTTAGAG 

AGATTACAAGGATGAATACCCTTGGGCTCGAtAGAGGTAGAAGCAAAGTGCTTAGTGTTA 

GAGAGCGTTTCTTGAGGAGGGAAAATCAGAATCAGTCTGGGTTTTAGGAAGATAATTCTG 

GAGGTTGAGTGAAAGATGAATGGAGAGGTGATAGACTAGCTTGGGCATTATGGAGATGGA 

AGGAGAACGGAAGGGGCAGATAGGACAACTAATTGGAGGCCGGAGTGAGAGGAAGGGGCC 

ATAGATGACTTCATGATTTCTGTCCTGGTTGCAATAAAGGGATCATGATCGTAAAGATCA 

TTCCCATTTGACAAAGGGAAGA 

Sequence 1761 

GACTACTTAGGGCGAATTGGAGCTCACCGCGGTGGCGGCCGGGGCGGTATCGAGAGTGTG 

CAGCGTCAGACGGGACATGGGGTACCTTGCTAAGTTGGTAAGTCACCCGGCCATTATATC 

GGCCATATCGCGCACGCAACAGACGTCAGCGTCGTCCTTTTGTGCTCGTTCGTCTGTGAT 

TGTGCGATCCTTGATTCCGGCAGGCTCTTAGCTGGCCAGCTGGCGCAATTTCTGGTCGCG 

CCGGTAGGCCCCTGGTTGCTCGCCCGTGACCCGCTTGAAGGCGCGTGTAAAGGCCGGCGT 

GCGAGCGGTAACCCACCAGTT 

Sequence 1762 

CCGCGGTGGCGGCCGGGGGCCATTGAGACTGCCATGGAAGACTTGAAAGGTCACGTAGCT 
GAGACTTCTGGAGAGACCATTCAAGGCTTCTGGCTCTTGACAAAGATAGACCACTGGAAC 
AATTGAGAAGGAGAGAATTCTACTGGTC 
Sequence 1763 

NACTACTATAGGTTNAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCCGGG 

CNTGTGAGGTGTCAGTCTGCCCCTACTTGGGGGTGCCTCCCAGTTAGGCTACTCGAGGGT 

CTGGGACCCACTTGAGGAGGCAGTTTGTCCGTTCTCAGATCTCCAGCTGCGTGCTGGGTG 

CTGGGAGAACCACTACTCTCCCCGCGTACCTnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCGTCGAACCCTCGAAGGGGGGGGCC 

CCGGGTACCCMCTTTTTTGTTCCCTTTTAAGTGGAGGGGTTAATTGGCGCCGCTTT^ 

CGTAAATCATGGGTCAATAAGCCTTGTTTrCCTTGTGTGNAAAATTTGTTTATTCCCGCT 

TCAACAAATTTCCCACMCAAAACCATTAACGAAGCCCCGGGGAAGCATTAAAAAGTTGG 

TAAAAAGCCCTGGGGGGGTGCCCTA 

Sequence 1764 
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CAGAGGGGTCTGTCACATAGATGCCTTTCACAGCTGGAAGTGGCAGAGCAGTTGGGTGAC 

CCTACCCAAAGGAAAGATAGATGTGTTCCTGGCCAGTGCAGGGGTGAGATGTGAGCACAG 

CTGCATGGCCATGCTGCTGTCTTGGGTCTTTGTCGGAGGGGATTCAGAAGTTTAGACAGA 

GGTCATTGCCCAGATCAGTTCCAGTGCTACATGATGCTGGTGGTCATGACTTGCAAGGGT 

GGTTCCTGGGCCATCAGTGAGAATAACACAGGGAGTTCCCTCTCTAGGTCTCAGTGAGAG 

/^CAGTCTCTGAGGCCCTGGCTTCTGAGCCGGATATTCCTGATGGGTACCT 

Sequence 1765 

CCGCGGTGGCGGCCCGCCCGGGCAGGTACGAGTAAACTACTTGAAGGCAATTCCTGTGCT 

TAAAATGGCCACGTGAAAGGTGTCCCTGACATGGAAAGGACAAACCATGAAATGACCCTT 

GAAGCTCTTAAGGTGTCTGTCAAAAAAACAAAGCCAACTGTCAAACTGAAAAGCCAAAAT 

CATCACAAAAACACCTATTCTTCAGAGGCCTCAAATCTCAAACTATCTCCCACCATCCTT 

TGTAAGTATGTATMGGAGGCCAGGAAACTAGCATTTGCTGAGACTTTATTCTCAGCTTT 

GT ATGC TTTGCATAAATGATGTCATAGAACACATTGACAGTGAGAAAACTAACGTATTAA 

CATTTTAATTACCTTGCCCTATCACAAGATTAGAAAGATTCAAGATATGAGACATCCAAA 

GTCTGTGATCTGAACACCAGAGTCAGAGGCTGGCAACCTATGGCCCACGGGCTGAATCTG 

GCCCACTGCCTGTTCCTATAAAATAAAGTTTTATTGGA 

Sequence 1766 

CC GCGGT G GCGGC CGAGGTACTAGGATTGCAGGCATGAGCCACCATGCCCAGACTAATTC 

CTGTTTTCAATTTTAGGTAGTCATTTGATTATTGAGTTGTAGAAATCTTACTT^ 

AATATATTAAAAACTATCACCAGTTAGCCATGTGTTMCTGCTTAGTTTTTCAACCAAAC 

TTCAATCATCAGC CAGCCC TCACTTTCAGCCCAGATCGACGGACTCAACGCTTGGTGATC 

GGTTAACTCCTTGTTTTTTAGCTMTGCTTTAATTAGCAATTACCATC 

TAMCATCATACTTCTCAATGCTTTGGTTATCAGCTAATGTCTTCTTTTTCAGCTAACAC 

CTTAGTGGACATTAATACCCTTGTTAACGTATGTTACTTCAACCAGATTAGACCTGGCTG 

CATTGGGAAAGGAAAGTGGGGGGGCTTGACAAGG 

Sequence 1767 

AGGTACTTAC AGTTTGACTTTGAGTC AGCTCAG C ATTCTAAATCAAAC GCAAACAGCAGA 

ATCATATGGCATAGACACTTAGCAAATAGTTTGACTTTGAGTCAGCTCAGCAtTCTAAAT 

CAAACGCAAACAGCAGAATCATATGGCATAGACACTTAGCAAATAGGCAAGCGTCTCTGC 

ATCCCAGAGCAAATCCTCTGAGATCCTCAGTCTCTCCTGCTGAAACTTCATCTTCCACTC 

CTGCCTTTGGGTCATTTCCAAACGGCAGGGCACCAGACACGCTCTACCTTATGTGGAGCC 

ATACTGTTCATTTATTAGTTTTCTGGCTTCGTGGAGAATTATCTGCCTTCTCTGAGTATT 

GATTTCATTGTGCCCCCGCGTACCTGCCCG 

Sequence 1768 

GCTTGAACAAATCCACCTTCTGTGGACCAAGCACCACCCTGGGCATTTCTAGCATGAGCA 

AAATCCAAGGTCCTGGCTGGACTCCAGAGATGCTATTTACCTCAGAAGCATGACAATAGG 

AGGCAGAAGGAGCAGGCAAATCCAAGTCCTTTCTTGTAGTTTCCTTGTTTGGGGAGGAAA 

AGTTGAGTTTTACTATTATGGAAAAGAAACAGGAAATAGAGACAGACAAAGAGATATGAC 

AATACAGTCCTGCCACCCAGATACTCATTTCCACCCACCATTCCATGCATTTGTTTTGAA 

TATATAAGTATGTACCTGCCG 

Sequence 1769 

CCGGGCAGGTACGTGCTAAGTTGAAAAAGGAGCCTCAGATTTCTCTGACCTACCTCTTAC 
CCTGTGCTGTCTCTCCCCAGGATTTTCTGAAGTTCCTTGTCTGCCTGAAGTCCAGACCTG 
CCTCCTCCACATCTGTGGCTTGGTGCGGAGAGCATCTCTGGGCACTGGAGGAACGGTACC 
T 

Sequence 1770 . 

CCGCGGTGGCGGCCGAGGTACATGTGAATGGGAAACATTCATAAGTTCCTATTTGCCAAG 
GGCATGTAAATGTGATAAAGCGGATCCGAAGACACATGGGAGTGAGACTAGGAAAGTATT 
AAGAACATGGGCCTTGACCGGGCGCAGTGGCTTACGCCTGTAATCCTACCACTTTGGGAG 
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GCCGAGGCTGGTGGATCACAAGGTCAAGAGTTCGAGACTAGCCTAGCTAAGATGGTAAAA 

CCCCGTTTCTAAAAATTAGCTGGGCACAGTGGCGGGCACCTGTAATCCCAGCTACTTGGG 

AGGCTGGGGTCACTTGAACCCTGGAGGCGGAGGTTGCAGTAAGCTGAGATCACGCCACTG 

CACTCTAGCCTAGGTGACAGAGCAAGACTCTGTCAAAAAAAAAATAATAATAAAAAAGAA 

TGTGGGC 

Sequence 1771 

GGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCGCGGTTTTTAAATCTGTCACT 

GTTCTTCCCCCACTTCAAAGCTGACTTGACAACTGCATTTATTTCACATTGCAATTGCTG. 

CAGGTTMTMTATTGTTTTATCAGGAAACATTTATCTTGAGGAAGAAAGTTATCCTAAT 

ACNGCAAGATTACCTNATCCCANGGNCAAANTAGAAAGGGGACTGTGTCTAGAGGGTAAT 

GCTAATGCCTGGCATGCTATAAAAATAAATGAAATCTGATTAACACTATTGGAGGGAAGA 

ATAAGTCATCGTTTCTATGAMTATTATGTTTCCTGA 

TCAGCTTAAAAACATTGCATTTAAAGGATTGAGATGACTTTAA 

Sequence 1772 

TNNNCCNTCNGTCCTTTTNGGTTTGCNGCCGAGGCGGTNTTCAAGCTCACNTCAAAGGGC 

CGGGTTMTACCGGTTTATTCCACAAGAAATCAGGGGGATAACCGCAGGAMGAACATTG 

T 

Sequence 1773 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAACTTGACAGTTTATAAAAAT 

GCTAACCTATATCAACATTTTTCCCCACCAAAGTGTTCCCAAGGGCAAAGGTAAGTGAAT 

ACATGTTTCCATGCTTGTGTCAGGGGCACACAGCAAGGGAAGGCAGAATGGGCATGACCT 

ANATTNNANNCCNNAAAGCCTTCAAANTTCCTTTATTAAGAACTTGGGGACTGCATCTTT 

AAAATMTATGGGTTTTNTTTTATTCTTTGATAATAAAAAGAATGGAACCAAATCCTGGG 

TTGTCTGATTCTTTGCCCANTGCNTCTTTCTNATGTCTAACACTGCCTCGTCTTATGAAT 

AACNTCTGNACATNTACAGAGGAAAGGACTTATTTGNNTT 

Sequence 1774 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCCGGCCGCCCGGGCAGGTACACTTATAGT 

TGAGAGCCAAGTCTCCCTTATCATTGGTGAATGAGAATGAGCTACTGAAAACAAAAAGAG 

GGTCTTCTACTCAGCCTCTACCCCTAATATTTATATCAGAAGCAGAGATTAACTGTCCTT 

ACTCATTCACACCNTNATTGGAAAAANAAGGGAAGGTTTCCTAGAAAAAATCCTCCCGCT 

CCACCCTGCAAACTTTATGCTTTTCTGTTACATAATCAGGCAGGGGCAAGACCTAAACTA 

TTTTGAATTGGTGGTGTTGAGGCTAMTTCTCTGCTATTGACAGAATTGAGAATGGTGAT 

CAATTTCCAGAGTAGCCATGTTACAAATTTTGTCCCCAATTTTCAATG 

Sequence 1775 

CCGCGGTGGCGGCCGAGGTACTTACAGTTTGACTTTGAGTCAGCTCAGCATTCTAAATCA 

AACGCAAACAGCAGAATCATATGGCATCGACACTTAGCAAATCCAATGCCTTCCAGGCAT 

CTGTTTCCTGTTGATGAAATCCTCCCTATGGAGAGCAAACTGGTTCATATCTTCAGATAG 

TGTCATTCAACCCCTTGGCAGCTTTCTGGGCTACTAAATATCACACCGTCTGGTGGGACA 

TTTCACCCAAGTTGTTCATTTATTAGTmCTGGCTTCGTGGAGAATTATCTGCCTTCTC 

TGAGTATTGATTTCATTGTGCGCATGGTGAAGATATCGCCTGT 

Sequence 1776 

CGAGGTACGCACTTGTGTGCNTTTCTGTTGGGTATATAACAAGTATGATATTGCTGGGTC 
TTAAAATATTTATATATTAACTTCATTAAATGCTGCCTAAGAATTTTTCA^ 
CTATTTTACCTTCACCAGCNCTAGNTGTTCCATATCCTTGCTGTNACTTGGCATNATCTG 
TAI I I I I IATTTTGACCAGACTGGTCTGTGTGTCATAGTATCCCATTGTCATTTTAAGTT 
GCATGCCTTTGATGAAGACATTTTCTGATGCTTAATTGGCTCTTNGAATATATTCTGTAA 
CTGATTTGTAGGAGTCC CNTTATATATTCTGG ATATAAG N7TTTTTATTGGAC ATAAATA 
TTGCANATATCCCTCTCCCACTTTAACTTGCCTTTT 
Sequence 1777 
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ACTNCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGTA 

TTCTGAAAAA TCTAT CTCTAGAATGAAGTCATGACAGTTCACCCTACTCTACTAGTTTTG 

TCCMGCTAATT7TCATGTATGATTTTGTAATCTGATTGAACMTACTTGAAAAAAAAAA 

AGTTAGATATATAMTGAATTTGTTGGGGGATAAAACATAAATTAAGCCTATCCCTATAA 

AATCTACTCTTTCCTGACTTCCAGGAGTGGTCCTTTATTTGAGAGGCAAAACATATACAT 

TTCACATTACCATTTTTGAGACTTGATGAAGCCCAGTCCTAAACTGTGGCTGAGGCTAAG 

GCTGAATGGTTGGGATTTACTGTGAGAATGGACATTCTCCCGGGTCCTCACCTGAGGTCA 

TTAAATTGATTTAGTCTAGGTCCACTGGTTGTATTTT 

Sequence 1778 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAGGGAGA 

CAGCATGCAGGGTGTGTTCAGAAGAGCTTGCTGAGGTGCTCGGCTCTTAGCATTAAAAAT 

GTGATGTTGGTATATCATCCTGATAGAAAACACTGCTTTCCAAATCCTAGTCACTGGATG 

GGAGGAAAGTAAGAACAGATTCTTTCCAACCCTACTGATTTGTTATAATTCTCCCCATTG 

AAATTGGGACAAAATTTGTAACATGCTACTCTGAMTTGATCACATTCTCAGTTCTGTCA 

ATAGCAGAGAATTTAGCCTCAACACCACCAATTCAAAATAGTTTAGGTCTTGCCCCTGCC 

TGATTATGTAACAGAAMGCATTAAAGTTTGCAGGGTGGAGCGGGAGGATTTTTCTAGGA 

AACTTCCTTCTCTTCCATTAACGTGTGAATGAGTA 

Sequence 1779 

CTACTTAGGGCGAATTGGAGCTCACCGCGGTGGCGGCCGGGTCCGGCGCATCCATGGCGA 

ACAGGCAGACGAACGCGCCTTGCTGCGCGCTGCTGGCGGTGGCGCCCTGCGGCGCCAGCA 

CCGTGAATTGTGGCGCCGTCACCAGTTGCAGCAGCGGCTTGGCATCGGTGCGCAGCACGC 

GGCGCGGCTGGCGGCGCGGGCCCNGCGCCGCANCAATGGCGGGCCTGNAGTTCCAATNGG 

CAAGGGTGCGCGCGCGGCNN 

Sequence 1780 

CCGGGCAGGTACAGCACCTTGTGTCCTGCCACCAGCCGGCTGTGAAACTGGTACGCGGGG 

TAGATGGAAGGAAGAACTTGTGTGCTTAGACCTGACGCTGGGAGGAGATGCTGCCACCTA 

GGTTACTTGTAGGACCCTATACGGCAACCTCCTTTGCCAGGAACTATTTATAAACATCCT 

GCAGGAAAATGC AGTG AAGTAG AAG AGAC AGGG ATATC C C AG AAGGTTATGC AAAACATC 

AAGAGAAGATGAGAGGTCAGAGATGGGMGAAACAAGAACTTTGACATGCTTGGTGTTCT 

TGCCCAAGCTTTGMGAAGTTTACAAAGTCTATATGTCAGAATACACATTTCCCACCTTG 

CCCAA 

Sequence 1781 

TNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCGCGGTTTTTAAATC 

TGTCACTGTTCTTCCCCCACTTCAAAGCTGACTTGACAACTGCATTTATTTCACATTGCA 

ATTGCTGCAGGTTAATAATATTGTTTTATCAGGAAACATTTATCTTGAGGMGAAAGTTA 

TCCTAATACAGCAGATAACTTATCCAAGGGCCAAATAGAAAGGGGACTGTGTCTAGAGGG 

TAATGCTAATGCCTGGCATGCTATAAAAATAAATGAAATCTGATTAACACTATTGGAGGG 

AAGAATAAGTCATCGTTrCTATGAAATATTATTTTCCTGATTGGAACCATTTCCATTT^ 

GTCAGCTTAAAACATTGCATTAMGATGAGATGACTTTAAAACGGTGACAGCTATCTCAA 

GCCCTGTGATTATAAATGGTGTTATAGGCTTG 

Sequence 1782 

ATAGGGCGAATTGGAGCTCCNCGCGGTGGCGGCCGAGGTACCAAAGGTCTTCAAGGGTAA 

CGATGCTGCTCCATTTAAATAAAAGATGAAGGATGTCGAAACACAAGCAGCCTTTAACCA 

GTAGGCCCATAAATAAGACAGTAGCTAGTGCTATGCAATTCTAACACTGGCACTTGTGCT 

GGGATTGGCCTCTTCTAGGCTAGGAATTCTTAACCTGGAGGTCAAGGACTTGGGGGGAGG 

GTCCTATGGACCCCTTGCMTGTAGGCAMTTTTGCATAMCATTTTTCTGCACACGTTT 

CAAAGGAACTTGGAATATCCGTAAGTTGAGACCTCCAAATTACTAGAGCTAATTGCCCCA 

TTTTATTCCTTTGTATCATCTCAAATGCCAATATAATATCTAGTGTATAATAAATGTTTG 

CTGGCTGCTTAGGAGAGAACTAGATGTGTTAATGACGCCATCTCACAGTGAAAAGGGGCA 
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Sequence 1783 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTACAGGGGACC 

GCCAGGGGCCTCGAGAATCGGTATCCTGAGTCCTCTTGAAGAGCAGTAGAGGTTGTTTCA 

TTAAGTGCAAACACATTGTTCTTAATTTGAAAACTGTGGGCAGAAACAGAAGCCCGAGAC 

TMTTTTTCCATTGCTAACTCTAGATCTCGGCCACTGGAGTCTGAAGATACTCTCTTTGA 

GAATGCATATTATTTTGCTCACAGCTAAAACATTTAAGTATCATAGCTGATCAGTGGAGT 

GAGATTAAMGGTTTCTTTTTTGAATCATCAGCTAGAAGATGTACCTCGGCCCGCTCTAG 

AACTAGTG 

Sequence 1784 

NGCGGCCGAGGTACACTTATAGTTGAGAGCCAAGTCTNCCTTATCATTGGTTAATGAGAA 

TGAGCTACTGAAMCAAAAAGAGGGTCTTNTACTCAGCCTCTACCCCTAATATTTATATC 

AGAAGCAGAGATTAACTGTCCTTACTCATTCACACGTTAATGGAAGAGAAGGAAGTTTCC 

TAGAAAAATCCTCCCGCTCCAC 

Sequence 1785 

CCGCGGTGGCGGCCGCCCGGGCAGG1 ACCATTGACTACTTCATACTGCTACAAAATGCAT 

TTGAAA1 I I It I I I ICTGGCCAAGGAACCTATTCATCCACTCTCTCCAAATTAAATACTT 

ACTAGAGAAATGTTGGCACTTATTAACCTTGTAATATGAATTAGAAACAMCTTGACCTT 

CATATATTCTTTTACACTTATGAATCTATTTAACTGCATGAT7TATCTGCATGATTCAGA 

ATTTTCTTGATACCATGTAACTGAAGTAAGATTTAGAAATATATTTTCTTGATACCCTGT 

AACTGAAGTAAGATTTAAAAAGCTGAGGTGACAGTTGTCACCCATAATGCTGATAAATAT 

AGGAATGAGTCGATTATAMTTTAMCCTTTTGGGGGCATTATGCTACCAAAATNTCTCT 

till IAAAA 

Sequence 1786 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTTATTTCTTTGCAAGATAGTTTTGTAAAGCGT 

CCATAAGCAGAGACTGTGTCAAGGCGTCCTACGCAAATTACAGGACCTTGCACAGACAGT 

TGCATAGAGCTGCATAGGGGAAAGGCGGGCAAAAGGTCTGACCTAAGTCTTCAAACACAN 

GCCTTCATACACCGTGCCACCT 

Sequence 1787 

GGTGGCGGCCGCCCGGGCAGGTACAGATTGNGTCCACTGGAMGGTAMTGATTGCTTTT 

TTATNTTGCATCAAACTTGGAACATCAAGGCATCCAAAACACTAAGAATTCTATCATCAC 

AAAAATAATTCNTCTTTCTAGGTTATG AANAG ATAATTATTTGTCTG GTAAG C NTTTNTA 

TAAACCCACTCATTTTATATTTAGAAAAATCCTAAATGTGTGGGTGACTGCTTTGTAGNG 

AACTTTCATATACTATAAAACTAGTTGTGAGATANCCATTCTGGTAGCTCACTTAATAAA 

AACAATTTCAGAATTAAAGGAAATTTTCTATGCAAGGTTTACTTCTCANATGAACAGTAG 

GACTTTGTACTTTTATTTCCACTAAGTGAAAAMGAACTGTGTTTTTAAACTGTANGA 

ATTTAATAAATCAGCANGGGTATTTTAGCTAATAGA 

Sequence 1788 

AACCTTTAGGGTTNTNCCCAAAGTCCCGGCCGNCCGGNTNGGGACTNGGGGGTTGAACAC 

TTCGAAANTTTCANCTTN CGCAC NACG AAANCAAC ACN N N GAGAAGGC AG ATAATTCTCC 

ACGAAGCCAGAAAACTAATAAATGAACAACTTGGGTGAAATGTCCCACCAGACGGNGNGA 

TATTTAGTAGCCCAGAAAGCTGCCAAGGGGTTGAAAGACACCTATCTGAAGATATGAACC 

AGTTTTGCTCTCCATAGGGGAGGATTTCATCAACAGGAAACAAGATGCC 

Sequence 1789 

AGGTACGCGGGAGTAAATACAGTACAAATTGTTCTACTGTTTTAAAAAGTTTTCCGCAGA 
ACAGTGCATTTATGGCAATGCTATGTTTAATGAGTTAGGGACATCAAATATATAGTAGTT 
CCTTATTTTCAGTTGTGAAMTGAAATGGCTAAAGCAGAAGAGACGTCTATTTTAGTCTT 
TTAAAAATGTGTGTGGGTGGTC I Nil \ CCTCAGAAGCCCAAAGCACATGTATATTTTGT 
TATTTCTCCTTGCTATATTCCTGAGACTATACTAAAAACTTTAAGAAAAGGAACAAGAAA 
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AAGGTAAATTCATGTGTTCCCCACTGCTGTGTCTAGAACCAAGATCACATTATATCATTG 

TTAAAATTGTGTTATCTAGAAAAGGTGCAATATAGGGGAAMCACTCTAAGAATCTTTTA 
AAAGCCTAGT 

Sequence 1790 

AGGTACGCGGGGGACTCCTTGAGAAGAAACGGCGGAGACCTGAGACCGGGAGGCTGAGGC 

TGT AGGTG GGCTCCGCTGGGTAAAAGTTGCCGCAGCAGCTGTCCCTTGGCCCCATCGCGA 

TTTATTTTTCCCCCTTGCTTTCCGGGTCCCGGGATCCCAAGTTTGTAACTAACGGGAAGC 

GAAATCCCACCCGAAGCAAAATGTTTGCGAAGTTTCAGGCNCCCTTAATTGAAAGGCTGT 

AATTAACMGTCCGTTGTTTGCCAGCCAGGCNCCGTTGCAGGCNCTTTCTGNNGATTGNC '» 

ATTTATTTCTCACMGCAACCNTAGGGGGNTTGTTANTNNTTGAAAATNTGGAAAANACN 

TNTTAAAAATTTGNGGGAAMCCNCNNTTTTTGNNNNNANCCCCN I I I I III I > I I CGGG 

GGGGGAAAAAAAA 

Sequence 1791 

CCGCGGTGGCGGCCGAGGTACAACTTGACAGTTTATAAAAATGCTAACCTATATCAACAT 

TTTTCCCCACCAAAGTGTTCCCAAGGGCAAMGTAAGTGAATACATGTTTCCATGCTTGT 

GTCAGGGGCACACAGCAAGGGAAGGCAGAATGGGCATGACTTAGATTAGATTCCAGAAGC 

CCTCAAATTTCCTTATAANACTTGGGGACTGCATCTTAAAATAATATTG 1 1 U K M I AT 

TCTTTGATAATAAMGMTGGAACCAMTCCTGGTTGTCTGATTCTTTGCCCAGTGCTCT 

TTCTTATGTCTAACACTGCCTCGTCTTATGAATAACTCTGACATTACAGAGAAAGACTAT 

TGTTTGTTCAGTTCAACACTCAATTCCATAGGTGCTAGCAATTATGAAAAAGGCTGTTAG 

TATCCAGCGGGGAAAGCAAAGGATCCGCAGGTGGTAACACCCTA 

Sequence 1792 

CCGCGGTGGCGGCCGAGGTACATGCCAGGCACAAGCTGGAGTCACAGATGCCACACTGAC 

TATCACATTGTCATCATACTCAGAGAGAGAAAGGTGTATACACCCAGTATTTTCTTATGT 

TTTCTCTATACTTCCATATCCCCACCCCCATTATCCCCCGAGACATTCCCAGTTGGAAGT 

CAGTTG CTTATGGGAAACTGGCTGCAGGGGGATCAGTCCACCTGTAACACACG G AAAGG C 

AGAGAAACAATGAGTCATGGATTTGAGTGCAAACAAGCCCAGTCCTGCTACAGAGAGTTA 

CGGAA AAAAT CTCACTGTTGAAGAGAGACATCTATAAGAGAGGTGAGGAAAATGACTGAT 

AGATCCTTTTACAATATTCTAGGGTAGTCTGTCTGTGGGAGGGGGCAGGAGTCTATGGGA 
ACTCT 

Sequence 1793 

AGGTACAACTTGACAGTTTATAAAAATGCTMCCTATATCAACATTTTTCCCCACCAAAG 
TGTTCCCAAGGGCAAAGGTAAGTGAATACATGTTTCCATGCTTGTGTCAGGGGCACACAG 
CAAGGGAAGGCAGAATGGGCATGACCTAGATTAGATCCCAGAAGCCTTCAAATTCCTTAT 
AAGACTTGGGACTGCATCTTAAAATAATATTG 1 1 M I M r 1 ATTCTTTGATAATAAAAG A 
ATGGAACCAMTCCTGGTTGTCTGATTCTTTGCCCAGTGCTCTTTCTTATGTCTAACACT 
GCCTCGTCTTATGAATAACTCTGACATTACAGAGAAAGACTATTGTTTGTTCAGTTCAAC 
ACTCAATTCCATAGGTGCTAGCAATTATGAAAAAGGCTGTTAGTATCCAGCGGGGGA 
Sequence 1794 

GTAATACACTACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACTCTGCG 

TTGTTACCACTGCCTCCCGGGACTCTGCGTTGTTACCACTGCTTCCCGGGACTCTGCGTT 

GTTACCACTGCTTCCCGGGACTCTGCGTTGTTACCACTGCTTCCCGGGACTCTGCGTTGT 

TACCACTGCTTCCCGGGACTCTGCGTTGTTACCACTGCTTCCCGGGACTCTGCGTTGTTA 

CCACTGCTTCCCGGGACTCTGCGTTGTTACCACTGCTTCCCGGGACTCTGCGTTGTTACC 

ACTGCTTCCCGGGACTCTGCGTTGTACCTGCCCGnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnCGGGCTGCAGGAAATTCGAATATCAAAGCTTTATCGATACCGTCGACCTTCGAAGG 

GGGG 

Sequence 1795 

CCGCGGTGGCGGCCGAGGTACTCTGCGTTGTTACCACTGCCTCCCGGGACTCTGCGTTGT 
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TACCACTGCTTCCCGGGACTCTGCGTTGTTACCACTGCTTCCCGGGACTCTGCGTTGTTA 
CCACTGCTTCCCGGGACTCTGCGTTGTTACCACTGCTTCCCGGGACTCTGCGTTGTTACC 
ACTGCTTCCCGGGACTCTGCGTTGTTACCACTGCTTCCCGGGACTCTGCGTTGTTACCAC 
TGCTTCCCGGGACTCTGCGTTGTTACCACTGCTTCCCGGGACTCTGCGTTGTACCTGCCC 
G 

Sequence 1796 

ACTATAGGGCGAATTGGAGCTCACCGCGGTGGCGGCCGAACATTTCGACATACGCCAGCA 

TGTGATGGCCGAAGGTGATCGGCTGGGCCACCTGCATGTGGGTAAAGCCCGGCATGATGG 

TGTCGGCATGCTGTTCGGCCAGGTCCGTCANCGCGCTGCGGAACGTCGCCAGCAGGGCGG 

TGATGTCGTCGATGGCGGAACGCACGTTACAGGCGGGATGTCCGGTGGGCCACCTGGTCC 

TTGCGCGAACGGCCGGTGTGCAATCGCTTGCCCGCATCGCCTACCAGTTCGGTCAGGCGT 

TTTTCGATATTAAGGTGGACATCTTCCAAGATCCAGCAGCCATTCTAAC 

Sequence 1797 

N I I I I I M i I I M I I I M I I I IGGGCAATACCAGCTTTCACCCTGTCTTCTGCACTGCTT 

AGCCTTGGCTCTGCTGAGTTGTGCAACACATTCTGAGAGATGCCATACAATGCTCCAGGC 

AMCTTATAGAAAAACAGGAATGAGTGATTTTATACGGGATGTGTTTCAGCTGTCCAATT 

CAAMTAMTCACGTCCAGGTGCATTCTTTCCTAATTTGTGACCCAMCTGCTTGCACTT 

TATACATTMTGMTTATTTTTTTAAAAGAMTTAGCTNTATCATGAAGACAGTGCTATC 

TMCCCTTTGTGGTCTTTCTCCCTTAAAAAGCAATTC 

TTMTCTACATATTTGAATGAAGCTGAAATGAACTAACGTATGTCAACCAGTAAAAGCAG 

GACACTTAATAATMTGCTTCCATGAAAATATACATTGTGCATATGTAGGCTCTTTTTGG 

CATTACCAAATTTGCCAAGAACTTTCTTCTATAAATTAAATTCATTATCAAAA 

GTGAGCCAGCCTTTAAAAATTNCCAT 

Sequence 1798 

CGNATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTC AG ATCTTACCACTGAGG AATTTT 

AAGTTATCATTCCTGTAAACATTTCTCCAGTGGCTCCACGGAATTTAAAAATGCCTTCCA 

AATTGCCTTCAGTTTTCACAGGCCATCTCTTACCCTGTGCTGTCTCTCCCCAGGATTTTC 

tGAAGTTCCTTGTCTGCCTGAAGTNCAGACCTGCCTCCTCCACATCTGTGGTTTGGTGCG 

GAGAGCATCTTNTGGGCACTGGAGGAACGGTACCTGCCCG 

Sequence 1799 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTCACANGATCT 

GTTTCCAGAAACCACTACAAGTGTCATGAACCTTGGGTTGCTCTCTGATTGAATTCTGGG 

ATCCAAGCTGGGTAAGGAGAGAGATGAGTCCAGGCTGCTACAGAAGATTATTTTCCATAA 

TTTCAATATGAGCAGTTTTNAAACACANAAGGATTTNNTNGTCNATGTTAGCTAGGCCAG 

CCTTAACACTTCCTCTACCTCATTTCTTAGAAACCAACAAGGGCCTTTCCAGAGAGGGCC 

AAGGACGGTGGCATCCAGAAGGGATGAAAACACTTGAGTACCTCGGCCGCTCTAGAACTA 

GT 

Sequence 1800 

ACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACMCTTGACAGTTT 

ATAAAAATGCTAACCTATATCMCATTTTtCCCCACCAAAGTGTTCCCAAGGGCAAAGGT 

AAGTGAATACATGTTTCCATGCTTGTGTCAGGGGCACACAGCAAGGGAAGGCAGAATGGG 

CATGACCTANATTAGATCCCANAAGCCTTTCAAATTCNCTTATAAGACTTGGGACTGCAT 

CTTAAAATAATATTGT N I I M I I I M I ATTCTTTGATAATAAAAGAATGGAACCAAATCC 

TGGTTGTCTGATTCTTTGCCCAGTGCTCTTTCTTATGTCTAACACTGCCTCGTCTTATGA 

ATAACTCTGACATTACAGAGAAAGACTATTGTTTGTTCAGTTCAACACTCAATTCCATTA 

GGTG CTAG C AAATTTATTG AAAA 

Sequence 1801 

CCGGGCAGGTACTCACAGATCTGTTTCCAGAAACCACTACAAGTGTCATGAACCTTGGGT 
TGCTCTCTGATTGAATTCTGGGATCCAAGCTGGGTAAGGAGAGAGATGAGTCCAGGCTGC 
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TAC AG AAGATTATTTTCCATAATTTC AATATG N GCAGTTTTAAAACAC ATAANGGATTTA 

TAGGTCATGTTTAGCTAGGCTCAGCCTTAACACTTNCTCTACCTCATTTATTAGAAACCA 

ACAAAGGGCCTTTTCCAGATGAGGGCCAAGGACGGTGGCATNCAGAAGGGATGAAAACAC 

TGAGTACCTNGGCCGCTCTANAACTAGTGNGATCCNCCGGGCTGCAGGNAATTCGATATC 

AAGCTTATCNGATACCNGTACGNCCTNTGAGGGGGGGGGCCCG 

Sequence 1802 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAATCACTGGGTGTCCCAGT 

GCCCGCTGTGAGCTACGCTACCCGCCTAGGAGAGTCTGAGAGGCCAGTCTCTCTTGGCAG 

TGAGGCCGTCTGTCCGGAAATAGAGGCGCACATTGGGAGGGAAATGCTCTCTGTGTACCT 

GCCCGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnGCCTGNAAGGAATTTCTAANAT 

CAANGCTTATCGATACCGTCTGAC 

Sequence 1803 

TGGCCGCCCGGGCAGGTACGCGGGGATAATAAAAAATGTATTTACTGAGCCAGTTGTGGT 

GGCTCGCGCCTGTGGTCCCAGCGCCTTGGAAGGCCAATGAGAGTGGATCGGTTGAGGCCA 

GGAGTTTGAGACCAGCCTGGCCAACATGGTGGGATGCCGTCTCTACTGAGAATACAGAGA 

TGGGCCGGGCGCGGTGGCGCGTGCCTGTAGTCCTCAGCCTCCCAAAGTGCTGGGATTACA 

GGCGTGAGCCACTGCACTTGGCCTGGACTCATCTTCATTGTCCACCTCCTAGGCTAAATT 

TATGTCTTTTATGTGCTTGTAATCTCTGCATACCACTTTAGTGACACTTGCCCTGGAGTG 
TGATTGGTAATCC 

Sequence 1804 

AGGTACTTCTGCCTGGGTTATGGGAGTAAAATCTTGTCCAGGGACATCCCAGGGAAGGTG 

AACTTGCCCAGGCAGATGCGATAGACAGCGCTCAGAGGAATCCGCTGCAGCTACACACAA 

CTCAGCATGATGAAGTCGTATTTGCAGATCAAGAGAGTCTTGTCTGTGACCAGTAGAATT 

CTCTCCTTCTCATTGTTCCAGTGGTCTATCTTTGTCAAGAGCCAGAAGCCTTGAATGGTC 

TCTCCAGAAGTCTCAGCTACGTGACCTTTCAAGTCTTCCATGGCAGTCTCAATGGCCCC 
Sequence 1805 

CCGCGGTGGCGGCCGAGGTACTTCCGTGAGATATAAGGCTGATGATAAGGATTGGGGTAA 

AGAGTTGATTCGGTCACACTTGACTGCCTTGAAATCTGTCTTGGCATCGGGTCCTCTTCC 

TCCATCGATAGGTGGTCAGTCTCAGAGGGTTGGTCATCTCCCTCTTGCTTCAAAGACACA 

GNCCTGGAAATCNTTCAAAACTCCCAATTGTTGACATTTCTTCAGCCTGACTAGGCTGGT 

CCTCAAGGCTCTTCTTTCTCTTTCTGCACAAAGGGGGCTAGACCACAGATAGTGGCCAGG 

AGAGGTGTGACTGGCTCAGATAAAGCCAGGCAAGACAGTGCCCTGTGCTGTGACCCCACT 

CAGTCTCCCGCGTACCTGCCCG 

Sequence 1806 

CCGCGGTGGCGGCCGAGGTACATACATACTTGTTATTCTTGTTACATCTTCCTTATTGAA 

TAMTNGTCATCACTTAAAATGTCCCTCTTCATCTCTAATAAGTTTTTGTCTTACAGACT 

A1 ^ ATC TAGTA ^ M ^^ 

ATC Mill CTATCCTTTT ACTTT T CTTTTAAAAATAATTTN AAAAAATTAATTNACATAC 

CCCTAAAATGCAGAAGATTTTTAAGTGCTCAGTTCAATGACTGGTGACAATTATACACAG 

CTATATATCCCACCCCCAAACAAGATATTAAACCATATATAATATTTCATACAGGNTTTC 

TCATGCCCCTTTCCAGGCCAATCCCCAT 

Sequence 1807 

CCGGGCAGGTACTACAAATACATTTTTATAAATGGATGATTGGAAGTGCCCTAAATGTCT 

TATAAGAAAATACATTATGGTACGATATATATACTAAAATTTATATTTCAGMGAATATT 

AAAAGAGGAAAATGCTAGTGGTAAAATGTTAAGTAACAAAAACATATATTATCCTCAACA 

AAATATTATCAAACCAAATTCAGCAACATCTAAAAAGANTTATACACCATGACTAAGTGA 

GATTTATCCCTGGMTAAGCTTAGTTTAACATCTGAAAATTCATCAGTGAAATACACCAT 

AGTAAGAATAAAGGACAAAAACCACAGGATCATCTCAACAGAAACAGAGAAGGCATTTGA 

CAAACTCCAACACCCTTTCATGATAAAAATACTCAACAAACTAGGAATAGATGAGAATTT 
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CCTCAACCTAATAMGGGCTTCTGTGAAAATCTCAAGGCTAATTTCGTATTAATGG 
Sequence 1808 

CCGCGGTGGCGGCCGAGGTACTTCTGCCTGGGTTATGGGAGTAAAATCTTGTCCAGGGAC 

ATCCCAGGGAAGGTGAACTTGCCCAGGCAGATGCGATAGACAGCGCTCAGAGGAATCCGC 

TGCAGCTGCACACAACTCAGCATGATGAAGTCGTATTTGCAGATCAAGAGAGTCTTGTCT 

GTGACCAGTAGMTTCTCTCCTTCTCATTGTTCCMGTGGTCTATCTTTGNCAAGAGCCA 

GMGCCTTGAATGGTCTCTCCAGAAGTCTCAAGCTACGTGACCTTTCAAGTCTTCCATGG 

CAGTCTCAATGGCCCC 

Sequence 1809 

CCGCGGTGGCGGCCGTAGCTTGCAGGGGTGCTATGTGAAGATGGCGGAATTCCCAACGTG 

CTTCTGCAAAATTTTATGTATTCATCCCCCTCTGCTCTCAGTAGGGTGTTACCACCTGCG 

GATCCTTTGCTTTCCCCGCTGGATACTAACAGCCTTTTTCATAATTGCTAGCACCTATGG 

AATTGAAGTGGTGGMCTGMCAAACAATAGTCTTTCTCTGTAATGTCAGAGTTATTCAT 

AAGACGAGGCAGTGTTAGACATAAGAAAGAGCACTGGGCAAAGAATCAGACAACCAGGAT 

TTGGTTCCATTCTTTTATTATCAMGAATAAAAAAAAACMTATTATm 

CCCAAGTCTTATAAGGAATTTGAAGGCTTCTGGGATCTAATCTAGGTCATGCCCATTCTG 

CCTTCCCTTGCTGTGTGCCCCTGACACAAGCATGGAAACATGTATTCACTTACCTTTGCC 

CTTGGGAACACTTT 

Sequence 1810 

GCGGTGGCGGCCGAGGTAC 1 1 I 1 1 I I I I I I I I I I I 1 1 I! I GCAAG ATGG AATTTCATCAT 

TCCATCGGTTTTGCCCAGACTGGTGGCAAACTCCTGGGCTCAGGATATCCGCCAGCCTCA 

GCCTCCCAMGTGCTGGGAATACAGGTATGAGCCACTGCACTCGACCATTTATGCATTTC 

TTCATAATCCTTTTACTGACMCCGTAAATTAACCTCTGGGT7TCTGTCAATCACTGGAT 

TTTAAGTCACTAAGTAGCAGAACTATATCTAATTTATTAACATTCATCATCCCCCTTTGC 

CAGCATTGAACATGATGTTTTGCATATANTAGGCATTAAATTNGCTAAANGGAACTAAAT 

AATGTTrGATAAATAACATCATGGCACATGACCATATTAGGCCTAACTAATTATTTATTC 

TTCCATTTAGTGAGTGCAAAACTTTAGTCAAAAACATTAAACATAC 

Sequence 1811 

CCGGGCAGGTACTATTAAATTTAATGAGAGGTAACCCATGCACCGGCAGGAAGTTCAAGG 

GGACAGAAGGGGCTTGCTCCACTCTGGAGCCCTATCTCCTCTCACCTCCTTCCTTCTAGT 

TTGACATGCGATCTTGTCTCTCTCTTTTATAAAGTGCTTCCAGGCCCNTTTGAATAAGAT 

GCTGNCCTAAATCCTTCCTCCCGCGTACCTN 

Sequence 1812 

CCGCGGTGGCGGCCGAGGTACGCGGGGGGGACAGGCCATCTCGCTATAGGAAAGGAAAGT 

GGAACAGCATTCATCCTCMCATTTTTACGAAGACAAAATGAAGACTGGAGTAGAAGACT 

GATCAGTGCAGGTGTAGCATAAAAGTGTAATCCTGGAAGATGTGGTGTGAGAAGGTAGCA 

CAAGTG AAAN CAAG ATACAGGAGATAGGG AAN GGAAAGCTGG AAG C AN AGGTC ACTGGAG 

GGAGAAGGGAGATGGACACATTCAGGGCTACAAAGCAAGTTCTATGTGATTTGCTCACCT 

CTCAATTGTGGGACCCCTCAAAATGTGTACCTGCCCG 

Sequence 1813 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGGGACAGGCCATCT 

CGCTATAGGAAAGGAAAGTGGAACAGCATTCATCCTCAACATTTTTACGAAGACAAAATG 

AAGACTGGAGTAGAAGACTGATCAGTGCAGGTGTAGCATAAAAGTGTMTCCTGGAAGAT 

GTGGTGTGAGAAGGTAGCACAAGTGAAGCAAAAGATACANGGAANATAGGGAAAGGAAAG 

CTGGAAGCAGAGGTCACTGGAAGGGAGAGGGAGATGGACACATTCAGGGCTACAAAGCAA 

GTTCTATGTGATTTGCTCACCTCTCAATTGTGGGACCCCTCAAAATGTGTACCTGCCCG 

Sequence 1814 

AGGTACTCTGCGTTGTTACCACTGCTTCCCGGGACTCTGCGTTGTTACCACTGCTTCCCG 
GGACTCTGCGTTGTTACCACTGCTTCCCGGGACTCTGCGTTGTTACCACTGCTTCCCGGG 
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ACTCTGCGTTGTTACCACTGCTTCCCGGGACTCTGCGTTGTTACCACTGCTTCCGGGACT 
CTGCGTTGTTACCACTGCTTCCCGGGACTCTGCGTTGTTACCACTGCTTCCCGGGACTCT 
GCGTTGTTACCACTACTTCCCGGGACTCTGCGTTGTACCTGCCCG 
Sequence 1815 

CCGCGGTGGCGGCCGCGTCCTTGCAGGGTTTTATCGGCGCGCAGGTGAAGGCGGACCGCA 

ACCGCCTGGCGCTGGCCATGATCTTGTGCTGGCTGCTGGCCGACGACTGGTTCATCGGCC 

AGCGCTTGCCGCAGCAGGATCTGCTGCAGGTGCTGGGCGAGGCGGCGCGCGAACTGGCCG 

CCAGTACGCCGGCCCACCAGTTCACGCAAGACCCGGAGCGGCGCGAGGAACTGGCGCGCA 

TCGTGCTGGCGCGCCTGGGCTTTCGGCCCAGCGACGAAAGCGTAGCGCAGGCCACGGACA 

GATTGTCTGCCATCAGCGGCACGGAGCGCCGCCGCCTGCTCGAACGAAGCCGCTGGCCGA 

ACAGCGCGCGCGGGAAATCCGCGAGGCGCTGGCGAAAAA 

Sequence 1816 

CCGCCCGGGCAGGTACAGTAGAATCCAGTCGTTGGTTTGGAGGCGAGGGCTCTCTGCGGT 

CCTCTCCTACTGTCAGGAGCTTCCCGTGCTCATTTTTCATTTACTGATGCACTAAAGGAG 

AGTTTGACTTTGAGTCAGCTCAGCATTCTAAATCAAACGCAAACAGCAGAATCATATGGC 

ATAGACACTTAGCAMTCCMTGCCTTCCAGGCATCTGTTTCCTGTTGATGAAATCCTCC 

CTATGGAGAGCAAACTGGTTCATATCTTCAGATAGTGTCATTCAACCCCTTGGCAGCTTT 

CTGGGCTACTAMTATCACACCGTCTGGTGGGACATTTCACCCAAGTTGTTCATTTATTA 

AGTTTTCTGGCTTCGTGGAGAATTATCTGCCTTCTCTGAGTATTGATTTCATTGTGCGCA 

TGGTGAAGATATCGNCTGTTCTACAAGGGAGTAAGAATGAGTCCCCGCGTACCTCGGCCG 

CTCTAGAACTAAGTGGATC 

Sequence 1817 

ccgcggtggcggcccgcccgggcaggtacctggtaatttacagaacgttcgttatcagat 

ttgataattatatattaaggttgtaaagaatgtgtatgccagaattataggtagtagatc 

ctagattcttaggaaaaatagtttctttataatcttttgagtagtgaaatgg™ 

acaatggttatgaactgggtcaaggcaaaagggccactatatgtcttcagncatctttct 

atgcctgaaatccangaaacagtgaaaatggatgttccctggaacagccattgcgatgcc 

atatgttggtcattggtgtccttaaagtgtatctcaaataattg n gtgtctcc cttactt 

gtgaactgaggggtcacaatattgatgtgcctgatgtgtgcccctctgcatttcgtgggt 

agtgtctcaaatttggctatcatattgatgagmgtcctaatttggtttagctgccagag 

gtttag g nggttttcacttcc ag n g acttacctg attgc ac aatagttccaaccgtc act 

tggttgcactccttcmngctagcatcttt^ 

tgggnttatccttttaaaatanttgcttga 

TATATTTGGTGGGATGCCTNAATNCAATGGGGCCTCT 
Sequence 1818 

TACGACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACGCGGGGGG 

ATTCAGAGTGAGAAGGCATAAAGGAGAATCCCCAGCTGACTTGTGCAGTGGTTAATTGAA 

ATTATTCAGGCAAGAGATGATGGTGTCTTGGACCAGGGGATGAGGAAGGCTACAAAATGT 

GTCTACCTGTATTCTGTGAGGAGMCGTGTTCCCTGGTTTTAGATACTGTGAAGATGGAT 

CAGGAGAGAGTTTATCTAGACTGTTGGGGAAAGGTGTTGCGATTCCTTCAGCTACACAGG 

ATTGAAAGGAGACATTTCTGAAGGGGAAAAAGGAAATGAAAGAAAAGATGTTTCAGATT^ 

AGGATATGCTGTGTGGTGAACTTGTTCTTCACTCTGTAGGGTTCACAAATGACTCTTCAC 

TGCCCTCTTGGATGAAATAAACTGGTTCCCATAGAAATGGACCGTCTCTGATTTCACAGT 

CTAGAGCACACAAATAACCCAACTTTCTGGAAGCAAGGGTGATGGCTCGTGAGAGCTCAA 

GGACAAGGAGGAGTCTGTAAAAAGCACACCTGGACTCGGGGAAATCATTGCCTCTCATCC 

TGAATTAAGTAGCGCTTGAAAAGACTCTCTTTGAATCAGAAAAATAATGCGAGCCTCCTA 

TCAAACTCCTTACACAGCCCACCTGAGTGATCAAANGCTTGGGTGGAAA 

Sequence 1819 

CCGCGGTGGCGGCCCGAGGTACTTACAGTTTGACTTTGAGTCAGCTCAGCATTCTAAATC 
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AAACGCAAACAGCAGAATCAJATGGCATAGACACTTAGCAAATCCAATGCCTTCCAGGCA 

TCTGTTTCCTGTTGATGAAATCCTCCCTATGGAGAGCAAACTGGTTCATATCTTCAGATA 

GTGTCATTCAACCCCTTGGCAGCTTTCTGGGCTACTAAATATCACACCGTCTGGTGGGAC 

ATTTCACCCAAGTAGGCAAATGCATGACAGAACTCACTGGATCAATATCTCAAAATGCTG 

GGAAGAAAAAMCGGTMCCAGAAATTCTACACAGAGTGAMCTACAAMCAAAAATCAA 

AGCAw^AGCAACMTTTCTGGATATGCATAAAAGATGAAATTATATGTTGCTAGAAGATCC 

ATAGCACAAGAAATGCTGATATCCNTCAGGCTGAGGGCTATTAAACAAGATGGAAATCCA 

GAAATTATAGAAAGGACAAAAAGAACAAAAATGAGAAGCATACAGGTCAATACAAAGACT • 

GCATATTCTCCTT 

Sequence 1820 

TTGNCCTCCCTGGCTATAACTACTCTAATCTTCCACCTTTNTTTCTCCTTACCCTNACTG 

CTTTATCTCAAGAAGATCTTGTGTGGTATATAGATGCGAGAAAGTTACCTGCATGATTCC 

GACAGGTGCCCTGCCATCACTCTAATGCTGTTCTCATTGCTNGGCCTTATGGCACATTCT 

CACTCACTCACTACCCCTGCCCCGCGTACCTGCCCG 

Sequence 1821 

CGAGGTACATGTAGATCCAMCACATGTAAACATTTAGTGGAATTTCCAATCAGTATATA 

ATGATTCATTAATTTAAAAAAAMTGGTTCCTGGACAAGTGATTGATAACATGGAGAATG 

AAAAAGGTTAGATTTATACTTTACAMCCATACACATAAATAAATTTCAGAMGGTTG^ 

AATCTAAATATTTAGAATGAMTGATCAAAGAACTAGAAGATMTTTAGATGAATATTAC 

TGTAATCTTAGGTTTAGAGCTTTTAAAACATGATAAGGAAGAAACCATAAAGGAAAATAG 

AGGATTTGAATATATGAAMTTTTACACTTCTATAAATGAAAAATAAGCCATGAAAAAAC 

ATAAGAAGTTCAATAACTATTGCCACATAAATCACTCTCTTAAGATTGATTTTCATATAT 

AAAAAGCAGTAAAAG C AAAATCCC AATAGGAAATAGCCTAAGTGC ATAATTACCAAGN N A 

AAAN ATGTN N NTAAAANN N AAAAAAAAAGTNCCTGCCCGGGCGGCCGGC 

Sequence 1822 

CCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGCCCGGGCAGGTACAATAGGAA 

TAATGTAAATA I I I I I ATATTTAA \ I I II I I I AATTACAGAGGTTATTG AGTTTGTGTTT 

TTCAGCTGCAGAGAGAGAAATTTGTGGGGATAAAGTCACAAGAATACCAGTTGGCAGAAA 

TTGAATTACCTTCCTTTGTATAAGTAATATCCAGTCTAGTCTTAAATGCTATAGGATGAG 

ATAAGAGAAGGATAGCTCACATTCCAAAATAAAAGCAAAAATATAAAATATTGAGAATTG 

AATTTGACAACATGCGAACAAAACTGAAAATAGAAAAATATAAACCTTTATTAGAGGACA 

TAAAAGAAGCCTGGAATAAGTGGAGAGACAGGTCAGTTTTCTAGAAAGACATGATATTGT 

TAAGCTGCTGATTATGTTCAMTTCATACATTTAATTTCAGTC 

TTGTTTGAATCTGAAAGGCTGATACTGTAGGCCATTTAAAAGAATAATATATGGGCTGGG 

CACTATGGCTCACACCTGTAATCCCAGCACTTTGGGAGGCCAGGGAGGGCCAGATCATGG 

GGTCAAGGAGATTGAGACCCATCCCTGGCCCAACATAGTGGAAACCCCGTCTNTCTTAAG 

AAATCCAAAAAATTANCTGGGTGCAGTGGCAAGGTGCCTAAAGGTCCCACTTGCTCANGA 

NGCTTGAGGCAAGGAGAATACCTTGAACCTGGGANGCGGANGGTGCAGTGATCCGAGATC 

CACCCTTTGCACTCTTGCCTGGCCGACCNGAACGAGACTCCGNTTCAAAAAAAAAAAAAA 

A 

Sequence 1823 

GCGGTGGCGGCCGAGGTACGCGGGCATGACAACTAATGGGAAAGACATGGCTCAGATGTG 
CAGCCTCAGAGAGCTTCTGMCATTTCTTCTCAGACTAAGCTCTTACACACAGTTGCAGT . 
TGAAAGAAAGAATTGCTTGACATGG 
Sequence 1824 

CTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGAGATGTGTT 
ATTTATCATTTTATGATGCCAGAGATTTACACGAATTTGTGAAA 

AGATTCAGAGGCGTGTTTGAGGCAAGGTTTTCATTGTGCAGTGAGGAAATTGGCACATAG 
AAATCAATTGATGTGCAGGACTCTTGGACTGACTATTCTGGTTATGTTTCCTTATAACAC 
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ATTAAGAAGAAATTTGAGATAMGTTAG 
ACAAATGTTGCAGTATAATTTT 
Sequence 1825 

CACTCCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATTCGA 

TTTATTATATGTTAACTACTTCTCAATAAATCTGTTAAAAAATTAAACACATTAAACCAC 

CCCTCAAGAGAATCCCTAAMTAAAAATTTAAAGAGGGGGAAAAAAGGAAGGGAAGCTAC 

TGAAAGTTGTMGAAGAGACAAAMGTTACCAATAATGAAGCCAGTGTTTCAAGGAGTGT 

GCAGTTAAAATAACATGACACACAGATATCAAAACTGGCAACTGAGAGAAGTCCAAGGTC 

CTTGGCAAAGAGAAAGCCACTGACCATCCGTGGAAGTGTAGTAGGGTGACAGCCAGAATG 

TCAGAGGCTGAAGGGAAGCTATGTGAAACAGAGAGGATGGCAGTGAGCTTAGACCACTTG 

TTTAAGAAGCTCAGTAAATGAAAATGAG 

Sequence 1826 

AGGTNCACTTATAGTTGANAGCCAAGTCTCCCTTATCATTGGTGAATGAGAATGANCTAC 

TGAAAACAAAAAGAGGGTCTTCTACTCAGCCTCTACCCCTAATATTTATATCANAAGCAG 

AGATTAACTGTCCTTACTCATTCACACGTTMTGGAAGANAAGGAAGTTTCCTAGAAAAA 

TCCTCCCGCTCCACCCTGCAMCTTTATGCTTTTCTGTTACATAATCAGGCAGGGGCAAN 

ACCTAAACTATTTTGAATTGGTGGTGTTGAGGCTAAATTCTCTGCTATTGACANAATTGA 

NAATGTGATCAATTTCANAGTANCATGTTACAAATTTTGTCCCAATTTCAATGGGGAGAA 

TTNTAACAAATCATTAGTTGGTTGGCAAAANCTGGTTCTTAC 

Sequence 1827 

GGAGCTCCCCGCGGTGGCGGCCGAGGTCGCGGGGGGCTTANACCTGACGCTGGGAGGAGA 

TGCTGCCACCTAGGTTACTTGTAGGACCCTATACGGCAACCTCCTTTGCCAGGAACTATT 

TATAAACATCCTGCAGGAAAATGAGTNTATATGTCANAATACACATTTCCCACCTTGCCC 

AACAGTGGAAAANCATAAGAAGAGAAAAACNTTAAAAAATGACANGGAAGTTAATGGAAG 

TCAGCANTGTGATGGTGTTTGGAGGTGGAGCCTTCANAAGGTAATTAATGCCCTTGTAAN 

AANAGGCCANANAGCTTGCNCACCTTNTTCCTGCCATGT 

Sequence 1828 

CCGGGCAGGTACGCGGGGAGACCTGACGCGGGGGGAGATGCTGCCACCTAGGTTACTTGT 
AGGACCCTATACGGCAACCTCCTTTGCCAGGAACTATTTATAAACATCCTGCAGGAAAAT 
GCANTTG AAGTN G AAN AG ACANGG N ATNTTTCC AAG AAAGGTTTATGCCAN AAACATTN A 
TAGTAGAAAGTATTN NANNAAGGGAGGTNCTATATTGGTC AAG N AATAACCAACATTTTT 
TCCCCA 
Sequence 1829 

CTATAGGGCGAA TTGGA GCTCCCCGCGGNGGCGGCCCGAGGTACTTACAACTTTTCATAT 
AGTATTGGTAGTGTTTTGMTAATTTAAAAAATCATC 

TTATATAGTGCTTGTATTAATGATTTTGTGAATGTTTTTTAAAGATGGATACCTTGAT^ 
GGCTTGGCCTNCCAAAGTGCTGANATTACAGGCGTGAACCACCGCGCC 
Sequence 1830 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGCGTTTGTAGATGGAAGGAAGAACTTG 

TGTGCTTAGACCTGACGCTGGGAGGAGATGCTGCCACCTAGGTTACTTGTAGGACCCTAT 

ACGGCAACCTCCTTTGCCAGGAACTATTTATAAACATCCTGCAGGAAAATGGTAAGCCCT 

TGGTTAAATTTATTGGCTTCATTTATGTATCTCACAACACTTC 

AGGAATAGAAACTTACTTTTGGAAATACAGTACCT 

Sequence 1831 

CCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGAGTGACTAAATTTGTAGGAATGATAT 

TTTCATGGGATTACTCAATCTCACCCACCATTAGTTGCAGGTGACAAGAAAGCTAAGTTG 

GCAGATGTTTGTGCTAGAAGCTGTGGGTTTACGTCTCCTTTGTGCATGTGTTCCAGACAT 

ACCAGTGGCTTGGTATTTAAACATCATGCTCAGGTGTGCAGGGTAGTTTTTGAGTTATAA 

TAGGTATGCAGGCGCTGTGGGATTACTTGGTTGTTTATGTAAAAATTATTTTGCACTCAC 
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TTCTGAAATGAGTGTTAGTAAGAATCATCTTTANAGGAGGTTCCAAGGCATTGAACTGAG 

ATACCTGCACTGTTTGCTGTAAATTTAAGCTTAAAATTGAAACCAGGTTATCAGCATTTC 

ATGCCAGGAGAGAGTGGGCATGMTGATTTNAGGAAATGAANAGCTAGATTTCACCTTGA 

A7TTGCTTCCACCCTTCTGTGGCAAATTAGTGTGGGCTCACTGAGCACTTTATCTGCCCG 

TGGTAATTTA 

Sequence 1832 

CCGCGGTGGCGGCCCGCCCGGGCAGGCACCTCCTCCCTATGCATCTGCTGTGGGAAGTGT 

GGGTAACTCACAGATGATGCCACAGGGCATGTATTCAGGCACCACAGGCAGCAGTGAATG 

AATGAAGTGAAATGGTAT7TATCCATTTCCTGGAAAGGTCCAGGGTTTGGCTCTGCAGGG 

CCAAGAGAACAGCTTTAGCTGTGCGTTAACCCAGTCCTGGAGAAGCCAGCAGGCCGTAAT 

CACGGGGAGGAAACCCATCTTTTAAGGGCTCCTCGCTCAGGTGGTGACAAGGTGAGGTGG 

TCATCTATGCTGTCTTTATCAGTATCTGTCCTAAATACTGTGCTCTGACATTGATGCTAA 

TATTCCATATTATCAGGGCTTCTGTGGTGTCTAGGCCTCTAATTTTTCTCTTCAGTAAGG 

GGAATAACTTTGCTGTGTCAGCGCAGTGCCTCAGGAGGGTTGCAATCTANAAACAGTGAT 

TGTTAAAATTGCCTATAAACACACAATATAAACAATATGCCACAGACACAGAGGCCCCCA 

GCATAGTTTCTGGAANGTGAAGTCTCACCTCTGCCANAGTANAATCACAAGNGGGAATTG 

AGGCTTGGCATCTGGGTGCCTCTTGAAACTCTTNCTGNGGCTCCTGGTAGACAATTGGCC 

CCCNAAACNCAGAAAGATTGAGATTGCCCATTGCGTTCAANAAATTGGNGGNNAATTAAT 

GTAATTCCNGAAAAAAAMNACACACA^4GCTTGCTTATNTTTAAACCNCAAAAAAAAAAA 

AAAAAAA 

Sequence 1833 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTGGGATTACAGGAATGAGCCACTGTGCCTGGC 

CTGATTGCGGATTTTATTGGAATAACCTATAAATATTGAACAATGCACCTGCGAAAGAGT 

ATGTCTTAATCTTCTCTTGGAATACTGTCAACAAAGATCTCTGAGTTTGAAGGGTAGAAG 

TAACTCACTACTTGGCTGCTGACTGTGAAATTTAGTCATGTTTTTATTGTGGGAGTGGAC 

TGAT TAAACAGACAAGTGTTATAAGGTTCAAATTACAACATTTCTCCTAATATGCAGTGC 

TTTTAACTATGATAAACACTGCACCCCCCTTCACAAAAAAAAAGAACCCACCCACCAATT 

TTTAACTTTCTTCTACAAACACCTCTTATCTCTTCTCTATCAAAGCATGACTGGCCTTTT 

TGAAATATTCAGGACAAAAAGTTAGGAAGAGGGCAGGCTCGCTAGTTCAATGTCACTACA 

TATAGGGGAAATGGMCACTTCCATANAAAGCTCTTGATTTCTCTCAGCGTGAGTCTNTG 

TCTGANGCATTCCCCTNTNTNTNTGCTACCAGACACCATCACAGCATGTCTTTNCAATCT 

GAGTCCTCAAANGGCATTCTGAMGTGTTCAGGTTATTAATGNCATACCCAAAAGCCTTA 

TTAAAACTGCACAGTTATTCTGCTTAAGGTTTACTGGCCATTAAAAATTC 

Sequence 1834 

CTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGTAGA 

TGGAAGGAAGAACTTGTGTGCTTAGACCTGACGCTGGGAGGAGATGCTGCCACCTAGGTT 

ACTTGTAGGACCCTATACGGCAACCTCCTTTGCCAGGAACTATTTATAAACATCCTGCAG 

GAAAATGCAGTGAAGTAGAAGAGACAGGGATATCCCAGAAGGTTATGCAAAACATCAAGA 

GAAGATGAGAGGAGTCTATATTGTCAAGAATACACATTTCCCACCTTGCCCAACAGTNGA 

AN AAAAAAAAACTANATTAAAAAAAAAANGGTACCTN GGCCGN TCTAAAACTAGTG 

Sequence 1835 

CCGCGGTGGCGGCCGAGGTACCAGGCTAGGCAGCTCTGGAGAAAGCAGAAGTGGATAAAT 

AAGGTGTGGACTCACCAAAGACAGTTCCAAAGTCAATTTCACTCTGACACACTCTCTGTG 

ATCTTCCACAGTCAGCACAATGCCTGCCCCCTGTGGGTGTTGTATAAATATTTGTTGAAT 

GAATGAATCAATCATTCAACAGACCAAGGCCAAATCAGAACCCCAAACCCTAAGGTCTTT 

ATACTCTCACTGTCCATCCATCGATCTTCCTGTCAGAAATCAGAATATACCTTTGCAATA 

CCCTTTGCTAGCCTTTCAGTTATCTTTTGAATAGAGGCTCTGAGCCTTGAAAATATTGCC 

TGGGAAATATTAACACCCATTTGAGTATCTCCCAAACACCTCAATTAACTATATGGTGCT 

GTCTAGCCAGGACCTTATTTCAGTATAATGTGAACCTGA 
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Sequence 1836 

ATACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGACTCAATCT 

TACTCCCTTGTAGAACAGGCGATATCTTCACCATGCGCACAATGAAATCAATACTCAGAG 

AAGGCAGATAATTCTCCACGAAGCCAGAAAACTAATAAATGAACAACTTGGGTGAAATGT 

CCCACCAGACGGTGTGATATTTAGTAGCCCAGAAAGCTGCCAAGGGGTTGAATGACACTA 

TCTGAAGATATGAMCCAGTTTTGCTCTCCATAGGGAGGATTTCATCAACAGGAAACAGA 

TGCCTGGAAGGCATTGGATTTGCTMGTGTCTATGCCATATGATTCTGCTGTTTGCGTTT 

GATTTCGAATGCTGAGCTGACTCAAAGTCAAACTGTAAGTACCTGCCCG 
Sequence 1837 

CCGCGGTGGCGGCCGAGGTACAAGCTAACTGTGCTAGGCAGGGCAGCCCTGTGAGTTCTA 

CTGCTGTCTTGGTTTTACAGAGGGGGAAGTGAGGCACAGAGAAGTTAATTAACCTCTGAA 

GTGTTGCAGTCTAAGGCACAGAGGCACAGTTCCAGGCAAGGTTCATCTGAATCTTAAGTC 

CTCACTCTTTGCCACCATCCTCCACTGCTGANACCATCCCTGTGAAGTCCTGCCGCTCTC 

CTCCCCTGGTCCATATTCACTGCTACTCAATGAGGCCAAGGAAGCCAATGGTCGTGTCCC 

CAAGAGGATATCTCTCCCCTCCTGAGAATCTTTCTTCATACATCTCAATTCTGAGATACA 

GATTGAGAAGCACCTNAGCAAATCCACTGCATGGAAGGCAAAACAACCTTGATCCACAGC 

AATTGACCACAAGCAAGGGAAGCCTGGC 

Sequence 1838 

CATACT TAGGG CGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACACAATTG 

TGCTGAi I l l f CTCTC ATTTAAAACGGTTCC NCAAGTAGTTAGTCCAGGGCTGGTGTCAT 

GGCTCCATGGTCACTGGGGCATTGTGTTCCTCCACGATTTCTGCTTCACCAGGCTTAGCC 

TTTGCTTCCATCCTCTAGTTTGCCTCATGGCTCAAGATGGCTGCTAGAGCTCCAGCTGTC 

TCTTATGCATTCCTGGCTGTAGAATGGAGGAAGGAGGAAAGGGTAACGGGACGCTTTACT 

GGCTATTCTTGTTTAAAGGAGCATCCTTGATCATACTTAATGAGAATCTGGAGTAATTTT 

CAGGAGATCTCAGAGAGGGTACCT 

Sequence 1839 

ACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGT 

GTCCTTCTTCAACTGGACACCTCTAGATAAGAAGGCAAGAAATTCTTAAMCTTTTGCTA 

ACCTAATTGTGTATGTAATTAGMTAACCAAGTTCAGTTAATTGAAACAATGAGTATTGG 

AGTAAGAGTTTGGGATTTGATTCCTTAGGGATAAACTACTAAATCCAGGACAGTCATTTA 

ACTGCTGCAGATTTGAGTTCCCAGGGGAACCAAACAGAAGTCACACAATTTCTAGGATGA 

AAACATCTCAAAAAAAATCAGTCCTCCCAGCACTTCGGGAGGCTGAGGCANGGAGAGCAC 

TTGAAGTCAGGAGTTGAAGACCAGACCAGCCAACATGGTGAAATCCCCGTCTCTATAAAA 

AATACCGAAATTTAGTTGGGCCGTGGTGACCCCGTGAC 

Sequence 1840 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGTTATCAGTATCTG 

TCCTAAATACTGTGCTCTGACATTGATGCTAATATTCCATATTATCAGGGCTTCTGTGGT 

^^^AGGCCTCTMTTTTTCTCTTCAGTAAGGGGAATAACTTTGCTGTGTCAGCGCAGTG 

CCTCAGGAGGGTTGCAATCTAGAAACAGTGATTGTTAAAATTGCCTATAAACACACAATA 

TAAACCAATATGCCACAGACACAGAGGCCCCCAGCATAGTTTCTGGAAGGTGAAGTCTCA 

CCTCTGCCAGAGTAGAGTCACAGTGGGAATTGAGGCTTGGCATCTGGGTGCCTCTGAAGC 

TCTTCCTGTGGCTCCTGTTAGACAATTGACCCCGAAACACAGAAGATAGAGATTGCACAT 

TGCGTTCAGTAATTGTTGTTTATTTATGTTATTTCTGAAAATAAAAGACAGCACAGG 

Sequence 1841 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCGCGGTTTTTAAATCTGT 
CACTGTTCTTCCCCCACTTCAAAGCTGACTTGACAACTGCATTTATTTCATATTGCAATT 

gctgcaggttaataatattgttttatcaggaaacatttatcttgaggaagaaag™ 
taatacagcagataacttatccaaggacaaatagaaaggggactgtgtctagagggtaat 

GCTAATGCCTGGCATGCTATAAAAATAAATGAAATCTGATTAACACTATTGGAGGGAAGA 
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ATAAGTCATCGTTTCTATGAAATATTATTTTCCTGATTGGAACCATTTCCATTTTCG 

GCTTAAAACATTGCATTAAAGATGAGATGACTTTAAAACGGTGACAACTATCTCAAGCCC 

TGTGATTATAAATGGTGTTATAGGCTTGTAATCCTATGGGTTTAGACTAAAGTATTTTCT 

TCATAAA7TGAAAGAGAAACACACTAGGAATTCAGAAGATT7TMTTAAAGTTGTTCAAA 

AAATTTAATGGGAATTTTAMGGCTGGCTCACT^ 

ATAGAAAGAAAGTTCTTTGGGCAAAATTTTGGTAATGCC 

Sequence 1842 

gactnctatagggcgaattggagctccccgcggtggcggccgaggtacaccagancttaa 
agtataataataaaamtaaagtattacagaacatggtcaattatattaaaaagtagatt 
aattaatgtcc atatatac attttttg gtg g aaatc aaac ataatg aaaactatttc ac a 
taattaatattttgttta1 i i i 1 i acttattaatttattgcagagatgggagtcttgcta 
tgttgcccaggctagtctcaaactcctggcctcaagtaatcctcccacctcaatccccca 
aaatgctgggatttcaggmcatgccactgcactttattagtgttacctgtcataaagtt 
aattactcataattttatttcagtatatatatatacacacacacaccacacacaccatag 
tagaataaattaattatattttctacaaaaat™ 

Sequence 1843 

atggagtcagctgcgatagcggcctgacgtccgccgtggtcgatatcgcgcgcgcgcagg 

agcaatttacggttgccctgaatggccgcttcaagggcggccatatcacgcgccattacg 

gcctgccggacaagcgcgtgcatgcgatccanctggagatgtgccagtgcgtgtacatgg 

atgaagcggcgccgttcggctaccggcctgacctggccgcgcaagtgcagccgctgctgc 

nccagatgatggac 

Sequence 1844 

CTAAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACl rrTTTTTTTTTTTTTTl 
TTTTTTTTTCTTTTTTTTTTT^ GGGCAACTTTCTGGTATTACTTGTAAAC NCTGGTTC 
CTTCAACTTTCTG ATATTACTTGTAAAC ACTG GTTCCTTNTCAACC ACCG N ATTCTG ATT 
GGGTCTATAAGTAGCACCCAGTCCACACCACANCACNCTTCTGGAGGAGGAATGCCAATG 
GGGACCCTGTCTGCAATGCCTGNGGGCTNTACTACAAGCTTCACAATNTTAACAGACCCC 
TGACTTTNAAAAAGGAAGGCATNCANACCANAAACCGAAAAATGTNTAGCAAATCCAAAA 
AGTGCAAAAAAGTGCATGACTCACTTGGAGGACT 
Sequence 1 845 

CCGCGGTGGCGGCCGAGGTACGCGGGGGTAGATGGAAGGAAGAACTTGTGTGCTTAGACC 

TGACGCTGGGAGGAGATGCTGCCACCTAGGTTACTTGTAGGACCCTATACGGCAACCTCC 

TTTGCCAGGAACTATTTATAMCATCCTGCAGGAAAATGGTAAGCCCTTGGTTAAATTTA 

TTGGCTTCATTTATGTATCTCACAACACTTCTTTTCTGTT^ 

TTACTTTTGGAAATAGAGTACCTGCCCG 

Sequence 1846 

CCGGGCAGGTACTTGCCAAGATCTCACTCCTCAGCAGCTTTTrATTTCTAAATTTAAGCT 

TTTGGTTACTGAAAGTrTACAGCAGTTCTAGACTCCTATTCTTTAGCATTCCGTTACGTC 

CAGACCATGTGACAACCTCCATTCTACAAATATGTAACCACAAAGAAGGTGAGTGGGGAG 

GCAATAAGAACCCAGACTAACAGAACATCAGAAAAGCTGGCATACGCACACATAAAGAAG 

GGGGAGAATATAGGACTATATGGGAGGGAGGMGGTCACAGAAAATGATTACGGTTTTGA 

TGTTGTATTTGTGMGGTGAGGAATACTAACTATATTTCTTTAGTCTTTGAAGATCACCA 

TATTTAGCCATTATTCTATCAACTGTTAAGCCTTGG 

Sequence 1847 

C ATTTTTGTTANCAN NGAAGTG AG N GGC ATCTATGTATTTTTTAAGGTATATAATG AAAT 
TGTGCCTAGGGGAGTNATAATTTACTCTATGTATTTNTATATNTACTTAAATCAATACAT 
TCTTACAGACTANCTTCTTTAGTGGTAACATACANGCCGATTTTCTCACTCCATGAATGA 
. TTACNGACATGTATCCANNANCGGAGGTCCCTCATCCACCATTTCACCCAGGTGTCTTGC 
TCTTATCT1TCANAAGGGAAAATTGGCTAGCNGGTTTCTCTCCACCATGTGCTGTTCTCC 
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AGGGACTTTGGGTGAATCCAGGTGTGGGAAAGAAGGTAGCATCAGCTGTAAGATTCAAT 
Sequence 1848 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGATCTGTAATTTTGA 

CATATATAATATTTTAGCAGTGGGTATTTATTTCTAGATAAATTAAAATAATAATAATGA 

TACAGCGCAAAACGATAAAAGCTGAAAGGAAAATATGGTGAGGTATAAATGTAGAAAGGA 

CAGAGATTTCTCCTCCAAATACTGTTTCTCTTGGCCCATAATTTAATCATTAAAATATCA 

AAGATAAAATTGTAGGGCAAATAAAATATCCCACTTGGACAAAAATCAGTAGTATTTCTA 

TACACCAAAAACGTTCAAGATCAGCATCAAATCAAGAACATGATGTCATTCCAAATAGCC 

ACAAAAAGTTGAAATACCTAGGAATATGGATAACCATGGAGGTGTGAAAGATCCGCTACA 

AAGAGTGCTACAAAATACTGCTGAAAGGAAATTC 

Sequence 1849 

CCCCGCGGTGGCGGCCGCCCGGGCAGGTACATGCCAGGCACAAACTGGAGTCACAGATGC 

CACACTGACTATCACATTGTCATC ATACTC AGAGAGAGAAAG GTGTATAC ACCC AG TATT 

TTCTTATGTTTTCTCTATACTTCCATATCCCCACCCCCATTATCCCCCGAGACATTCCCA 

GTTGGAAGTCAGTTGCTTATGGGAAACTGGCTGCAGGGGGATCAGCCCACCTGTAACACA 

CGGAAAGGCAGAGAAACAATGAGTCATGGATTTGAGTGCAAACAAGCCCAGTCCTGCTAC 

AGAGAGTTACGGAAAAAATCTCACTGTTGAAGAGAGACATCTATAAGAGAGGTGAGGAAA 

ATGACTGATANATCCTTTTACAATATTCTAGGGGTAGTCTGTCT 

Sequence 1850 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGTTTCCTGCGT 

TTGTAGATGGAAGGAAGAACTTGTGTGCTTAGACCTGACGCTGGGAGGAGATGCTGCCAC 

CTAGGTTACTTGTAGGACCCTATACGGCAACCTCCTTTGCCAGGAACTATTTATAAACAT 

CCTGCAGGAAAATGTCAGAGATGGGAAGAAACAAGAACTTTGACATGCTTGGTGTTCTTG 

CCCAAGCTTTGAAGMGTTTACAAAGTCTATATGTCAGAATACACATTTCCCACCTTGCC 

CAACAGTAGAAAAACATAAGAAGAGAAAAACATTAAAAAATGACAAGGAAGTTAATGGAA 

GTCAGCAATGTGATGGTGTTTGGGAGGGTGGGAGCCTTCAGNAAGGTAATTAAATGCCCT 

TGTAAG 

Sequence 1851 

CCGCGGTGGCGGCCGAGGTACAGAGAGTTCCCATAGACTCCTGCCCCCTCCCACAGACAG 

ACTACCCTAGMTATTGTAAAAGGATCTATCAGTCATTTTCCTCACCTCTCTTATAGATG 

TCTCTCTTCAACAGTGAGATTTTTTCCGTAACTCTCTGTAGCAGGACTGGGCTTGTTTGC 

ACTCAAATCCATGACTCATTGTTTCTCTGCCTTTCCGTGTGTTACAGGTGGGCTGATCCC 

CCTGCAGCCAGTTTCCCATAAGCAACTGACTTCCAACTGGGAATGTCTCGGGGGATAATG 

GGGGTGGGGATATGNGAAGTATAGAGAAAACATAAGAAAATA 

Sequence 1852 

TTCCTAGTCACATCTGTTGTTGCAGCATGTTATCAGTAGAGCCTCCTTTTACTCTTAAAA 
ATGCCCTGGTTTGGATGATCGATCATACCTGCCCCCATCCCCACCCCCCCGCGTACCT 
Sequence 1853 

CTNCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGCAT 

TGGTGAATGAGAATGAGCTACTGAAAACAAAAAGAGGGTCTTCTACTCAGCCTCTACCCC 

TMTATTTATATCAGAAGCAGAGATTAACTGTCCTTACTCATTCACACGTTAATGGAAGA 

GAAGGAAGTTTCCTAAAAAAAATCCTCCCGCTCCACCCTGCAAACTTTATG I I I I i CTGT 

TACATMTCAGGTAGGGGCAAGACCTAAACTATTTTGAATTGGTGGTGTTGAGGCTAAAT 

TCTCTGCTATTGACAGAATTGAGAATGTGATCAATTTCAGAGTAGCATGTTACAAATTTT 

GTCCCAATTTCAATGGGGAGAATTATAACAAATCAGTAGGTGGNTGGCAGAATCTGTTCT 

TACTTTCCTCCCATCCAGT 

Sequence 1854 

CCGCGGTGGCCGGCCGCCCGGGCAGGTACCTCCTCCCTATGCATCTGCTGTGGGAAGTGT 
GGGTAACTCACAGATGATGCCACAGGGCATGTATTCAGGCACCACAGGCAGCAGTGAATG 
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AATGAAGTGAMTGGTATTTATCCATTTCCTGGAAAGGTCCAGGGTTTGGCTCTGCAGGG 

CCAAGAGAACAGCTTTAGTTGTGCCTTAACCCAGTCCTGGAGAAGCCAGCAGGCCGTAAT 

CACGGGGAGGAAACCCATCTTTTAAGGGCTCCTCGCTCAGGTGGTGACAAGGTGAGGTGG 

TCATCTATGCTGTCTTTATCAGTATCTGTCCTAAATACTGTGCTCTGACATTGATGCTAA 

TATTCCATATTATCAGGGCTTCTGTGGTGTTTAGGCCTCTAATTTTTCTCTTCAGTAAGG 

GGAAT 

Sequence 1855 

CAGGTACATGTCAAAGGAAAAACACGTGAAAGATGAATTCAGCCAAACCCACCAGTGTTC 

AACCTCAGCCTAATCAATCTCATACTCCTAGAGGCTTAAGTATCAGCAGGTAAGATCGTG 

ATGACCTGTCTNTGAGGCTCCAGACAATAATTTCTAACTGCCAACTGGAAATCCTTATAT 

GGTTAGGCTGCCAACATCCCANGGAACAGGACCAAAATAAAAAGCATCACTCATTATCCT 

ACTGCAATTTTCCTCTTCCCTTTGTCAAATGGGAATGATCTTTACNATCATGATCCTTNA 

TTGCANCCAGNACAGAAATCATGAATGTCATCTATGGCCCCTTNCTATCACTCCGGGCTC 

CAATTAGTTGTCCTAT 

Sequence 1856 

CGACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAGA 

GCTCCCATTTCCCCACATCCTGGATGATACATGTTATTTAGACATTCTAATTTCTTTTCC 

AATCCAGTAAGAACGACATGGCMCTCATTATATTCATTTGCATCCACTCTATTGCTATT 

GAGATTAAGGGCTTTTACTGTTTTTCTTATATTTCTTGGCCACATGGCTTCA 

GAATTGCCTTTCTTATGGTCCTTTCATGGTTAAMCAGCCCTTTAGAAGACTCGGATTT^ 

AAGGTTGGATTAGGGGTTGGGATTTGAGAGTTTTACACCATGGCCATGCCTCGTTTCCCT 

AAGTTATCATTAAAATGCTCAGCCTCCACAGCCCATATCCATGCATATCACCCAGGTTGT 

CCCAGAACATTCAAGAACTTACTGCTCTTGCTTTGAAT 

Sequence 1857 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGAGT 

GACTAAATTTGTAGGAATGATATTTTCATGGGATTACTCAATCTCACCCACCATTAGTTG 

CAGGTGACAAGAMGCTAAGTTGGCAGATGTTTGTGCTAGAAGCTGTGGGTTTACGTCTC 

CTTTGTGCATGTGTTCCAGACATACCAGTGGCTTGGTATTTAAACATCATGCTCAGGTGT 

GCAGGGTAGTTTTTGAGTTATAATANGTATGCAGGCGCTGTGGGATTACTTGGTTGTTTA 

TGTAAAAATTATTTTGCACTCACTTCTGAAATGAGTGTTAGTAGMTCATCTTTAGAGGA 

GGTTCCAAGGCATTGMCTGAGATACCTGCACTGTTTGCTGTAAATTTAAGCTTAAAATT 

GAAACCAGGTTATCAGCATTTCATGCCAGGAGAGAGTGGGCATGAATGATTTCAGGAAAT 

GAAGAG 

Sequence 1858 

CCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGTAAGTTTAAAATCAAAAGCAAGANAT 

ATGATACTTGCAGCTCANAAAGCAGCAGGACTTCGGTGGGTTGGAACGCAACTCTTCTTG 

CTGTTGTCCCAGGTGATTTTTCATCAACCAAGTCTTAGATCAATCAATTACACAAATATG 

ACACTATTACATCTTTCTTCTGGAGTTATCTTTCTGTTGGTCTAATACTCAGATGCTTCT 

GAGCCACGCAAAGTAAAAGCCCCAGGTNGAATATCCATGAGATGGGACTGATTGTGCTTC 

CATGGTTTACCTATCTCAGTGGCCTGCCCTCCTGGCCCTGTGAAATCAGTTTGGAAACTT 

GCCCTTCTTCTTCCTCCTGGTGGTCTTATTAA 

Sequence 1859 

GACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGACGCGGGAGTTTTTC 

AGAATCAGGCCTGCTTTGTCCCTAAATGCTTTCTTATTTCCCCAGAGCCCTTATTGGATT 

TAAGCCCCATATCATATCTCACATACCTGAGACAGACATACCACAGTGAACCCTGCATGT 

CCTTGTAAGTTCTTTTTCCGTGCCTAGAAGTTGCAGAAGACCAAGGAACCTTGGAAGTGA 

CTTACTCTTTTGCTTCATAAAGTTGCTTTAAAACAGCCTTTATTT^ 

TATGTCACACATTTTTAAATTTTAAGGTTAATTTAGTCATTAATTCTGTCA 

TCTTTGGAAGCTAGTTGTCTTCAATGTCTTTTACTTGGAGTTACTTGAAGTTTGCTTA^ 
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Sequence 1860 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCAGGTTCAAATAGTCAGCAGCTCATCATAATC 

AATGAGCGAGGACATAAAGTAGGAAAAATGCATCACCATGGTGGGCAAGGAAAGCAAGTT 

ATTGGAGGCACATGTTAACACATAAAATATAAMTTAATATGATCACCACTGGAAAGGCT 

TGCCTGAGCCCACAGTTTGAATGCCTACAATAAGATGAGATGCACAACAAAAAGCAAGAG 

AACCTGATCAAGTGGGTGACCTGGCCATGGTGCTCTCATCAGTGGGGACCCAAATGCTTA 

TGTGGACTCACCAGGTATCGAATTAGACATGAATAGGGAGTGTTTGTTGTGATGGCAAGA 

AACTATATAATCAMTGMTACAATGAAACTTTAAAAATAATTGTMGGAATCTTTACAC 
CAGC 

Sequence 1861 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTTAAMCATTTTTTCC 

TTGAGAAATCATTGCCTAAATCTGATACATATGTGCTTCTCTGCAGAAAAAAAAAATGAA 

ATAAMTAAAAATCTTTATTTCCAATGAGCTMTGAGGAAAAAGGGGATGATGAAAAAA^ 

AGGGAGGTGGGAAGCAATTTGGAAATGGAAAGAATGCTGAATACCTACATGATGCCTAAT 

TTAGA^AACATTCTGGTATGTAGGACATTATTTTC 

mum I ATATGCAGATGTATCTCTACTCTCTAAAGATGTACATGTGTACACTTATAGTT 
GAGAGCCAAGTCTCCCTTATCATTGGTGAATGAGAATGAGCTACTGAAAACAAAAAGGAG 
GGGTCTTCTACTCAGCCTCTACCCCT 
Sequence 1862 

TATCACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACACT 

GCACACCACAGGGTGGCTGCACTTTTTGGGTGGTGGGGTGGGGATGGGGGCAGGTATGAT 

CAATCATCCAAACCANGGCATTTTTAAGAGTAAAAGGAGGCTCTACTGATAACATGCTGC 

AACAACAGATTTNANNTAGGAAACTGGCCCGCTCTANNAACTANT 

Sequence 1863 

CCGCGGTGGCGGCCGAGGTACGCGGGGTGAAGGGCTCTCTGTATACTGCTGAAGATTCTA 

TAGTTTCTCCTGA AATCC ATGAGGGAGCCCCCTGAAGGATTTTGGCCAGNAAAGTGTCAT 

GAATCAAATTTCTATTTTMGAATAGTATCGAGGCAGCAATGTCAMGTTTGGTTTANM 

AATANGGATGGTTCCTGTAATCCCAGAACTTTTGGAGACCAAGGCGGGAGGATTGCTTGA 

GTCCAGGAATTTGAGTCCAGCCTGGGCAACATGGTGAGACCTCATCTCTGCCAAAAAAAA 

ANNNNAAAANAAAGGTCCCTGCCCG 

Sequence 1864 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGGGGGCCATTGAGACTGCCAT 

GGAAGACTTGAAAGGTCACGTAGCTGAGACTTCTGGAGAGACCATTCAAGGCTTCTGGCT 

CTTGACAAAGATAGACCACTGGAACAATGAGAAGGAGAGAATTCTACTGGTCACAGACAA 

GACTCTCTTGATCTGCAAATACGACTTCATCATGCTGAGTTGTGTGCAGCTGCAGCGGAT 

TCCTCTGAGCGCTGTCTATCGCATCTGCCTGGGCAAGTTCACCTTCCCTGGGATGTCCCT 

GGACAAGATTTTACTCCCATAACCCAGGCAGAAGTACCT 

Sequence 1865 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGTAGATGG 

AAGGAAGAACTTGTGTGCTTAGACCTGACGCTGGGAGGAGATGCTGCCACCTAGGTTACT 

TGTAGGACCCTATACGGCAACCTCCTTTGCCAGGAACTATTTATAAACATCCTGCAGGAA 

AATGCAGTGAAGTAGAAGAGACAGGGGATATCCCAGAAGGTTATGCAAAACATCAAGAGA 

AGATGAGAGGTCAGAGATGGGAAGAAACAAGAACTTTGACATGCTTGGTGTTCTTGCCCA 

AGCTTTGAAGAAGTTTACAMGTCTATATGTCAGAATACACATTTCCCACCTTGCCCAAC 

AGTAGAAAAAAAAAAAAANNAANAAAAAAAAAAGGTTCCTGCCCCG 

Sequence 1866 

GGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGTATTTAAAGGAGGAAAGAGCGATG 
AATCTACATGGAAAATTAGGTGAAGGAGTGGAAAATTGCATCGGCCATATAAGTCCAACT 
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ACATATGGATCTATTCCTCCATGAAAAACTGGAAATTTCACTTTAACATGGTATTTGCAC 
AGTCAGCAGCAATTCTTAACAATGTCTTTCTTGCTGCTGACATCATATTTTTTCACCCCA 
ATAATAGAGGATAATTTGGACCCTTTCTGCTTTTCATT^ 

CACTTCTCCTACTTCTCTTCCCACATGAGAGATGATTGTATTACTTGTGGAATATGATGA 
ATCACTTCAACAAGAAATATGTGGCATCTGGTTAAGTTTCTTTCTAGCATATAAATTAAG 
CTTGAATCACAGCTGGGTCAAACATTTTCTTTGAAACGGG 
Sequence 1867 

ATCGACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACTTATAGT 

TGAGAGCCAAGTCTCCCTTATCATTGGTGAATGAGAATGAGCTACTGAAAACAAAAAGAG 

GGTCTTCTACTCAGCCTCTACCCCTAATATTTATATCAGAAGCAGAGATTAACTGTCCTT 

ACTCATTCACACGTTAATGGAAGAGAAGGAAGTTTCCTAGAAAAATCCTCCCGCTCCACC 

CTGCAAAC7TTATGCTTTTCTGTTACATAATCAGGCAGGGGCAAGACCTAMCTATTTTG 

AATTGGTGGTGTTGAGGCTAAATTCTCTGCTATTGACAGAATTGAGMTGTGATCAATTT 

CAGAGTAGCATGTTACAAATTTTGTCCCAATTTCAATGGGGAGAATTATAACAAATCAGT 

AGTGGTTGGCAGAATCTGTTCTTACTTTCCTCCCATCCAGTGACTAGGATTTGGAAAGCA 

GTGTTTT 

Sequence 1868 

CTNCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACTTATAGTTGAGAG 

CCAAGTCTCCCTTATCATTGGTGAATGAGAATGAGCTACTGAAAACAAAAAGAGGGTCTT 

CTACTCAGCCTCTACCCCTAATATTTATATCAGAAGCAGAGATTAACTGTCCTTACTCAT 

TCACACGTTAATGGAAGAGAAGGAAGTTTCCTAGAAAAATCCTCCCGCTCCACCCTGCAA 

ACTn"ATGCTTTTCTGTTACATAATCAGGCAGGGGCAAGACCTAAACTATTTTGAATTGG 

TGGTGTTGAGGCTAMTTCTCTGCTATTGACAGAATTGAGAATGTGATCAATTTCAGAGT 

AGCATGTTACAMTTTTGTCCCMTTTCMTGGGGAGAATTATAACAAATCANTAGTGGT 

TGGCAGAATCTGTTCTTACTTTCCTCCCTCCAGTGACTAGGGATTTGGAAAGC 

Sequence 1869 

CGTATACGACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGT 

ACTTTTT7TTTTTTTTTTTTT! IGCAGAGATGAGGTCTCACCATGTTGCCCAGGCTGGAC 

TCAAATTCCTGGACTCAAGCAATCCTCCCGCCTTGGTCTCCAAAAGTTCTGGGATTACAG 

GAACCATCCTATTTTCTAAACAMCTTTGACATTGCTGCCTCGATACTATTCTTAAAATA 

NAMTTTGATTCATGACACTTTCCTGGCCAAAATCCTTCAGGGGGCTCCCTCATGGATTT 

CAGGAGAAACTATAGAATCTTCAGCAGTATACAGAGAGCCCTTCACCCCGCGTACCT 

Sequence 1870 

CCGGGCAGGTACACAAAAAAATGCCCCATTATCCCTTTGCTGTGGGAGCAGTGCTGATGC 

TAGTGGCTCGCTCTTTGGTCCTGGCTCTGACTCCACCAGCAGTAACAACCAGCAGTGGTG 

GAGGCTGTGGGTGCMTAGACTTTTATCATGGCCACTCTTCTGGAAGATGTCACTGCCAG 

CAACTCCTGACTCAGCTTTGGCAGCAGCTGCCGCATCGAAGACTGTTGTGAGTGGGGAAA 

GGTAATGGGGCTGCAGGAATGTGAGGATGTAGGGACTTTGGGACCTCCAAGGCAGGATGC 

AGTCTAGACCTCCAAGGCTTGGCTTTC AAAATG GCACC 

Sequence 1871 

CCGGGCAGGTACTTGCCAAGATCTCACTCCTCAGCAGCTTTTTATTTCTAAATTTAAGCT 

TTTGGTTACTGAMGTTTACAGCAGTTCTAGACTCCTATTCTTTAGCATTCCGTTACGTC 

CAGACCATGTGACAACCTCCATTCTACAAATATGTAACCACAAAGAAGGTGAGTGGGGAG 

GCAATAAGAACCCAGACTAACAGAACATCAGAAAAGCTGGCATACGCACACATAAAGAAG 

GGGGAGAATATAGGACTATATGGGAGGGAGGAAGGTCACAGAAAATGATTACGGTTTTGA 

TGTTGTATTTGTGAAGGTGAGGAATACTMCTATATTTCTTTAGTCTTTG 

Sequence 1872 

AGGTACGCGGGGGTATTTAAAGGAGGAAAGAGCGATGAATCTACATGGAAAATTAGGTGA 
AGGAGTGGAAAATTGCATCGGCCATATAAGTCCAACTACATATGGATCTATTCCTCCATG 
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AAAAACTGGAAATTTCACTTTAACATGGTATTTGCACAGTCAGCAGCAATTCTTAACAAT 

GTCTTTCTTGCTGCTGACATCATATTTTTTCACCCCAATAATAGAGGATAATTTGGACCC 

TTTCTGC1TTTCATTTAATTCTGTCTATTTCTGATTGCACTTCTCCTACTTCTCTTCCCA 

CATGAGAGATGATTGTATTACTTGTGGAATATGATGAATCACTTC 

Sequence 1873 

AGGTACGCGGGGTCCAGTGGACGCCAGGGATCTGAAGGGCAAAGGCAAGGGCTGCTGGAG 
CCTGCATCATGTCGAGTCCGCAGAGGAGGAAGGCTATGCCCTGGGCACTGTCACTGCTTC 
TCATGGGCTTCCAGCTCCTGGTGACTTAT 
Sequence 1874 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAGAAAMCTCATTTCAGTTAAGGGGAAAAAAA 

GAGTAAGTGAGCCAGGATTACAGTCCAACTCCAGTCAAAGTTAGTATAAGAAATAAGAGA 

ATAAATAGATCAACATCCTCACAGACAATGAAAGCACACCGGAAGGACATGACCACAAGG 

CAGATAAAAACCTAATATTTGCCAAAAAATAAAATGCAAATATGACAGGAMTTTATATT 
AATTAGTACCT 

Sequence 1875 

CCGGGCAGGTACCAAAAGMTGTTCATAGCAGCATTATTTGTCATAGTCAAGAGCTGGAA 

GCAACCCAGTTGTGCATCAATGGTAGAATGGATGACTAAATTGTAGTGTATTCTTACAAT 

AAAATACTATAGAGCAATGAAAATAAATAAACTATGGCTATATGCAACAACATGGATAAA 

TCTTACCAACAATGTTAGACAAATGTAGCCAAACCAAAAAATACACATGTTCTATGAATC 

CATATATGCAAGGTTCACAATAGGTGATAGTAGCTACTCTTTTGGGAAGGAGTGAGGAAA 

GGACTGAGAGAGTATTGCAGAGGGGGCTCCGGGTGTTTGTAA 

Sequence 1876 

CCGGGCAGGTACTTTCTTGGCCAGACATCATACATCTGCCTCTATAGAATCCCTCATTCA 

GGGGGTCACTGATGAAGTGTAAGGCCCTAGGGTTGAGCTTCAGCTCCACTGTGGGGTAGG 

TTTTTACCTTCTCCAAAGCTTCATTTTCTCATTTGTAAAATGCAGATAATGTGTGTCTTG 

CCAAACTCTAAGGGCTGTTGTAAGAGGTTTAAATGGCATAACAGATAGAAATGATTCAAG 

CTATTAAGTCC7TTATAMTGTGTGGAATC 

ATTATCTCAACAAAGGTGATTTTCTTGTTAAGAAGGTTTAGTAGCCAGGCATTGGACAAC 
T 

Sequence 1877 

CCGGGCAGGTACTTAGGACTATGGAGAMCAATGGAAGACATTAATTCTGCTTGTCTTGA 

TGGTAAGGGTGGGGATGAAAATACAGATCGAATTTGGGGGAGCATGTTAAGGTTAGGGGA 

ATAGAGATAGCTTATATTTAGAACAAGTTAGTAGAGACTGTATGATTTTCCAAATAGAGA 

AGGGTAAAAAGACATTCCATAGAGGAAGAAAATTACATGCAAAACCACAGAGACCTTAGT 

TTATAAAGAGTAACTGGGTATGGCCGGGTGCGGTGGCTCACGCCTGTAATCCCGGCACTT 

TGAGAGGCTGAGGAGGGTTGATCATGAGGTCAGGAGA 

Sequence 1878 

AGGTACTTCTGCCTGGGTTATGGGAGTAAAATCTTGTCCAGGGACATCCCAGGGAAGGTG 

AACTTGCCCAGGCAGATGCGATAGACAGCGCTCAGAGGAATCCGCTGCAGCTGCACACAA 

CTCAGCATGATGAAGTCGTATTTGCAGATCAAGAGAGTCTTGTCTGTGACCAGTAGAATT 

CTCTCCTTCTCATTGTTCCAGTGGTCTATCTTTGTCAAGAGCCAGAAGCCTTGAATGGTC 

TCTCCAGAAGTCTCAGCTACGTGACCTTTCAAGTCTTCCATGGCAGTCTCAATGGCCCC 
Sequence 1879 

AGGTACC CAAGC CAGAAAGTGAGCCCACTCTGACTCTCATGAACTGGGTCCTGATTCCTT 
TAAGGTTATTTTGCCTAAGTATATTTCCACTCTGGCCTCTCCTTTCCATGCCCACTGCAA 
CCACCAAATGTGCCACATCCCACCCAGATAAGTACCTGCCCG 
Sequence 1880 

CCGGGCAGGTACGCGGGGAGACCTGACGCTGGGAGGAGATGCTGCCACCTAGGTTACTTG 
TAGGACCCTATACGGCAACCTCCTTTGCCAGGAACTATTTATAAACATCCTGCAGGAAAA 
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TGCAGTGAAGTAGAAGAGACAGGGATATCCCANAAGGTTATGCAAAACATCAAGAGAAGA 
TGAGAGGTCAGAGATGGGMGAAACAAGAACTTTGACATGCTTGGTGTTCTTGCCCAAGC 
TTTGAAGAAGT7TACAAAGTCTATATGTCAGMTACACATTTCCCACCTTGCCCAACAGT 
AGAAAAACATAAGAAGAGAAAAACATTAAAAAATGACAAGGAAGTTAATGGAAGTCAGC 
Sequence 1881 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAGCACGTGCAGAAAAGAAAGA 

GTCTCCGCTTG TCTTTGTC TG ATTCTCCTGTCCTCTCCATGGAAGTTACATTTTCTGTAA 

AGGATGAGCTGAAAATTCTCCTGGTCGTTGCCAGTTGAACTTCTGCTGTGCTC.TGGGAAG 

GCATTCTCACTCTGTTTATGTTGTCTAAGTGCANACATGGATGTGCAGGTTTGCTAGAAC 

CTCCTGAGGATGTGCAATGGTTCTGTTCATGCCTGAATCAGTTCTTTTGGGAGTGGACAT 

TCTTTCTCTCCCTCATGCACAGCCTCAGGCACATGGCTTGAGCTATGGCGGCACGCAGNA 

TGGCCATCACCCAGGTACCTGC 

Sequence 1882 

AGGTACACTTATAGTTGAGAGCCAAGTCTCCCTTATCATTGGTGAATGAGAATGAGCTAC 

TGAAAACAAAMGAGGGTCTTCTACTCAGCCTCTACCCCTMTATTTATATCAGAAGCAG 

AGATTAACTGTCCTTACTCATTCACACG1TAATGGGAAGAGAAGGAAGTTTCCTAGAAAA 

ATCCTCCCGCTCCACCCTGCAAACTTTATGCTTTTCTGTTACATAATCAGGCAGGGGCAA 

GACCTAAACTATTTTGAATTGGTGGTGTTGAGGCTAAATTCTCTGCTATTGACAGAATTG 

AGAATGTGATCAATTTCAGAGTAGCATGTTACAAATTTTGTCCCAATTTCAATGGGGAGA 

ATTATAACAMTCAGTAGTGGTTGGCAGAATCTGTTCTTACTTTCCTCCCATCCAGTGAC 

TAGGATTTG 

Sequence 1883 

GGCGGCGGCCGAGGTACAGTGGTTTACGCCTGTTAATACTATGCACCTTGGGANNNCGAG 

GATTACAGGCGTGACCCACCGCGCCAGACCACTTGCAAATATTTCACAGAGCCAATGTGT 

AACTTGAGTGGTGCAAGGACATGAAGTTTACCCAAGGCTTAACAGTTGATAGATAATGGC 

TAAATATGGTGATCTTCAAAGACTAAAGAAATATAGTTAGTATTCCTCACCTTCACAAAT 

ACAACATCAAAACCGTAATCATTTTCTGTGACCTTCCTCCCTCCCATATAGTCCTATATT 

CTCCCCCTTCTTTATGTGTGCGTATGCCAGCTTTTCTGATGTTCTGTTAGTCTGGGTTCT 

TATTGCCTCCCCACTCACCTTCTTTGTGGTTACATATTTGTAGAATGGAGGTTGTCACAT 

GGTCTGGGACGTAACGGAA 

Sequence 1884 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGG 

TAGAACAGGCGATATCTTCACCATGCGCACAATGAAATCAATACTCAGAGAAGGCAGATA 

ATTCTCCACGAAGCCAGAAAACTAATAAATGAACAACTTGGGTGAAATGTCCCACCAGAC 

GGTGTGATATTTAGTAGCCCAGAAAGCTGCCAAGGGGTTGAATGACACTATCTGAAGATA 

TGAACCAGTTTGCTCTCCATAGGGAGGATTTCATCAACAGGAAACAGATGCCTGGAAGGC 

ATTGGATTTGCTMGTGTCTATGCCATATGATTCTGCTGTTTGCGTTTGATTTAGAATGC 

TGAGCTGACTCAAAGTCAAACTGTAAGTACCT 

Sequence 1885 

ANGGCGAATTGGAGCTCCCCGCGGTGGCCGAGCGGCCGCCCGGGCAGGTACTGTTACTAT 

CCTTAAGATGAGGGAACTGAGGAACCAAGAGGTTAAGCAATTTGCCTTTGGTTCACATAG 

CTAATGATGTGGAGATTTGAACTTAGGCTGTTTGTCTCCCAAGCCTATGTTCTTAAATTT 

GGGGAAATAGTAMGATAATTTCCACAATGTGMGACAGTTAGCAGCCTTAAGGATGAAA 

GGATGGTGCAAATACCATGCCCAGTGAGTGACAGAGTATCAAGGCTGGTAGAGCCTGATG 

AAAGCACAAGTTTTCAGAAMGAGGGAAACAAACAATTCCTATAAAGTTAAGAAAATCAC 

ATTGACACCAGACTTCTCATTGGTAGAGACCACAAAGCCCTTTTCAAAAGTGGGTCAAAC 

TGGCAGGCTGAGCACATATCCTTCCCTCCCTTTCTG 

Sequence 1886 

CTCCTATAGGGCGTTTTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAAAGA 
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ATGTTCATAGCAGCATTATTTGTCATAGTCAAGAGCTGGAAGCAACCCAGTTGTGCATCA 

ATGGTAGAATGGATGACTAAATTGTAGTGTATTCTTACAATAAAATACTATAGAGCAATG 

AAAATAAATAAACTATGGCTATATGCAACAACATGGATAAATCTTACCAACAATGTTAGA 

CAAATGTAGCCAAACCAAAAAATACACATGTTCTATGAATCCATATATGCAAGGTTCACA 

ATAGGTGATAGTAGCTACTCTTTTGGGAAGGAGTGAGGAAAGGACTGAGAGAGTATTGCA 

GAGGGGGCTCCGGGTGTTTGTAATTTCTATTTCCCAGTGCAGGTGTGGT7TGCATGAGCA 

TGTTCACTTTGTGGAAATTCATTGAGCTGTAACTAAATATACAGAATTATTTN 
Sequence 1887 

CCGCGGTGGCGGCCCGAGGTAC I II H I U M I M U I HI I I GTTTGTTTGTTTTTTTT 

TTGAGACAGAGTCTCGTTCTGTCACCCAGACTGGAGTGCAGTGGTGCGATCTCGGCTCAC 

TGCAACCTCTGGGCTGGGCAACGGTGTGAGACTTGGACTCAAAACACACACACACACACA 

CACACGAAGCGGGAAAAATTCTGGTGCTTCGCAAAGTCATCACAATACAAATCAAAATAA 

ATTTGTCTT AAGC CACTTAGACAATACTAACCAACTTTCACCAGGTGTAATCCATGCACA 

ACCAAAACCTTTTACTGTGAGAGCGAAGCTCTTATGGAGTTCTACCCTCATCAGNGATCT 

CCATTA TAAGG ATATAAMGACTAGTAGATTCTATTCATGAMCCATTTTTCTTGGCTTG 

GTATTAATTTTGGTAAGAGATGAGTTACCAAGTTAGAAATTAGTGTGGTCCCAMTTAAA 

0 

Sequence 1888 

CCGCGGTGGCGGCCCGCCCGGGCAGGTACTTTN I I 1 1 M I II M I I n I 1 1 1 M 1 1 1 I A T 

TTTTNGAGACAGCTTTCTATTCTGTCACCCAGGCTGGAGGCCATTANAAMTAATTTGTG 

CTGTTCTTATAAATATGAAGAACTTTGGAMTATCTNTGTGACATCGGATGCAGTTTGTG 

TTTGGGMGTATCACTGATAA GAAAT AATACTAAAAATTAGCAACCCCATTGAAAAGTGG 

GCAAAGGACATGAACAGACACTTTTCAAAAGAAAAAATTCATGCANCCAACAAATATATG 

ACAAACAGCTCAACATCACTGATCATTAAAGAAATGCNAATCAAAANCCNCATTGNGATC 

CATCTCACTCCAGTCAGAACCACCATT 

Sequence 1889 

TNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACTGTATTTCCAAAAG 

TAAGTTTCTATTCCTAGAACTAAAACAGAAAAGAAGTGTTGTGAGATACATAAATGAAGC 

CAATAAATTTAACCAAGGGCTTACCATTTTCCTGCAGGATGTTTATAAATAGTTCCTGGC 

AAAGGAGGTTGCCGTATAGGGTCCTACAAGTAACCTAGGTGGCAGCATCTCCTCCCAGCG 

TCAGGTCTAAGCACACAAGTTCTCCCGCGTACCTGCCCG 

Sequence 1890 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAGCTTTAAATG 

GGAATAAATCTTTTATAAGCCTTTCCATTTACTTTCCTGCTTATTAAAGGTGAATT 

TGGAAAACCCTAAATGTCACAGAGTCTGGAAAATTGAAATAATAAGGGGCAGCATTCTCC 

TTAAGTGGTATAACCTAGTAAAAAGTGGGCCTAATGGAAAGTCCACCGTGGAATCATCTG 

TGTTTCTATGCMCAAMTTATmCC^ 

ACAAGGTTGGCTGTCTTAANCATTGNGGTGGAGAGCAATGGAAAGATGAGTGTGTAGTGG 
GAG 

Sequence 1891 

CGAGGTACGTAGGGGCCCTTTGCTGCAAGCTTAGGCCAGTGTCAGCTCCTGAGCCACTCC 

TGACTCAG TTGG AGGAGACTCAGTGTCCACTGCCCCCTGCCTGGTGGAGCTTGGCAGCAA 

GTCCCAAGTTTTACAGTGTTGAGACCAAGTGGGCAGCCATCACCATTGAGGGGTGCCCTT 

TTGCTCCCCCTCGAGTCCAAGTTATAGTATTGCTTACTTCCTACCTGAAATAGTTTATGG 

GTCATGGGTCCGGCTTACATCAAGCCCCATAGCCAGTCTGGTTGCCCCACCTAAGGTCTT 

GGTGCTCTTTGATAATAGCCACAAGCTTAAATGTATTAAGTCCTGTCCTAGGAAATGCAT 

GATCTTCTGCTCTAGAACTAAGTG 

Sequence 1 892 

ACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGNCCGGGCAGGTACAAGGG 
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GAGACAGCATGCATGGTGTGNTCAGAAGAGCTTGCTGAGGTGCTCGGCTCTTANCGTTAA 
AAATGTGATGTTGGTATATCATCCTGATAGAAANCACTGCTTCCCAAATNCTAGTCACTG 
GATGGGAGGAMGTNANNMCAGATTCTTCCGACTCACTACTGANTTGTTTATANTACTT 
C CCCATTTGAAATTTGGGAC AATTANTTTG NGGGCATNGCTACTCTG 
Sequence 1893 

GNCCGGCCAGGTACTGCCACAGGACTTTCAAAAAGGAGGGGGAAAATTAATGAAAGTGAC 

ATGCATCAAACAAATCAAGGGGCAGTGTTGAGGTCATCTCCACGGAGCTGTAAACTCAGA 

AGTGT TTCCT GGTCATATATGGTCAATTANGGTCAAGTCTGAAATAAATNTANAANAATG 

ACCTAATTTTCCAGCTTAACTCAGNAGCTAAAATCCATAACACTATCANACTTTCCTTTT 

AATTTATGAGATGGAGTCTTGCTCTTGTCGCCCAGGTTGGAGTACCTCGGCCGCTCTAGA 

ACTAGTG 

Sequence 1894 

AACACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAC 

CACCAGGGCCCTTGCTTCTGAGCAGGAAGCAGCTGGGGAATAGGCTCTTTCTC7TAATGA 

CTTCCAACATAGTTCTCTCAAACCTTACTCCTCCAGAAGGCCACCCTCACCTGGCTATGG 

CTACTTCAGAAAAAACTTGGCCTCTGGTATAATAGAGCAGAATCATCACCTCACATTCTA 

TTTCAAGCCAAAGTCAATATCTCAAAGGCTGGTTCTGTGATTTATTTGGCCCTTGGGAGC 

TCCTACTGAAAGTGCTGAAATGTCGTACCT 

Sequence 1895 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCCGCCCGCCCGGCNGGGTCCTTTAAAAC 

TNGGGGGCCNTAGGATTCATTTTAACCATTTCTTGCCTCACTCTCACtCTGTCACTATGT 

GGAGTGCTCTAATCCATTAGGGAATATAGCTCCTATATCACTGGCTACCCAAATGGACCT 

CAAAAAGGCAGACAGAGCCCATCCCCAGAATATCCTTGGTTCTTGCTGGTCACAAGGACC 

AATAACATCATTAACATTGATGTTTCAGAGTCAATACAGGTATAAGAACCTTAGGTTTAG 

GACTGTGAATAGGGGAGCGGTGAAATGAAAATACGGCCTTGAATTAGTTTCACTGTCAAC 

ATATCTCATATTTATCTAGCTAGGGTATTTTAATTCTAGGGTATGTTATGTCCAGAAGAG 

AGGAATGATTTT 

Sequence 1896 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCTCGAGCGGCCGCCCGGGCAGGTACTC 

TTTATGCACATCAGTGTGGGGCCTTAGTAACCACTGTGCAAGTTGAAACTGGGCAAAGTT 

ATCTTAATAATTAATGGGAAMCATAAACTTCTTCCTGCAATCTTTAAAGACACTTGTCA 

GTCAGGTGCGGTGGCTCACGCCGGTAATTCCAGCACTTTGAGAGGCCAAGGTGGGCGGAT 

CACTTGAGCCCAGGGGTTTGAGAACAGTTTGGGCAGCATGACGGAACCCTGTCTCTACAA 

AGACTACGAAGGGTTGGCCGGGAGTGGTGACATGTGCCTATAGTCCCG 

Sequence 1897 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATCTCTAGC 

T GATG ATTCAAAAMGMGCCTTTTAATCTCACTCCACTGATCAGCTATGATACTTAAAT 

GTTTTAGCTGTGAGCAAAATAATATGCATTCTCAAAGAGAGTATCTTCAGACTCCAGTGG 

CCGAGAATCTAGAGTTAGCMTGGAAAAAATTAGTCTCGGGGCNTTCTTGTTTCTGCCCA 

CMGTTTTCAAATTAAGAACAATGTGTTTGCACTTMTGAAACAACCTCTACTGCTCTTC 

AAGAGGACTCAGGATACCGATTCTCGAGGCCCCTGGCGGTCCCCTGTAAGTACCT 

Sequence 1898 

CCGCGGTGGCGGCCGAGGTACAGATAGAGTCTCACTATATTGCCTAGGCTGACAGAGTAT 

H II I ICCTTAATTATTCTTTATTCTCAAATGCAGTGAACAGTCACAGTAGCCCACCATA 

GGGGGTTCATTTTCAGACATTACAGTAACTAGGGGAATTTTATTAAGGMCCAGCNGCTT 

MCCCNTANTNGGTANTAATTCTTCTTTTTTAAACCCAACCTCCAGAANTTNATTCAGAA 

TTCATCAGTCCTTCCCNATCAGGTGGCTTGTCTTTTCTNGGTCCCATGGATCCAATGTAN 

GGAATTCCACAATCACAAGTTAAGTNTTTTTAATCACTTCTTANGNNCCTNCTCCCCACT 

TCTGGGGATGGGTTTCCCTCANNTAATCCTCTTTTGGTCTTTCCCAGGGGAACCCTGGAC 
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CACCTTTTTNGAAAAGGAAGGTTANTTNGGGAACCAGGCCTTTTTTTTT 
Sequence 1899 

CACTAC7TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTATTAA 

ATTTAATGAGAGGTAACCCATGCACCGGCAGAAAGTTCAAGGGGACAGAAGGGGCTTGCT 

CCACTCTGGAGCCCTATCTCCTCTCACCTCCTTCCTTCTAGTTTGACATGCGATCTTGTC 

TCTCTC7TTTATAMGTGCTTCCAGGCCCNTTTTGATTAAGGATGCTGTCCTAAAATCCT 

TCCTCCCGCGTACCTCGGGC 

Sequence 1900 

TGGTGGCGGCCGCCCGGGCAGGTACTACAGGGGCAGGCTACCGTCCCCGGCTAATTATTT 

TMTTTTAATTTTTAGTAGAGATGAGGTCATGCTGTGTTGCCCAGGCTGGTCTCGAACTT 

CTTGGGCTCAAGAGATCCTCCTGCCTTGGCCTCTCAGATTGCTGGGATTATAGGNGTGAG 

CCACC NACAC CTGGCCTATGATCTTAACTTTATTTACCACCTTTTTGACATTTGCCAAAA 

ACTAAGTTTTACAAGATC TCTCACCATATCTCTGGGGATGG GAGGAGG AAGGGAAGG ATA 

ATG GTCTC ATATTCATAATAGTATTTGCAGTGCCTATTCTCACAGTTGATAAATGACTAT 

AACCTTTTCAAGAGGATGAGTGAAAGAAAGAGGGAGGCAAGAANAATGAAAAAGGAGGTA 
A 

Sequence 1901 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTTTGTTGTTGTTGT 

TTTAGAGAGACAGGGCCTTGCTCTTTTGCCCATGCTGGTCTCAAACTCCTGAGCTCAAGC 

AGTCCTCTGCCTCAGCATCCCMGTAGCTGGGACTACAGGCCTAATACTTTTTGTGCCTG 

CTCCTCTCTTCCTGTTTCTACCCCATCATTATCTGCTCAGATCTTCTTAATCAACTACTG 

CAGCAACCTG CTAAA A TGTC TCTCAATCTTCTGGGATAAGTCAATCAACTGCTAAATGAA 

TCTTCCCTAAATTTTACTTTTATTGAGTTCTTC 

CTAAGTCCTCAGCATGGTATTTCTTTCTTCTATGTTAATTGACAGTCTGTCCTTAAATAT 

GAACTGTCTATTCCTGCTCTTCCTTCCCTCAAACCCAAACTCAC 

Sequence 1902 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACACATTATCTAAGATGA 

TACCTATAAGGCAGATAGTATTATCCCCATTTTACAGATGCAGAAATTGAGTATCAAAGT 

GGTTAACTTGAATAATGTCAAGCAGCTAGTCAGTGGTGGCAGTGGAATGTGAATCCAGGA 

AGTCTGACACAAAMCAAGTGGTCTTAATGCTATAACATGCTTAATGCTAAGAAAAAGTG 

GACAGTATAATGCTATACTGTTCCTGAATAAATATCCAGTAAATAAGGAAGGCTAGAAGA 

ACACAMGT1TGTAACAACATGTTTTTCTAGTAGAAGTCTTCTTTGCTGACTTTCTGATT 

CTCAGGTATAGCATCAATCGGTGAGGGTTGCTATATGGTAAACATTATACTGACAGGCCA 

AAGCGATCAACTGTGAAAACTAGGCAAAGCCAAAGTA 

Sequence 1903 

CCGCGGTGGCGGCCGAGGTACAGCTTGAACAAATCCACCTTCTGTGGACCAAGCACCACC 
CTGGGCATTTCTAGCATGAGCAAAATCCAAGGTCCTGGCTGGACTCCAGAGATGCTATTT 
ACCTCAGAAGCATGACAATAGGAGGCAGAAGGAGCAGGCAAATCCAAGTCCTTTCTTGTA 
GTTTCCTTGTTTGGGGAGGAAAAGTTGAGTTTTACT 

GAGACAGACAAAGAGATATGACAATACAGTCCTGCCACCCAGATACTCATTTCCACCTAC 

CATTCCATGCATTTGTTTTGAATATATAAGTATGTACCTGCCCG 

Sequence 1904 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATTGCTTTATTTTTTCATG 

AGACATAATCATGTCTTATTATAAATTTGCCTAAGCAGGGGATAATATCTGACATACATG 

CTACCTGTTATTGTGTAAATTGAAAATATTCTATCTTACCAGTTGTTACAATTAAATATA 

TTATACTGATAATTTATTTTTTCAGACATTGACATAATAAAGTTTTAm 

TTAATCAATAAATATAAAACAGAAACCACTCTAATAGAGGACAATTGTTCTAGTCAATAC 

TCATATTTGTTCTGACTATTGAGAAAAATTATGTCT 

AAGCTTCAGATTTTAGAATAGGGATTT 
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Sequence 1905 

CCGCGGTGGCGGCCGAGGTACGCGGGGAGACGGCATCTGAACCTTGCCGAGTAGGAATGT 

TCCAGAAGTAAGAGAGATGACTAGGGTTTCAAGAACAAGGACCAGCACCTATGAAGGCTC 

TGAGGCACAGACCACTGTGACCTGTGGAGAAGTGACAGCAAAGTGAAAGGCACGTTTTGA 

TTACTACCGACATTCACAGTCTGGACACGGGTGACAAATCTAAAAGATATTGCCCATTCG 

TCCAACTGCCGCTGTCCTGCCAGTATATCGTCTGAGATTACCATCAGCTAAATATTGCAT 

TGTCTTTGGACAATTTTCCTACATTGTGACTCCTCACTACATTTTCAATGCTTTTCTAGT 

CCCAGACAAGCTACAGACTGTTAATTTACACTTGCTTTTGCCTTGCCTGAGGAAATTGAA 
AATAAATT 

Sequence 1906 

AGGTACTTACMCTTTTCATATAGTATTGGTAGTGTTTTGAATAATTTAAAAM 
TAGGTTTGTTATTTAACTTTATTTATATAGTGCTTGTATTAATGATTT^ 
TTAMGATGGATACCTTGATTTGGCTTGGCCTCCCAAAGTGCTGAGATTACAGGCGTGAA 
CCACCGCGCCCGGCCGACTTCAGGAGATCTTTAGGCATCATTGGTTTGTGTTCTTCAGTT 
AAAAATAAAAAGCCAAACTCAAGACAAAGAATACAGTTCAGTTCCAGGTTAGGAAGAGGG 
AGAAGCTCAGCTTCTGAAGTAGTGCTATATTTACTGTTTACCTTGACTGTTATCTTTAGA 
AGTTG ATG ATN ATTTT AAAAATTG N GTTAACTGCTGTTTG AACTGTTAAAGTTC 
Sequence 1907 

CCGGGCAGGTACTGGTTTTACTGATCTTTGTGCAATTTGCTGCTCAGCAGATATTTTCAA 
TGCTGATTGGTAATATGTCAGTGGATAACCAGTGGAAATGCATGTCTATTAATTACTATT 
GCACATAATATTTATCTTGTTTTTATATTT^ 
TCTGAGCACTGTAAAGTTTTCTTTGAGAAATAAGTGT^ 

ATGGGAGGGTAACATTTGTAAATATACTTCAAATTAGTGTTGTTCCCTCTGCTTTCTTTT 

TCCTGACTCAGAAACAGAGTGCTTTGACTGCTCTATGTTGGGGCCATG 

Sequence 1908 

AGGTACAATATAGGCAGACAG7TTGCCTTCAGAAATTCAGAAATGCAGCTTTTGAGGGAG 

GTCAGCATCATTGGTCTCAGCTACCATTTTCCTGCAGGATGTTTATAAATAGTTCCTGGC 

AAAGGAGGTTGCCGCATAGGGTCCTACAAGTAACCTAGGTGGCAGCATCTCCTCCCAGCG 

TCCCCCGCGTACCTGCCCG 

Sequence 1909 

CCGGGCAGGTACGCGGGATGGGTTTAGACTAAAGTATTTTCTTCATAAATTGAAAGAGAA 

ACACACTAGGAATTCAGAAGATTTTAATTAAAGTTGTTCAAAAAATT^ 

AAAGGCTGGCTCACTAAGTAGTTTTGATAATGAATT^^ 

CAAATTTGGTAATGCCAAAAAGAGCCTACATATGCACAATGTATATTTTCATGGAAGCAT 
TATTATTAAGTGTCCTGCTTTTACTGGTTGACATACGTTAGTTCATTTCAGCTTCATTCA 
AATATGTAGATTAAACCTAACACCTATAAAATGGTAACAATT 
Sequence 1910 

CCGGGCAGGTACCATACACCATGGAAGTTAGCCTTCCACCTCAGGAGAAAGCAAGCTGCT 

CTCATAAGCCTCTCACCAACTACCCAGTAGTCTTTGCCTCCAAACTGAACAACAGGAAAA 

AGTGCCATAGTTTTTAATTAAGCTGTTTTAAAAGTCCATGTTCCGGGAGAAAACAAAAM 

AAAAAAAAAAAAAAAAAAAANGTACCT 

Sequence 1911 

TCAGCATTCTAAATCAAACGCAAACAGCAGAATCATATGGCATAGACACTTAGCANATCC 

AATGCCTTCCAGGCATCTGTTTCCTGTTGATGAAATCCTCCCTATGGAGAGCAAACTGGN 

TCATATCTTCAGATAGTGTCATTCAACCCCTTGGCAGCTTTCTGGGCTACTAAATATCAC 

ACCGTCTGGNGGGACATTTNACCCAAGTAGGCAAATGCATGACAGAACTCACTGGATCAA 

TATCTCAAAATGCTGGGAAGAAAAAAACGGTAACCAGAAATTCTACACAGAGTGAAACTA 

C 

Sequence 1912 
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AGGTACCATACTTTGTTTTTATTCTTGAGAGGAAACTTCGATCATTTAAAGCAGATATTT 

GCCTTCATCCCCTACCCCCACCCAATAAAACAAACCTGGAAATGCCCATTTACATATTTT 

AGCTTAATCACGGTCAACTGTCTTCTATCGCCACTTCACTGCTCACAAAATACACATATT 

TAAAAGTGAAAATACTGAAATATGTCTTCACAATACAGCTAATACCACCAGTAGAAAATT 

CACGTCACCATGTTCAGTAGACAAACGTGAGAAGCAATGTGGTGTCAGATTGGAAACTTG 

GGAGAGATTCTCAGAAATAAAGTAACTTCAATAGGTTGCTTTTTGCATAGTGCAAAATTA 

AACTTCACTTTATTTAGTGGAAGTGACCCAATTCAATCTTATCTGGGGTCAACACAGCTG 

ATTTCATTTGTGTTTGACCACTGCACATTTTGGCATCCAAAAAGTTAAGGAGAGGAAGTC 

MGGGT ATTAT GGAGAACTACGAMGACAGATGCCTCCAATAAAAGACTTTTTTTGGTGG 

TATTTGGTTTTGGGTTTGGGNGGATGATTTGGATCCCAAAAGAGAAACCACTTMGA 

GGGAATTCCCCCCCAAAATTTCTACCTTNAGAAAGACCCMCAATTGTTGN7TNTGGNGA 

CAGCTTAATCACTGGAAGACTTTTGCATTNTrTTTTATTAANAGAGAGGATTGGGTNGAT 

TTAAGATCCTCTTNTNCCTGCAGGTNTCCATTTAAATTNCACCNCAGATAATNNGGAAAA 

NNNTCCANATNATGGCATCCTAmGCTNGCTTTTTTCCAATTGGAAATTNGATTTNA^ 

NAAAAAAATNNNNNNAGNTC I FTTTTTTTTT] 

Sequence 1913 

CGCGGTGGGCGGCCCGCCCGGGCAGGTACTATTAAAATTTAATGAGAGGTAACCCATGCA 

CCGGCAGGAAGTTCAAGGGGACAGAAGGGGCTTGCTCCACTCTGGAGCCCTATCTCCTCT 

CACCT CCTTC CTTCTAGTTTGACATGCGATCTTGTCTCTCTCTTTTATAAAGTGCTTCCA 

GGCCCTTTTGATTAGGATGCTGTCCTAAATCCTTCCTCCCACGTACCT 

Sequence 1914 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGATGATTAATCTTCCCTTATCCACAAAC 

TTAACTGTGGAGAAACAGGAGAAAGTGTGTGCCTAGAAGCTGGATTCAGGGACCCATGGT 

TACCCACTTCTACTTATGATTCCGTCATTACTGTTGTTTTAGGAAAATAAACAGATCTTG 

ATTCTTTCATAAAAGTCGATTCTTCAACAAGCAAATGGGAAAATCGGCAGGCCCAGATAT 

GTGTGTATAGCAGCTACTGGTTGGAAATTTGGACACAAAAGTCTTTATCTACCCAGCCTG 

TGAGCCACAAATGTGGACTGAGTATAAAAAAGAATAAAACTAAATACCACAATGTTCTCA 

CTTATAAGTGGGAACTAAACATTTGAGTATGCATGTACCTCGGCCGCTCTAGAAC 

Sequence 1915 

ATACGACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGGGGGCCATTGAGAC 

TGCCATGGAAGACTTGAAAGGTCACGTAGCTGAGACTTCTGGAGAGACCATTCAAGGCTT 

CTGGCTCTTGACAAAGATAGACCACTGGAACAATGAGAAGGAGAGAATTCTACTGGTCAC 

AGACAAGACTCTCTTGATCTGCAAATACGACTTCATCATGCTGAGTTGTGTGCAGCTGCA 

GCGGATTCCTCTGAGCGCTGTCTATCGCATCTGCCTGGGCAAGTTCACCTTCCCTGGGAT 

GTCCCTGGACAAGATTTTACTCCCATAACCCAGGCAGAAGTACCT 

Sequence 1916 

CGCCCGGGCAGGTACTTGAAGCTGTCATGGATCAGGTCATCTAGGGCAAGCAGGTAGAGT 

GAGAAAAACTCTGGAGAGCTTCAGGACTTAAAATGTGATCAGAAAAAAGGATCCTTATAG 

AAGAGACTGAGAGGTAGGAAGGGAACAGAAGTGATATTGCAGAGGCCTGAGGCAGACACT 

GTTAAAGAGGGAGGCACCTGGACACTTTTGCTGTAGACAAGTAGCCCATCTAATCCTAAA 

GAGCCCCTTTGATATGGCCACTTAATGACGTATGAAATACTTCAAAAATGATTTTAAAAG 

TGTTTAAAGTAATGGGCTTCTGTGTCATTTCCATAATATTATTCTGGGTGCCTGCCATGC 

GCAAGACACTAACATAGGTGTTGGATGTTTCAAACAAATAACATTTATTTGCAAACTCTC 

GTAACTGATACTTTTCTTCAAAANAAAANATNNNNNNNANNAGGTCCCTCGGC 

Sequence 1917 

GCTCCCCGCGGTGGCGGCCGNCCGGGCAGGTAC i rTTTTTTTTTTTTTTTTTl IGGAGAC 
GAGGTCTTGCTATGTTGTTCAGGCTAGACTTGGACTCCCAGCCTCAAGCAGTCCTACTGC 
TTCAGCCTTCCAAGTAGCTGGAACCACAGGCACACACCACTATAACCAGTTTATTTTACA 
TTGTTCTTGTAATTCTGGTATCAGTGATTCTTATAACAATATGCTCTCAATTTATTTTAA 
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GCATGTTG AGACC TTAAT6ATAACTATTCAGAAACCATACTAAACATTTGATCTTGAAAT 
CTTCCATCGTTTTCCATCCTCTGCTTATGCCTCAGTTATTCCATTCGCATGATATGGGCA 
CTTAATTTAGCCAACTCACTGACCCATTTC 
Sequence 1918 

CCGGGCAGGTACTGTGATATCCACATATTTTTGAGAAAAATTCCCAAGCCAGGCGAATGT 

GGATTGGAATAAAGACATAGGCAGTGTATACCACCATAGCAATAATGGTTAGTAAGATGG 

TGTTAAACATAGATCGCTCCCAGGGCTCTAAAACAGCACAGCAGCTAATGATTTGGTATT 

GATAGTAGAGCCAGGAGAAATATTCCTTCACACGCCTCAAATCCATGGTTGGCTCCTTCA 

AGCTGCAGTAAGTTTGTCCTAAGAAAGTCCAGGTCTGGTTCTTCAGCCTTGCTCCTTCGC 

GAAATGATCCTGTGTGGGTTAAGTTCTCCTCTCTGGGTTGCTGTTTCCTCATCTCCCAGT 

TGGGGTGTATCTTCCCTGCGGCTTAGGTGAAGCGCCCGAGGCTT 

Sequence 1919 

AGGTACAAGCATATAATGCCAACTGGTAAAGTCAGAACTCCAGATGACAAGGTACAGCAT 

GTAATCTTTTGCAGTCTATTGACAGCAATCTAAATAAAAGCAAAGGAAGATGCTTCCAGT 

ACTACAAACTTTATGGGAACTTAGGGTCTCTCAMGCCACACTGTTTTGGGTCACTAGCA 

GGAGGTACAGTATATTAAAGCAAGCAGCTCANCCTCTCACTAGTGGTGAGATGTATGAAA 

ATCCTGGCAATGCCAGGGGGAAAAGAATCTATGAGTCTTCCCTTCGGTCATCTCTCCTAT 

TTCTTGCATTAGTTTTTTCAGGGTTAAATACACATCTCACAAGAAGAAGCTAGCTATATA 

TGGGCTGGGTTGGTCTTCAATNGAATCTNCAAATGTATCA 

Sequence 1920 

CCGCGGTGGCGGCCGAGGTACGTGGCGTAGCATTCCATGTTCAGCTTTGACATTTATTTT 

CTCATATCAACCCTTTTTACACGTGAAACACAATCTCGCTCTTGAAGTCTTAACTGCATG 

ATCTGTGAMCCTGTATTTATATTTTCTTCAGTGTATTCTTGTCGTGTGTGTGTCCTAAA 

CAAACCAAAAGAAMCTTTCCAAATCTAAAGTATTCATTCTCCAATTGGAGCAAGAGGAG 

TCAGTTAGATACTATCACGGCATTCATTTGTGGCTGGCTTGTCATATTTACTTATGATTG 

ATAATAAATCTATTTTGC I I I I I AGAGCCTCCCGAGAAGCCATCTGCCTTCGAGGTATTT 

AGTTTCATGATTTTCATTTTGAATGACTTATTAACTATGTATTTTGGNGAAG 

Sequence 1921 

NCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGTCTGTTGT 

CTTCATCATGAGATCAGAAAACTCTATGCTGCTCCCTAACTATGTGACCTTGAGCAAGTC 

TCTTAAAGGAGAACTTCAGTTTCTGTATTTGGAGAAGAAAGAAGATCATGGTAGATGATC 

CTCTTGMTTGGCTTCAGGCCTCTAAAATTCTGTAGTTTGGTAAAAATGCCTAATGACCA 

AATGGAGGCAAGATCTCGTTACCAAGATCATGGTGGCTTTGGAAGTGACAGAGATGGCTG 

AGACCAGGAGGCTGAAATGAAATTAAAGTCCTATGATGTTGCTTTAAGTTAATGCTATCT 

AAGAAAAGGTGCTGGTAGAGGMGTGGGGCCCCAAACTTGGCTTCATAGTGATTGGAGTT 

TCCAAATCAATGGAATAGG 

Sequence 1922 

AGGTACGTGAAACTGCATCTCTCAC I I I I I CTCTCTCTTCGGGC AGTGCTAGGAAGATTA 
CTCACAAAAAGTCAAATCACATCCCCCATTCTAATGTTGGTATTTCTGGGGAAGCCTTAT 
GGAAAGCTGCGTTTTTTCATGGGGGMCACTTTO 

ATTTCMTAGTCAMCTGTTGCCATAAAAAAATCAAACTTAGGAATAGAACTAAACATGT 

TATTAACAGAATAGGCTTGAGAGAGCAAGAGAATTCTTTTGGAGTAAAGTTATCCAAAGT 

CTATGTGTTCTCCTTTGACACATAAATCTACTATCGACCTAGTCTTGGACTTTTTTTTTG 

GACAGAAATTGGGTAAGTTTTGCAAAAGGACAGGCACAAAGCC 

Sequence 1923 

ACTATAGGGCGAATTGGAGCTCACCGCGGTGGTGGCGCGCCAGCGCCGCCTCGATGCGCG 

GGATCTTGCAAATGTCGGTGCCGATGCCGTAGATCATGGGCTAGTCCTATTCCTTATGCT 

TTGTTGCCCAGGCGCGCAGCCACCATGATGGCCTTCATTTCGCGCACGGCGTTTTCCCAG 
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CCCGACAAACACCGCATGCGCGACGATGGCGTGGCCGATGTTCAGCTCAGTAATATCGGG 

AATGGCGGCGATCGCCTGCACGTTGGTGTAATGCAAGCCGTGGCCCGCATTGACCTTCAA 

GCCACGGCCCACGCCGAACAGTACGCCTGCCTTGATGCGCTCCAGCTCCGCAACTGCTCG 

GCGCCTGCCGTATCGGCGTAGCGGCCCGTGTGCAACTCGATCACGGGCGCACCGAGTTCG 

GCAAGCGGCGGCGATCTTGTTGCTCGTCGGCATCGATGAACAGGCTCACGCGGATGCCCT 
CGCCCTGCA 

Sequence 1924 

AGGTACAATGGGAACAAGGAGATAAGCAGTGAAAGGCCAAGGGAATGTCTGGAGTTAGGA 

CTTCAGGTGATTCACAACTTGGCTGCCACTCACCCGAGACTGCCCAAGCCCAGATTTCTT 

CCTTCTATAAGAATATTGATTCTTGCAAATAAGATGAACCTAAATGTGGTCCAGGAGTCA 

GCATCTTCTACATGGTACCTGCCCG 

Sequence 1925 

TCGACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAGGTTCAAAT 

AGTCAGCAGCTCATCATAATCAATGAGCGAGGACATAAAGTAGGAAAAATGCATCACCAT 

GGTGAGCAAGGAMGCAAGTTATTGGAGGCACATGTTAACACATAAAATATAAAATTAAT 

ATGATCACACTGGAAAGGCTTGCCTGAGCCCACAGTTTGAATGCCTACAATAAGATGAGA 

TGCACAACAAAAAGCAAGAGAACCTGATCAAGTGGGTGACCTGGCCATGGTGCTCTCATC 

AGTGGGGACCCAAATGCTTATGTGGACTCACCAGGTATCGAATTAGACATGAATAGGAGT 

G^^^^GTGATCCAAGGAMCTATATAATCAAAATGAATACAATGAAACTTTAAAAATA 

ATTGTAAGGATCTTTACACCAGCCCA 

Sequence 1926 

CCGCGGTGGCGGCCCGAGGGTACGCGGGGAATCTTACTCCCTCGTAGAACAGGCGATATC 

TTCACCATGCGCACAATGAAATCAATACTCAGAGAAGGCAGATAATTCTCCACGAAGCCA 

GAAAACTAATAAATGAACAACTTGGGTGAAATGTCCCACCAGACGGTGTGATATTTAGTA 

GCCCAGAAAGCTGCCAAGGGGTTGAATGACACTATCTGAAGATATGAACCAGTTTGCTCT 

CCATAGGGAGGATTTCATCAACAGGAAACAGATG CCTGGAAGG CA7TG G ATTTGCTAAGT 

GTCTATGCCATATGATTCTGCTGTTTGCGTTTGATTTAGAATGCTGAGCTGACTCAAAGT 

CAAACTAAAAAAAAAAATGAMTAAAATAAAAATCTTTATTTCCAATGAGCTAATGAGGA 

AAAAGGGGATGATGAAAAMCAGGGAGTGGGAGCAATTTGGAAATGGAAAGAATGCTGAA 

TACCTACATGATGCCTAATTCAGAAGAACATTCTGGTATGTAGGGACATTAT 

Sequence 1927 

CAGGTACTTTTN I It I I II I ANNTN NTTTTANTTCG CAATACCAGCTTT N ACCCTG N CCT 

CTGCACTGC7TAGCCTNGGCTCTGCTGAGTNGTGCAACACATTCTGANAGATGCCATACA 

ATGCTCCAGGCAAACTTATAGAAAMCAGGAATGAGTGATTTTATACGGGATGTGTTTCA 

GCTGTCCAATNCAAAATAMTCACGTCCAGGTGCATTCTTTCCTAATTTGTGACCCAAAC 

TGCTTGCACTTTATACATTMTGAATTATTTTTTTAAAAGAAATTAGCTCTATCA^ 

ACAGGGCTATCTAACCCTTTGGTGGTCTTTNTCCCTTAAAAAGCAATTGTTCCATTTTAT 

AGGTGTTAGGTTTMTCTACATATTTGGAATGAAGCTNAAATGAACTAACNTATGTNAAC 

CAGTA AAAGC AGGACACTTAATANTAATGCTTCCCTGAAAATNTACATTGNGCATATGTA 

GGCTCl I i I 1 G G C ATT AC C AAATTTTCC AAG AAC CTTrTTTGTTTT AAATT AA 

Sequence 1928 

TCACTATAGGGCGAATNGGAGCTCCCCGCGGTGGCGGCCCGAGGTACAGGGAACTATTGG 

AGCACCTAAGAGGAGCACCTACCTTGAATTTAGGGGTTAGCAGAGGCATCCTGAAAAAAG 

TCAAAGCTAAGCCACAATCTATAAGCAGTTTAGGAATTAGCAGAACGTGCATGGTGAGGA 

GATGCCAAAGGCAAGAAGAGAAGAGTATTCCAAACAGGAGGGATTCCAAAGAGAGAAGAG 

TATCCCAAACAACATTTGCACAAACCTGATGGGGAGAGAGAATGTGGGGTGGGGATGGAT 

GATGAGACTGAAGAAGAANGCCAGGTCTAGATAATCA 

Sequence 1929 

AG GTAC CATGATTAGTTAAATATAAG ACTCC GTAATTTTTAC AATTTTAACAATAATTTT 
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ATTTCTTCAAGCTTGTTAGTTTGGGATTGTATTAAAACTCAGTGTGTGACTTAGAAAATG 

ATAATGCTGCTTTATGGAAAATGGATTATAGGTGGGTMGACTTCATTGCAAAAATTGTG 

TAATACCATCAGTGTTAGGAACCCAGTTGAAGTCTAGAAGACAGATGTTAGTATCTTAGA 

CTAGGTTGGTATTTGAATAGATATTGGTAATATCAGTAGAATTTAATAATACATTAGAAA 

GAMGAAATCAGAGAAGATCTTTTATTTCACTTGATACTTGNGGTGGTACTTTCAATGAG 

ATAAGAAGGGCAGGCCAAGGAGAAAGTTCAGGGGCAGGGGATGAGAAAAAAACAAGAATT 

TTATGTTGGACATGCTAAGTTAAACACCTGCTTAACTCANATTGGCTTCTGTGGAAGCAG 

AGGCTGACATGAGGGTGCTTATGAAAATGCTTTTTTGAGGAAGCCATTGCAGGAGGGAAT 

CTCTTAGGAAGCCAGGGGAAGAAAGGAAAGAGAA 

Sequence 1930 

AGGTACTTTTCCATATTTCCACATACTACCAAGGATTTTCAAGTGTAGGAGTMTTCAA 

CAGACCCTTTCTTTCATATGAGAATTCTCAAAAGAGTAGAATAAATGGAACACAGAAAGA 

AAAGACTTTAAGCAAGGGGCTTCTTGGAAGCCACTGCAGCAGTCCAGGCAAGTGATGCTG 

AAGGTCTGCAACAGAACAGGGGCCCGAGAGTGGAGAAGAGTGGGTAAACTATCATATTAA 

AGAGGCCAAACTCAGACTGGGGGCAGTGGCTCACGCCTGTAATCCCAACACTTTGGGAGG 

CTTAGGCAGGCGGATCATGAGGTCAGGAGTTTGAGACCAACCTGACCAACATGGTGAAAC 

CCCATCTCTACCAAAAATACAAAAAATTAGCCAGGTGTGGTGGTGCGCACCTGTAATCCC 

AGCTACTCAGGAGGCTGAGGCAGGAGAATCCCTTGAACCTGGGAAGCAGAGGTTGCAGTA 

AGCTGAGATGGTGCCACTGCACTCCAGCGTGGGTGACAGAGAGAGACTCTGTCTCAAAAA 

AAAAAAAAAGTACCTGCCCGGGCGGCCGCTCTAGAACTAnnnnnnnnnnnnnnnnnnnnn 

nnnnnnATATCAAGCTTATCGATCCGCGACCTCGANGGGGGGCCCCGGTACCCACTTTTG 

0 

Sequence 1931 

AGGTACl l"l I I I I I I I I I I I I I I I I I I I I I I 1 1 NAAAGGTGAAGTTTCACTTTGTCACCC 

AGGCTCAGTCTCAAACAAACAAAAAAAAAMTTGGGGGATCTTTATTTCATGCTTTCTCT 

CATTAAGAAAAGAGAGGACAATAAAGTGTTAGAAGATAATAAAGCACAGAAGTCTAGCCA 

GTGTCTTTGGAAAATGAAACGGCTGTGTGAA I Mil I ATCTTTTGGC ACCTCTTTTAAAA 

ACTCCACTTAATGACCTACCTGCATTACTGAAGTGTCTCTTGTTGGGGTGATTGTAGTTA 

TNTCTTGGGTGTCCATGCATCTGTGGGCCATGGGAGGCAGGCAAATGAGGCTTCTGAGGG 

TNAAGGTTGTGTNAGGNATGGGACTGAGATATCAGANAGTCCCGGCNGGAACCTGAGGCC 

TTNGGACCAAATGGTTTNTTACCCCCAANNCANGNCGNNTNTCTTNNTTTTGCTCCNAAA 

GAGGNAAAAGGAAAACCAAANGAGCTTAAAGANCCCAGGCNTTAAATTCCCCCAGAAAAA 

GAAAAAMGGNTTTNANCCTCTAMCNGCTNCTTTTTGGCCNTTNTTAA 

Sequence 1932 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTTTCTCTCATAGGGA Al Ulllll I AATTTTT 

AMTTTTAATTATTTAAGAGACAGGGTCTCACTCTGTCACCCAAGCTGGAGTGCAGTGGT 

GTGATCATGGCTCACTGCAGCCTCAAACTCTTGGGCTAATGTGATCCTCCCACCTCAGCC 

TCCCAAAGTGCTGGGATTACAAGCATGAGCCACCACACCCGGCCATCCTAGATGATTTTT 

TAMCATCACTTTTCTCCTGGCAGAATTTTCTGACTCTGTAGTTCTTGTTAACTTCCAGA 

GAAAGCCCTGAGTAAGCCAGACCCTAATTACAAACATGCCTCAGTGGATGAGGCTACAAT 

TTGAGTCCAAACTGCTCAAGGTTCACACTGACAAAGTCTCACACCGTAAACCTGAAACAT 

CATGTTACAGTCTGAGGCCTGCTCCGAGCTGGGTAAGAAAGGCCTCGTGAATGCAGCCCC 

CTGGGCAGATGAGGTTTGGGCCTTCACCCCTTTGGGACTGCCTCCTCATTGCTCCTTGGG 

AAGCAGCCGCGCTTGCTGCCTGTGTCTTTGGGAAAGTGAAAG 

Sequence 1933 

AGGTACAATATAGGCAGACAGTTTGCCTTCAGAMTTCAGAAATGCAGCTTTTGAGGGAG 
GTCAGCATCATTGGTCTCAGCTACCATTTTCCTGCAGGATGTTTATAAATAGTTCCTGGC 
AAAGGAGGTTGCCGTATAGGGTCCTACAAGTAACCTAGGTGGCAGCATCTCCTCCCAGCG 
TCAGGACTAAGCACACAAGTTCTTCCTTCCATCTACCCCCGCGTACCTGCCCG 
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Sequence 1934 

CCGGGCAGGTACCTTCACAGCTGCACCACACCAGTTCTGGGCACATGGACAACAGATCAA 
GCTCCCCCATCATCACATAAGTTAGGCACCAAAGAGTTAGCAAGGGTATTCAGTTCCCAC 
TTAATTGATTTGCATATCATGCAGGCTCTGATTTCAGTACCT 
Sequence 1935 

GANATGCTGCCACCTAGGTTACTNGTAGGACCCTATACGGCAACCTCNTTTGCCAGGAAC 

TATNTATAAACATCCTGCANGAAAATGCAGNGAAGTNGAANAGACAGGGATATCCCATAA 

GGTTATGCANGAACATCAAGAGAAGATGAGAGGTCANAGATGGGAAGAAACAAGAACTTT 

GACATGCTTGGTGTTCTTGCCCMGCTTTGAAGAAGTTTACAAAGTCTATATGTNAGAAT 

ACANATTTNCCACCTTGCCCMCAGTAGAAAAACATAANAAGANAAAAACATTAAAAAAT 

GACAAGGAAGTTAAT 

Sequence 1936 

CCGGGCAGGTACCCTTCACTCTTTAGGAGAAGGTTTGAAAAAAAACTGTGTTTCCTGTTG 

AAGGCCAGTGATACCTGGCAGCTTACAGTTAGAAAACTCTATAACTTTATTATTTTCATT 

TGCACTTAAAACTGCTATGAGTTTTTT^ 

TGAACATATTTATATTCTTCCCCCTCAGAGAGAAACGCTGTTAACATTTTGTGTATTTCG 
GTCCAGGTTCTTTTCTGTGTGTGTAAGCACACATATGTGTTTATATTGTATAAAATAAGA 
GATTCATGCTGGACATACATTGTTTGTMCTTCCTTCCmCCTTCCTTTCTTTC 
TGAGCTAGGGTCTCGCTCTGTCACCCAGGCTAGAGTGCAGTGGTGGGATCTCAGCTCACT 
GCAACCTCCGCTGTCTGGGCTCAAGCGATCCTCCCACCCAAGCCTNCCAAGTAGCTGGGA 
CCACAGGCACATGCCAGGATGCCTGGCTMTTTTTGTTATTTTTTGANAGAC 
ACTATGTTGCCC ACNTTGGTCTN N AACTCCTGAGCTC AAGC N ATCTACTCACCTTGGC 
Sequence 1937 

CCGGGCAGGTACTCCATGCCATGCTTGTTTCTCAGTCTTGGTTGTGTTAGCCCTCTCCCT 

GCTCATGCCCCAGTTCCGTGAGCAGCAGCTTGAGTTTCCTTAATGCCTGGGATTGGCAAC 

CCTGGTAACAAGGACCATGAACACTCTATTAAGAGCTAGGATTAACAATGGCATCTTGCT 

ATAGCTGCTTAGGCTTCAGGAAGGGTGTGGAACTAAAAGCATTTTCCCTCCCTTGAGCAG 

TTCCTTGTCTCAGTCCCCTGACTGCTCCCTATGCTAGATTCAGGGATTGAGAGGGTCAAA 

ATGCTCTCCCAAGATCTAGATTGTATGATTCCCCAGTGAGAAAATGGACCACAGAAAAGC 

CATTGCTTACTTTCTCCTGTATTATAGAGTCACTCCTGATTTCCAGCTGGTCTTGGTCAC 

ACAGGCTGGCTGTTT7TCTTTTCCTTTCTAGTTT 

Sequence 1938 

GATGGAAGGAAGAACTTGTGTGCTTAGACCTGACGCTGGGAGGAGATGCTGCCACCTAGG 

TTACTTGTAGGACCCTATACGGCAACCTCCTTTGCCAGGAACTATTTATAAACATCCTGC 

AGGAAAATGCAGTGAAANTATAAGAGACAGGGATATCCCAGAAGGTTATGCAAAACATCA 

AGAGAAGATGAGAGGAGTCTATATGTCAGAATACACATTTCCCACCTTGCCCAACAGTTT 

GAAAAAAANAAAAAAAANANAATAAAANGTTCCTTNGGCCGCTNTANAACTAGTNGGATC 
CCCCGGNG 

Sequence 1939 

CCGCGGTGGCGGCCGAGGTACGCGGGGACAATGAAATCAATACTCAGAGAAGGCAGATAA 

TTCTCCACGAAGCCAGAAAACTAATAAATGAACAACTTGGGCGAAATGTCCCACCAGACG 

GTGTGATATTTAGTAGCCCAGAAAGCTGCCAAGGGGTTGAATGACACTATCTGAAGATAT 

GMCCAGTTTGCTCTCCATAGGGAGGATTTCATCAACAGGAAACAGATGCCTGGAAGGCA 

TTGGATTTGCTAAGTGTCTATGCCATATGATTCTGCTGTTTGCGTTTGATTTAGAATGCT 

GAGCTGACTCAAAGTCAAACTGTAAGTACCTGCCCG 

Sequence 1940 

AGGTACGCGGGGAGCTAATTTCTTTTTAAAAAATAATTCATTAATGTATAMGre 

AGTTTGGGTCACAAATTAGGAAAGAATGCACCTGGACGTGATTTATTTTGAATTGGACAG 

CTGAAACACATCCCGTATAAAATCACTCATTCCTGTTTTTCTATAAGTTTGCCTGGAGCA 
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TTGTATGGCATCTCTCACAATGTGTTTGCACAACTCAGCAGAGCCAAGGCTAAGCAGTGC 

ANAGGACAGGGTGA 

Sequence 1941 

GGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATGCCCTGTGGCCCCCCATCTGAC 

ATACCACCAGAGATCTGGTTATCACCTGGCCCATTGACTACCACAGCCAGCCTTGCTTGA 

CATGCACCACCAGGGTCCCTGAGGACAGGTCTGCCCAATTAAGCTCCACCCTACCACAGT 

GAAAAACAAAAAGAAAGAAAGAAAGGAACAAAGGATATTCAACACAACCAGAAATCAATA 

MTAAAATGACAGGAATAAGCCTTCACATATCAATAATAACCTTCAATGTAAATGGATTA 

MGTTrCCACTTAGMGATAGAGACTGGCTTGAATAGATTTTAAAAATGACCTAATTACA 

TCCTGCATACAAGAAACTCATGAGGGAATTCAAAGGAATT 

Sequence 1942 

CCGGGCAGGTACTTGCAAGAAACCTCAGGACTTGAGTAACAGCAACATGGTAAGTTTTCT 

MGTTTTCTTTTCGTCTCCCATATACGCTGGGCTGNGCTGGAATCACCAACAGGCACAGA 

AAAAATGACAACAAAACANCAACANAACCCCCAAGAATATCCTGTTCTCTTTGGCCAAAG 

TCAGGAAAGGGGAGCCCCMCAGAGACCCAGTAGGAAGCTCTAGCCCCTGTTTTGTACCT 

CGGCC 

Sequence 1943 

AGGTACAACTTGACAGTTTATAAAAATGCTMCCTATATCAACATTTTTCCCCACCAAAG 

TGTTCNNAAGGGCAANGGTAAGNGAATACATGTTTNCNTGCTTGTGTCAGGGGCACACAG 

CAAGGGAAGGCANAAATGGGCATGACCTAGTATTNAGAATCCCCAANAAGCCCTTTCATA 

ATTCCCTTAATAAANG AACTTTG NGNGACCTTGC ATTN NTTTAAAN AN ATN AAATTAATT 

TGGGTTCATNTTTTCTTTMTNTCTNTGCGAATTAMCTATATAAAGTAAATTGGGGAAA 

ACCNCANAAATTGCNCATGNGNCTTTGGTTACTTTGNAATCTTCCTTTTTTGGACACCCC 

AAGNTNTGGCCTTCCTTTTCTNCCTNTTAANTTGATNCCNTTAAAACCAATCCTTGGNCC 

CCTTNCNGGTTTCTTrrAAATTGGNAAAAm 

CCMTNTATGNAANATAGGNMCCTNAATTTTGGGTTNTTTGGTNTTNCMGTNTTTTCA 
AAAAC N ATCTTTTCC AAAATTTTCCCC AATT 
Sequence 1944 

CCGGGCAGGTACTTATTTTTATATTAATGTCAGACTTCAGGATTTATTTTAGCCTTC 

TTTGAGAAGCTTTCTAAGCCCTCAGCATTAAGCAGCATTTTTCCTCATCACACACGGATC 

TGAGCAGGTGACCGCGGCTGGGATAGGTGCTGTTTGTGTTAGTGTGCAGGTGGGCTGGCC 

AGCCATTGGGTGTGTGTTTTTCCTAGAGTAAGAAGCACTCCAGGCTGAGATGAGAGCTGT 

TGAAATGAGTAACATTTCCGTCTCCTGTGATACGCGCTGCTGTTGCNTTCTTTCAAA 

Sequence 1945 

NCTTTNCAGTCGGNAAMCCTNGTTNGANGCCAAGCTNGCATTTAATGAAATTCCGGCCC 

MCGCCGCCGGGGNGGAGMGGGCCGGTTTTTNGNCGTTATNTTGGGGGGCGGCNTTCTT 

TTCCCGCCTTTCNCCTNCGGCNTNCAACTTTGNAACCTTCGGTCTNGNAGGCCTANCGGG 

TTNCAGNTTCNNGGCCTTGNCCGGGNCNGTANNCCGNGGTTATTTCNAGGTCTTTCCATC 

CTTTCAAAAAAGGGGCCGGGGTTANATTMCCGGGGGTTTNAATTCCCCAACCAAGNAAA 

ATTCCAANGGGGGGGGGAATTAAAACCGGNCCAAGGGGNAAAAAAGNAAAACNAATTTGG 

TTNGGAAGGCCANAANAAAAAGGGGCCGCAAGGNCAAAATAAAGGGGGCCNCAAGTGGNA 

AACNCCCGGTTAAAAAAAAAAAAGG 

Sequence 1946 

AGGTACCTTTGTTCTTCCTCCTCCTCCATCCTCCCAGTGCCTGTGACTGGAATGGTGAAT 

GGCACCGCTGGATGGGTTTCTGCAGATGTGGAACAGCCAGGAAGCCAGCCCTGTCCAAAT 

TGAAGTGCTTTCTGTAGGACCAAGAAGGGTCAATTACTATTAGTGGTAGACTTCACAAGC 

CAGGCGGTAGGAAGGCAGGGAGAGCCCTATGAGAACTGAGACCGACTCACCTGGCATGTG 

CTGGATCCATGCATAAAAGCTACTGTATCAATGTTTCAGTTGAGTGAATGAAAAAAAAAA 

ANANNNNAANANNNGGTCCCTGCCCG 
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Sequence 1947 

TACTATAGGGCGAATTNNAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAGGGAGA 

CAGGATGCAGGGTGTGTTCAGAAGAGCTTGCTGAGGTGCTCGGCTCTTAGCATTAAAAAT 

GTGATGTTGGTATATCATCCTGATAGAAAACACTGCTTTCCAAATCCTAGTCACTGGATG 

GGAGGAAAGTAAGAACAGATTCTGCCAACCACTACTGATTTGTTATAATTCTCCCCATTG 

AAATTGGGACAAAATTTGTMCATGCTACTCTGAAATTGATCACATTCTCAATTCTGTCA 

ATAGCAGAGAATTTAGCCTCAACACCACCAATTCAAAATAGTTTAGGTCTTGCCCCTGCC 

TGATTATGTAACAGAAMGCATAMGTTTGCAGGGTGGAGCGGGAGGATTTTTCTAGGAA 

ACTTCCTTCTCTTCCATTAACGTGTG 

Sequence 1948 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCCGAGCGGCCCGCCCGGGCAGGTACTAATTTG 

TAAMTTAAGAAGTTGTAGMCTGTGAACATCAACTATTGCATTATAGGGAAAAAGTTAA 

TTTAAAAGAATAGGACAAGGTAMCTTTTTAAAAAATTTTGCGA^ 

AAMGCATTTCCAGAAGTTTGACATTCACTAGAAGCTCTCTCTGTGCCTCTCTmCCA^ 

CATTCATACTGTTAAAAAAAGACMTATTGAMTATTACCATCAGTATTATTGACTGTTG 

AAGACTGGGCCCTTGCTGTAAGTCAGGCACCATGCTGGATGCAGTAAATGTATGTCCTAT 

TTCAAAGGTAAAGCCACTGCATCTTGCACACACAGTTTGGGTTCAAAAGCTTTAAAGAGT 

AACAAGAAGGTTTGCCCAAGCTTAGTCACATCACTGAGCAAAATCCACCGGATAGAATGA 

GGTTGAGATCTCTGAGAAAGCAAMTACTAATTGACAGTAGCAGTTGGCGGGTCATGTTT 
TT 

Sequence 1949 

GCGGCCGAGGTACTGATCTAACCAAGATATTTTGTTTTTCTCACCCACCAGTNACTTTCT 

CAGTCCTTTCTGTATCCCTTGCMTTTGAACAAAGCTTGGTGAATAGTGTGCACACAAAA 

AGCACACTAGGTGAAAGACAGATACATAAAAAGGGTAAAGNCAGGATATTTTAACAANCC 

TATCAAGCTCTAAATATAAGCCTCCTTGGTAGTTTTCTCTTTAACCCTCTCTCCACTGTT 

GGATGAAATTTGCTGCATTCAATTCCAGNTCCCACCCCAACTTCCTTCTTAACACAAGGN 

CANGGTTAAGCCTTCGGTGCTTTAATCCAGAGGAAAATTACTTATTTTAAAAAGCAGTGA 

AAAACACCGCATTCCTTTGGCACAGG 

Sequence 1950 

CTATAGGGCGA ATTGGAGCTCCCCGC GGTGGCGGCCGAGGTACGCGGGAAGCAGTGGTAA 

CAACGCAAGAC f I I M I M M I I I I I I II M I I ACTGCAACAACCTAACAGGAAAACTAA 

TCAG AGAT CATGAGGAGAGTGGTATATCAGTCAGGGACTAATCAGGAACATAATCCACTT 

GAGATTTTAAAATAGAGAGGGCTTAGGCTGGGGAGTTGGTGACAGCATAATGGGAAATCT 

ACAAAGCAAACAGGAATAGGGAGACAACCCAGAGCCCATCTAGGCTGGCAGAGTCATCAT 

CTCTAGGCTGGAAGGTCANAANAAGGTGGTGTTAGTGGAGCCAGGAGATGCACAGTCTCC 

CAGGGGAAGCTGGAGCTAGGGTGTACCTGCCCG 

Sequence 1951 

AT7TAGGGCGAATTGGAGCTCNTCGCGGTGGCGGCCCAGCACATAATGCAGTATACGCCC 

AATTAAATTTGAGACTCACAGAGTTTATTAGTGGCCACCTTGTCCAACCTCTCCTATTAG 

AGATGAAGAGACTTCCAGAGGTCATACCGCAAGTTGACAGCAGAGATGCAGTAGGAACTG 

MGTGAGGCAGAATTTGA GGAT TCAGGTCCATGGGGAAGATAGTTTGCTGAATGAAGGAG 

CAGTGCTTTAGGGAACAATTrTMTGGAATTTAAAGACAGAATAATTCAGAACTGGATAC 

ATAGTAGATACCAACTGCACCTTGTGGGAATAATTCTCTGGATAGTATCTGGCAGTCTTC 

AGATACTTTGCTGTTATCCTGGCTAAACCACATMGTTGTTGTGTGTTGGTTCACAATCT 

GATCTTGCAAGATGTAAGTTAGMTCTTGACAGTTTTTTTTG 

Sequence 1952 

AGGTACGCAACATGACATTGGCTGGTGTAAAGATCTTACAATTATTTTTAAAGTTTCATT 

GTATTCATTTGATTATATAGTTTCTTGCCATCACAACAAACACTCCTATTCATGTCTAAT 

TCGATACCTGGTGAGTCCACATAAGCATTTGGGTCCCCACTGATGAAGAGCACCATGGCC 
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AGGTCACCCACTTGATCAGGTTCTCTTGCTTTTTGTTGTGCATCTCATCTTATTGTAGGC 
ATTCAAACTGTGGGCTCAGGCAAGCCTTTCC^^^ 

GTGTTAACATGTGCCTCCAATAACTTGCTTTCCTTGCTCACCATGGTGATGCATTTTTCC 
TACTTTATGTCCTCGCTCATTGATTATGATGAGCTGCTGACTATTTGAACCTGGTA 
Sequence 1953 

ATCACTACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCTTCACGGCCGAAGCC 

CGATTCGCGGTAACCGCCGAAGCCGCAAGCCGCGTCGAACTGATTGGCGCTGTTGATCCA 

CACCACGCCAGCCTTGATGGCGGGAGCGATGTCCAGCGCGAGGCTGATCGACTCGGTCCA 

CACGCTGGCGGCCAGGCCGTACACCGTGTTGTTCGCCAGCTGCACCGCTTCAGGCGGCGT 

GCGGAAGCTCATGGCCACCAGCACGGGGCCGAAGATTTCAGCCTGTGCCACGGCGGCCGA 

GGTCGACGCGCCCGTGATCAGGGTCGGCGGGAACCACGAGCCGTCAGCCGGCAGTTCGCA 

CGCCGGCTGCCATACGTCGCAACCTTCGGCACGGGCAGAGTCGACCAGCGCGGCGATGCG 

CTGGCGCTGCACGGGGTCAACCA 

Sequence 1954 

CCGGGCAGGTACTTGGTTTTN I II mTNTTTI I CCTTTTGGCTTTGACTTTGAGTCAGC 

TCAGCATTCTAAATCAAACGCAAACAGCAGAATCATATGGCATAGACACTTAGCAAATCC 

AATGCCTTCCAGGCATCTGTTTCCTGTTGATGAAATCCTCCCTATGGAGAGCAAACTGGT 

TCATATCTTCAGATAGTGTCATTCAACCCCTTGGCAGCTTTCTGGGCTACTAAATATCAC 

ACCGTCTGGTGGGACATTTCACCCMGTTGTTCATTTATTAGTTTTCTGGCTTCGNGGAG 

AATTATCTGCCTTCTCTGAGTATTGATTTCATTGNGCCCCGCGTACCTNGGCCGCGACCA 

Sequence 1955 

CGCGGTGGCGGCCGCATACACGCGGGCGATGACTTCGTCGCGCCGGTTCTGGTCGATGGC 

GCGCGTGACGATGGGGCTGCGTCCGGGCGGCAGCTCGTCGATCACCGACACTTCCAGGTC 

GGCGTAATACGTCATGGCCAGGGTGCGCGGGATCGGCGTGGCCGACATCATCAGCTGGTG 

CGGCACGGC 

Sequence 1956 

GAGCTCCCCGCGGTGGCGGCCGCCGGGCAGGTACTGATCTAACCAAGATATTTTGTTTTT 

CTCATCCACCAGTCACTTTCTCAGTCCTTTCTGTATCCCTTGCMTTTGAACAAAGCTTG 

GTGAATAGTGTGCACACAAAAAGCACACTAGGTGAAAGACAGATACATAAAAAGGGTAAA 

GTCAGGATATTTTAACAAACCTATCAAGCTCTAAATATAAGCCTCCTTGGTAGTTTTCTC 

TTTMCCCTCTCTCCACTGTTGGATGAAATTTGCTGCATTCAATTCCAGTTCCCACCCCA 

ACTTCCTTCTT^CACAAGGTCAGGGTTAAGCCTTCGGTGCTTTAATCCAGAGGAAAATT 

ACTTATTTTAAAAAGCAGTGAAAAACACCGCATTCCTTTGGC 

Sequence 1957 

CCCCGCGGTGGCGGCCGAGGTACGCGGGGGCCCTCAACTTCTGAGAGCTTAAGGTATTTG 

TTCATGTTAGGCAGGTAATGTCTATAAACCAAAAGCTGCCACATACTATCTCTCTGCTCA 

ACACTTTACTGGATACTGGAGGATTTAAGGGAAGGAAAAAAGTGACCTTTTTTGCCTT^ 

TAAGAGCTTGTAATCGGTAAAGGAGATAAAACTCACACAAAGGTCAGGGTATATTTTGTG 

CTAAGCATATGACCGTAGACATTTAAAGTGCTGCCCGTAATGCAGAAAGAAGAAGAAAGT 

GGTGTGATCTAAAAGTAACATAGGTTTTGGTTTTGGTTTGNTTTTTGN^ 

TCTTTTTCTTTTTCT 

Sequence 1958 

CCCGCGGTGGCGGCCGAGGTACGCGGGGGTAGATGGAAGGAAGAACTTGTGTGCTTAGAC 

CTGACGCTGGGAGGAGATGCTGCCACCTAGGTTACTTGTAGGACCCTATACGGCAACCTC 

CTTTGCC AG GAACTATTTATAAACATC CTGCAGGAAAATGCAGTGAAGTAG AAGAGACAG 

GGATATCCCAGAAGGTTATGCAAAACATCAAGAGAAGATGAGAGGTCAGAGATGGGAAGA 

AACAAGAACTTTGACATGCTTGGTGTTCTTGCCCAAGCTTTGAAGAAGTTTACAAAGTCT 

ATATGTCAGAATACACATTTCCCACCTTGCCCAACAGTAGAAAAACATAAGAAGAGAAAA 
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ACATTAAAAAATGACMGGAAGTTAATGGAAGTCAACAATGTGATGGTGTTTGGANGTGG 

AGCCTTCAAAAAGGTATTAATGCCCTTGTAAGAAGAAGGCCAGAAAAGCTTGCGCCCCTT 

CTTTCTGCCCTGTGGANGGAGCCCAAGANNCCGGCTGGTCTTGCNACCrTGCAAGAAGGA 

CCCCTCACTTAGAAGCTAGGCCNTACTTGGGCTTNCTTAATCTTGGGCTTTTCCNACCTT 

NCAG AACTTGTGANNAAGTTNTNTTGTTTGGGGGTTAATCCAATGGGCTATNGGAAATTT 

TTTTATNACCNNNCCCNNGCCAAGANAGGGCCCTTATTTACTTCCTTCCCTT 

Sequence 1959 

GGCGAATGGAGCTCCCCGCGGTGGCGGCCGGGGGCCATTGAGACTGCCATGGAAGACTTG 

AAAGGTCACGTAGCTGAGACTTCTGGAGAGACCATTCAAGGCTTCTGGCTCTTGACAAAG 

ATAGACCACTGGAACAATGAGAAGGAGAGAATTCTCTGGTCACAGACAAGACTCTCTTGA 

TCTGCAAATACGACTTCATCATGCTGAGTTGTGTGCAGCTGCAGCGGATTCCTGTGAGCG 

CTGTCTATCGCATCTGCCTGGGCAAGTTCACCTTCCCTGGGATGTCCCTGGACAAGAGAC 

AAGGAGAAGGCCTTAGGATCTACTGGGGGAGTCCNGAGGAGCAGTCTCTTCTGTCCCGCT 

GGAACCCATGGTCCACTGAAGTTCCTTATGCTACTTTCACTGAGCATCCTATGAAATACA 

CCNAG TGAGA AAATTCCTTGAAATTTGCAAGTNGTCTGGGTTCATTGTCTAAAGCTTGGT 

TCCMCTTTTCCAGMTGCCCCCCAAAGAATTCACTNGGATCTTGGAAAAAGGAAAAAAA 

CCTGATGGNGGTTAACTGAANCCATTTTGGTTGGNACCCTANCCAGGGCTNGATGTCATT 

TAmGGGAAACCCNNACAAACTTGGCTTTIT^ 

GAGAAGC I MM IGGNNCCTCGGCCCCTTTTANAACTTAGNGGATNCCCCCCGGCCTNNG 
GGAATTCNAATNCAANCTTTTNGATNCCCGNNACCCTTGNGGGGGGGGGC 
Sequence 1960 

TATAGGGCAAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGTAGATGGAAGG 

AAGAACTTGTGTGCTTAGACCTGACGCTGGGAGGAGATGCTGCCACCTAGGTTACTTGTA 

GGACCCTATACGGCAACCTCCTTTGCCAGGAACTATTTATAAACATCCTGCAGGAAAATG 

G7AMGCCC7TGGTTAAATTTATTGGeTTCATTTATGTATCTCACMCACTTC 

TTTTAATTCTAGGAATAGAAACTTACTTTTGGAAATACAGNACCTGCCGGCGGCCGTCTA 

GAACTAGTGGATCCCCCCGG 

Sequence 1961 

TAG GGCG AATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGCAAGGAGGCAAGGAGATT 

ACAi I I I iCTTATTTCTACTACTTTTTGTTGCCTMCACCGTTTAGCTGGTGGGACAGGN 

TCTAAGTATTNGCTAAATATTGNTCTCATTATTTTGAACATGTAAAAGATGACTGCATTC 

TTATATATTTCCCTTTTAAGTTTGAAMGTGAACTACTTTCTTTATATAAAM 

GCCTATTGTCTCAGAATATCACATATAACTGGTGCACTGG 

Sequence 1962 

GCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGTTTGTAGATGGAAGGAAGAA 

CTTGTGTGCTTAGACCTGACGCTGGGAGGAGATGCTGCCACCTAGGTTACTTGTAGGACC 

CTATACGGCAACCTCCTTTGCCAGGAACTATTTATAAACATCCTGCAGGAAAATGGTAAG 

CCCTTGGTTAMmATTGGCTTCATTTATGTATCTCACAACACTTCTTTTCT 

TTCTAGGAATAGAAACTTACTTTTGGAAATACAGTACCT 

Sequence 1963 

CTACTATAGGGCGAATTNGAGCTCCCCGCGGTGGCGGCCGAGGTACCCGGCTTGGTGAGG 

AGTCCCAGATTGTGGAACAAAGTATTCCTCTGTTCAATACCATCTTCCAAAAAGAAACAA 

TGACCTAGTGTGTCAAACCCMTGGTGTCCTTTATTAGCAAGCCATTTGTCCCATGCACA 

GTGATGCACCTTGAGAAGCTGTGATGAAATAGACAGGCCGTCCACAAAATGTTGCAATGT 

CTGTATCCTCCTTTATAATCCACGTCACCACACAGGTGAAAATGAACAGGGTATCGCCCC 

ATCTGCTGCTGACCCATGTTGTTTCAATTCCACATAAGAAAGGATGGACTGGAAGTAAAA 

Allll ICATTATCATAATGGTGTTCCCCCAAAAGGTTATTCATTAAATTCAAAAAAAATT 

TGGGCACCTGGATTTTTTTCTTGCGTTTAGCCATGGAGTTCCCTCCAACTTTCTTC 

Sequence 1964 
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ATNGGAGCTCCCCGNGGTGGCGGCCGCCCGGGCAGGTACTCCACAAGCCAGAAGATTAGT 

CTCCTTCCCTGTAGCCTCATCTAGCTTACTTATTCAGACTTTCTGTTCACMGGGGTTTG 

TTATCTTCATGAAATTTTMTATGTGTAAGTTAGAAATGAAATTAGAAAAAGAGGTTCTT 

GCTGTAAACACTGGGCANGAGTAGCTCANAAGAAGACTGTGGTTCTTGGATGCCCTCATG 

CCTTTGTTCCAGTGCAGGTGCCTAGTGGAGGGTGCAGTGCTCTTGCAACATAGAAGGACC 

CATTCTTGCAGGGAGCTTGATGGCTTAACTTTCCTNGGAACCGGGAAGGGAGAAGGAAAG 

GGCCTTTGTTNGGACCCCCMCCTTTTGTTTNGNAATTTGGACTTGGNGC'I 1 I I I I I IGN 

ANAAANAACCCAMCCCGGAAAAAAANTTCCCCCTTTTTC 

Sequence 1965 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATCTCAGAT 

GMGCTGAGCTATCAAGTGACTTGGTTCAAGACCTATCTTTCAGACTGTCTTTCCTTCTA 

CTATGCAGATCMCATCCTATGTMGCTGTGACCCCAGGCAGTAATTAGTTTTCACAATC 

CTTCTTCAAGATTGAGGGACCCTTTTCCCAGACTATGTTTTCATGAAGTAAGCTGGCTGG 

CCTTTGTCTCCCAACATTTATGGAAAGCTTTTGGTGCCTGATGTATACTTGATGTCAAAC 

TCTTTCCCATTTTTGTGTCCCTCTGGACACAGAAGTCAGGTAGGTCCAAGACTTCGGTGT 

CCTTTTCCTTCTCTGAGCTCAG I MINIM I GTTCACAGNAAATGTTTATTTTTGTTCA 

TATTATGCCCAGGGGAATTTTACCCTACAAAAAA 

Sequence 1966 

CGCTTTCTCATAGCTCACGCCTGTAGGTATCTCAAGTTTCGGTGTAAGGTCCGTTTCGCT 
CCAAAGCTGGGCCTGTGTTGCACAGAACCCCCCGTTTCAAGCCCCGGACCCGGCTT 
Sequence 1967 

CCGGGCAGGTACGCCTACTCAACCCGGCTGTTCACCATTGATGGCATCAGCATCCCATAC 

ACATGGMCCACACCGTTTTCTATGATCAGGCACAGGGAAGAATGCCTTTCTTGGTTGAA 

ACACTTCATGCATCCTCTGTGGAATCTGACTATAACCAGATAGAAAGAGACACTGGGTTT 

TAAAATTCATGCTTCAATATCCAAAGGAGATCGCAGTAATCAGTGCCCCTCCGGGTTTAC 

CTTAGACTCAGTTGGACCTTTTTGTGCTGATGAGGATGAATGTGCAGCAGGGAATCCCTG 

CTCCCATAGCTGCCACAATGCCATGGGGACTTACTACTGCTCCTGCCCTAAAGGCCTCAC 

CATAGCTGCAGATGGGAAGAACTTGTCAAGGATATTGATGGAGTGTGCTTTG 

Sequence 1968 

GGGGCCATTGAGACTGCCATGGAAGACTTGAAAGGTCACGTAGCTGAGACTTCTGGAGAG 

ACCATTCAAGGCTTCTGGCTCTTGACAAAGATAGACCACTGGAACAATGAGAAGGAGAGA 

ATTCTACTGGTCACAGACAAGACTCTCTTGATCTGCAAATACGACTTCATCATGCTGAGT 

TGTGTGCAGCTGCAGCGGATTCCTCTGAGCGCTGTCTATCGCATCTGCCTGGGCAAGTTC 

ACCTTCCCTGGGATGTCCCTGGACAAGAGACAAGGAGAAGGCCTTAGGATCTACTGGGGG 

AGTCCGGAGGAGCAGTCTCTTCTGTCCCGCTGGAACCCATGGTCCACTGAAGTTCCTTTA 

TGCTACTTTCACTGAGCATCCTATTGAAATACACCAAGTTGAGAAATTCCTTGA 

Sequence 1969 

AGGTACTGATCTAACCAAGATATTTTGTTTTTCTCATCCACCAGTCACTTTCCCAGTCCT 

TTCTGTATCCCTTGCAATTTGAACAAAGCTTGGTGAATAGTGTGCACACAAAAAGCACAC 

TAGGTGAAAGACAGATACATAAAAAGGGTAAAGTCAGGATATTTTAACCAAACCTATCAA 

GCTCTAAATATMNCCTCCTTGGTAGTTTTCTCTTTAACCCTCTCTCCACTGTTGGATGA 

MTTTGCTGCATTCAATTCCAGTTCCCACCCCAACTTCCTTCTTAACACAAGGTCAGGGT 

TAAGCCTTCGGTGCTTTMTCCAGAGGGAAAATTACTTATTTTAAAAAGCAGTGAAAAAA 

CACCCGCATTCCTTTGGC 

Sequence 1970 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGAGACCTGACGCTGGGAGGAGATGCTG 
CCACCTAGGTTACTTGTAGGACCCTATACGGCAACCTCCTTTGCCAGGAACTATTTATAA 
ACATCCTGCAGGAAAATGCAGTGAAGTAGAAGAGACAGGGATATCCCAGAAGGTTATGCA 
AAACATCAAGAGAAGATGAGAGGAGATGGGAAGAAACAAGAACTTTGACATGCTTGGTGT 
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TCTTGCCCAAGCTTTGAAGMGTTTACAAAGTCTATATGTCAGAATACACATTTCCCACC 
TTGCCCAACAGTGGAAAAAAAAANAAANN AAN NAATAGGTACCT 
Sequence 1971 

CCGCGGTGGCGGCCGAGGTACAACCACCCCCCGGGTATCCATAACCTAACAATAACAATT 

GTTAAGCATTCGGCCACTCATGTTTCTTCAGCTCCCCCACCTCCTCCCTATCCTCCCAGG 

TTCCCACTGAATTATATATCATTTCATTCATAAATAATTCTGTATATTTAGTTACAGCTC 

AATGAATTTCCACAAAGTGAACATGCTCATGCAAACCACACCTGCACTGGGAAATAGAAA 

TTACAAACACCCGGAGCCCCCTCTGCAATACTCTCTCAGTCCTTTCCTCACTCCTTCCCA 

A AAGAGT AGCTACTATCACCTATTGTGAACCTTGCATATATGGATTCATAGAACATGTGT 

All I n IGGTTTGGCTACATTTGTCTAACATTGTTGGTAAGATTTATCCATGTTGTTGCA 

TATAGCCATAGTTTATTTA 

Sequence 1972 

ACGGTGAGCCGTTGCTTGTTCTCAAAGAGCCA7TTCTTGAACGACCCGTTCCATTGGCGC 

ACCATCTCGGTGTGTGAAGTCGACCGACGGAACAAGACCAAGCCTGTCGACCCCGCCATG 

TGCCAGCTCCACTCTGGAGCAAGCTCAACTTGCCACCCGATGAGCTTGGCCAAGTTCTTG 

TGCTTCAATAAAGCTNGGNACACTACCATCTTCGCTCAGGTAGGGCATTGACACCCGCGC 

GGTGCAATATTCGATGCGCTTCCCAGACTCCAGTGGCAGATAATCATAGTAGTACTGCAT 

GTGGTCTTGGCCTGGAAACTCGATATTGGCTGCTGAGCCCCAAGCCGGCAGGCACGCTTT 

GTAATCGACACCCATTTGCAGCTGCGCAACGACTNCTTTTCCGCTCCCCGGGGCCTCGAT 

ATGGCCCAAC 

Sequence 1973 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCCGAGCGGCCGCCCGGGCAGGTACCTTAGC 

AAATAGATAGTTTATTCACCAACTATAATCACCTTAACAATAAAAGCTCTCTTTTCGAGA 

ACACTGCGTGTCAGGCACTTCATGAAAATTATACTAAACACTTTATGAAAATTATCTCTA 

GCTGCAAATTCCCGCGGATGCTATGCAAATTAGGTATAGCTACACTTTGCTGATGGGGAA 

AGTATAGCTTAGAGAAGGTAGACGAACTGCCCAAGGCCAGAGCTATGTGGCAGAGATGGA 

ATTCAAATGCMGTATAAATAATAGAATAATTTTCAGTTGGAAATTTCAAAMGCTGTTC 

TAGTTACAJTGATGTTAAAATCTGTTATTCATTTAAGATGAAGATGGCAAGGGGAGATAA 

AATTACAGGGAAGTGGTGAGGATAGAGCTGAGGAAAGAMCTTGGAATGAGATTTTAAAA 

AGAAATCTGGAAA 

Sequence 1973 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCCGAGCGGCCGCCCGGGCAGGTACCTTAGC 

AAATAGATAGTTTATTCACCAACTATMTCACCTTAACAATAAAAGCTCTCTTTTCGAGA 

ACACTGCGTGTCAGGCACTTCATGAAAA7TATACTAAACACTTTATGAAAATTATCTCTA 

GCTGCAAATTCCCGCGGATGCTATGCAAATTAGGTATAGCTACACTTTGCTGATGGGGAA 

AGTATAGCTTAGAGAAGGTAGACGAACTGCCCAAGGCCAGAGCTATGTGGCAGAGATGGA 

ATTCAAATGCMGTATAAATAATAGAATAATTTTCAGTTGGAAATTTCAAAMGCTGTTC 

TAGTTACATTGATGTTAAMTCTGTTATTCATTTAAGATGAAGATGGCAAGGGGAGATAA 

AATTACAGGGMGTGGTGAGGATAGAGCTGAGGAAAGAMCTTGGAATGAGATTTTAAAA 

AGAAATCTGGAAA 

Sequence 1975 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAGGGAGACAGCATGC 

AGGGTGTGTTCAGAAGAGCTTGCTGAGGTGCTCGGCTCTTAGCATTAAAAATGTGATGTT 

GGTATATCATCCTGATAGAAAACACTGCTTTCCAAATCCTAGTCACTGGATGGGAGGAAA 

GTAAGAACAGATTCTTCCMCCACTACTGATTTGTTATAATTCTCCCCATTGAAATTGGG 

ACAAAATTTGTAACATGCTACTCCGAAATTGATCACATTCTCAATTCTGTCAATAGCAGA 

GAATTTAGCCTCAACACCACCAATTCAAAATAAGTTTAGGTCTTGCCCCTGCCTGATTAT 

GTAACAGAAAAGCATAAAGTTTGCAGGGTGGAGCGGGAGGATTTTTCTAGGAAACTTCCT 

TCTCTTCCATTAACCGTGTTGAATGAGTAAGGACAGTTAAT 
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Sequence 1975 

GGCGAATTGGAGCTCCCCGCGGT6GCGGCCGCCCGGGCAGGTACAAGGGAGACAGCATGC 

AGGGTGTGTTCAGAAGAGCTTGCTGAGGTGCTCGGCTCTTAGCATTAAAAATGTGATGTT 

GGTATATCATCCTGATAGAAAACACTGCTTTCCAAATCCTAGTCACTGGATGGGAGGAAA 

GTAAGAACAGATTCTTCCAACCACTACTGATTTGTTATAATTCTCCCCATTGAAATTGGG 

ACAAMTTTGTAACATGCTACTCCGAAATTGATCACATTCTCAATTCTGTCAATAGCAGA 

GAATTTAGCCTCAACACCACCMTTCAAAATAAGTTTAGGTCTTGCCCCTGCCTGATTAT 

GTAACAGAAAAGCATAMGTTTGCAGGGTGGAGCGGGAGGATTTTTCTAGGAAACTTCCT 

TCTCTTCCATTAACCGTGTTGAATGAGTAAGGACAGTTAAT 

Sequence 1977 

CCGCGGTGGCCCGCCCGGGCAGGTACATGACATTATTGTACGCTGTTGTl TACATTACAG 

GAATACTTCGGATATACCAATGCATGTGAAGAATGGGTTTCCCCTCAGTTTCACATAAAC 

TTAGAAATGTCTCAACTGACATTTCTTTTTC lllllll 1 AATNTTTAATTN rTTTTTTTI 

TTGAGACAGTTTCGCTCTGTCCNCCCANGCTANAGTGCAGTGGTGTGATCTCAGCTCACT 

GCAAGCTCCGCCTCCAGTGTTCAGGCCATTCTCCTGCCTCAGCCTCCCAAGTAGCTGGGA 

CTACAGGTGCCCGCCACCACGCTCAGCTAATTTTTTGTATT7TTAGTAGAGACGGG 

CACCATGTTAGCCAGGATGGTCTCCATCTCCTGACCCTGTGAACCACCCGCCTTGACCTT 

CCAAAGTGCTGGGATTACAGGGGTGAGCCGCCNCGCCCGGCTCA 

Sequence 1977 

CCGCGGTGGCCCGCCCGGGCAGGTACATGACATTATTGTACGCTGTTGTTTACATTACAG 

GMTACTTCGGATATACCAATGCATGTGAAGMTGGGTTTCCCCTCAGTTTCACATAAAC 

TTAGAAATGTCTCAACTGACATTTCTTTTTC lllllll I A ATNTTTAATTN rTTTTTTTI 

TTGAGACAGTTTCGCTCTGTCCNCCCANGCTANAGTGCAGTGGTGTGATCTCAGCTCACT 

GCAAGCTCCGCCTCCAGTGTTCAGGCCATTCTCCTGCCTCAGCCTCCCAAGTAGCTGGGA 

CTACAGGTGCCCGCCACCACGCTCAGCTAATTTTTTGTATTTTTAGTAGAGACGGGGTTT 

CACCATGTTAGCCAGGATGGTCTCCATCTCCTGACCCTGTGAACCACCCGCCTTGACCTT 

CCAAAGTGCTGGGATTACAGGGGTGAGCCGCCNCGCCCGGCTCA 

Sequence 1979 

CCGGGCAGGTACGCGGGAAAGAGAAGGAGAATATACAGTCTTTGTATTGACCTGTATGCT 
TCTCATTTTTGTTCTTTTTGTTCCTr^ 

CCTCAGCCTGAGGGATATCAGCATTTCTTGTGCTATGGATCTTCTAGCAACATATAATTT 

CATCTTTTATGCATATCCAGAMTTGTTGCTTTGCTTTGATTITTGTTT^ 

TCTGTGTAGAATTTCTGGTTACCGTTTTTTrCTTCCCAGCATTTTGAGATATT 

TGAGTTCTGTCATGCATTTGCCTACTTGGGTGAAATGTCCCACCAGACGGTGTGATATTT 

AGTAGCCCAGAAAGCTGCCAAGGGGTTGAATGACACTATCTGAAGATATGAACCAGTTTG 

CTCTCCATAGGGAGGATTTCATCAACAGGAAACAGATGCCTGGAAGGCATTGGATTTGCT 

AAGTGTCTATGCCATATGATTCTGCTGTTTGCGTTTGATTTAGAATGCTGAGCTGACTCA 

AAGTCAAACTGNAAGTACCT 

Sequence 1980 

CCGCGGNGGCGGCCGCCCGGGCAGGNACGNNGNTGGAGGGGGTGTNAAAACCCCCNGACG 

CCGGGAGGAGANGCTGCCACCTAGGNNACTTGNAGGACCCNATACGGCAACCCCCTTNGC 

CAGGAACN ATN NATAAACATCCCGCAGGAAAANGAG N CN ATATGNCAGAANACACAN NTC 

CNACCNNGCCCAACAGTAGAAAAACNNAAGAAGAGAAAAACAANAAAAAANGACAAGGAA 

GNCAANGGAAGNCAGCAANGNGANGGNGNTTGGAGGGGGAGCCCNCAGAAGGGAANNAAN 

GCCCCNGGAAGAAGAGGCCAGAGAGCNNGCGCACCCNCNNCCCGCCANGGGAGGAGCCAA 

GAAGCCGGCNGGCNGCAACCCGCAAGAGGACCCCCACCAGAAGCCAGCCANACCGGCANC 

CCCAACNNGGCCNNCCAACCNCCAGAACCGGGAGAAGGANANGGNNGGGGGNNAGNCAAN 

GGGCCAAGGGAANNNNNNAAAGCAGNCCCAGCCAAGACAGGGCCCCAANNACCACANACC 

AANNAAANNANNANANAGGCCCCCNNCANAAACCCANGGNCAAANAAGAGAAAAGAA 
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Sequence 1981 

CCGGGCAGGTACAATCATTACATMTCATGACATTTTAATTCTTAATTACMTGCCTTTC 

CAACTATATAAAAATATGTCTAGTGACACATATTTTGGAAAGCTTTCTTGGAAAAACATC 

CAGAGATTGTGTTCATCACTATCTATATAAATGAGCTTATACTGAACCTTAAATATGTTT 

ATTCAGCCTMCAATCAGAAAMGAAAACGTTTTTTMCAGTATAAAGTTCAGATGAGTC 

CACTTGTCTTGGAATTTTATTTTGATTTTMTTAGAAGAAAGACTGCCA 

TTCT MTGA MTCTTTATAAATTCTCTGGGCAGTGCCCTGTATCTTTATTAGAAAGACTT 

CTTAl 1 I I I MCAATTTTATTTGGAAGAAGTNCTACTCATTT 

Sequence 1982 

AGGTACTCAGTAACTTCCTGGGACCCCTTTTGCCTATCTTCTGAATAGAAACATAGAGCT 

TCTTGCAGAGTATTAGCTCCTGGCTTTATTGCTTGCTCATTTCTACATAACTGGGTCAAC 

CCTGGGGCACCACAGTGACAGAAAAAAGAAAGAAAAATAATGGGGTCCCCACCCTGTCAC 

TCTTTGGAGTTT ATGGTTCCCTTTTCTTAGTTCTTCTTGTCAAAAAGATAGGTTTTTCCT 

TCAGGATTTTAGATCAGCTATCACCGCCACATTGCAGCTGTATGATTGGGGTGGCCCTTG 

ACACAGGACCAAAAGAGAGCAACAACAACAGCATCAACAACAACCATAGTGATCCTACCC 

CCTACTCCTGGTGACGGCTTCCTGGGGCTCCTTATCCCCATTCTTGGGACAGCTTATCTC 

AGATTTTCTCTGCTCACCCCTGCTATGCAGTTCCAAATTTGATATTTGGAACTGGTAGTA 

TCTTCAGATC AAGTTGAGAGAC AN AN N AN AAAAN NN N AN N N N N N AAGTACCTGCCCG 

Sequence 1983 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGTATAACAACATGAATTCATCTCAAAGACATT 

ATGTTGAAGCCAGATTTGACATMCTCATTCTTTGTCATTCCATTTATGTGMCTTTGGG 

AATAGTCAAAACTTAGCTATGGTGACAGCAGTCAGGGCAGTAATTATCTATCGAGTGTGG 

AGAT TGACTGAAAAGGGCACAGGAGAACTTTCTGAAATAATAAAAATGTCCTATATCTTG 

TTTTGCATCGTAGTTACACAGGTGTAGACAGTTGCTAAAACGCTAAAAACTAAATGTGTA 

AGATTTATGCATTTTAGTGTATTTAMTTATAAC 

CTGGAGGGTAAATATGCTCGTTAGGAGCATAAGGGGATGATTTTGAGAAACCCTTGCTCT 
GTCATTCTGTCAGTAAACCGCCACTAACTTTTCTGGTTTCAAAAAAGCAAGGTCCATTGA 
TGAAACTCCTTTGT GCCT TAATGACAAATAACATTGATAAGGCCTACTGGCAACAAGAGT 
GNGGTTGGTATATCTTTTGGTTTTAGNGACTAAATTTGGANGAATGATATTT^ 
Sequence 1984 

CACTATANGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGCCCGGGCAGGTACTGGTGTT 

A ^^ATCTATTGCCATCATTTCAGGGACACCTAGGAAAATAAATTTTCAATTGCATAAAA 

CCTAATTATTTCATACTGCAGACATTTGATCAAAGGCTTACTTGATCAAAGGCTTGTCCC 

AATTGCTCCTCCAGGGATTCGTTCTGCTCAGATAAGATATGGTTCCGCTTTAAGAGAGCT 

TGTCCAATTCGAGCAGCGAGTTCCAGATCACGATCCCTCTCTGCCAGGAGATGTGTAACC 

ATGTCGATGTCATTGTAAGTTTTGGTCATCTGCTCCACCCTGTCTGGTGCCTAAGAATCA 

TGTAACCGGAAAGTCTCTTCAGCAAGGAC 

Sequence 1985 

CCGCG GTGGCGGCCCGAGGTACAACTTGACAGTTTATAAAAATGCTAACCTATATCAACA 

TTTTTCCCCACCAAAGTGTTCCCAAGGGCAAAAGTMGTGAATACATGTTTCCATGCTTG 

TGTCAGGGGCACACAGCAAGGGAAGGCAGAATGGGCATGACCTAGATTAGATTCCAGAAG 

CCTTCAAATTCCTTATAAGACTTGGG ACTGCATCTTAAAATAATATTGTT 1 1 rTTTTATT 

CTTTGATMTAAAAGMTGGMCCAAATCCTGGTTGTCTGATTCTTTGCCCAGTGCTCTT 

TCTTATGTCTAACACTGCCTCGTCTTATGAATAACTCTGACATTACAGAGAAAGACTATT 

GTTTGTTCAGTTCAACACTCAATTCCATAGGTGCTAGCAATTATGAAAAAGGCTGTTAGT 

ATCCAGCGGGGAAAGCAAANGATCCCGCAGGTGGTAACACCCTACTGAGAGCAGAGGGGG 
ATGAATACA 

Sequence 1986 

CCGCGGTGGCGGCCGAGGTACACTTGGTGCAATAAAGTGCATCTTTAAAGTGTTTTGAGT 
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TTGGACACAGACCCGTGAAACCAACACCATAATCAAGATTCATGATGAATATGTTCCACG 
TAGTTTTCATTTAAATCCCTCTCAGTAGCATATTTGTCTGTTTAAGAGCCATTTGCATTT 
TTTCCATAAATTGCTTGTTCCTCTCCCCCTGCCCGCCCCCGCGTACCTGCCCG 
Sequence 1987 

ATAGGGCGAATTGGAGCTCCGCGCGGTGGCGGCCCGAGGTACACTTATAGTTGAGAGCCA 

AGTCTCCCTTATCATTGGTGAATGAGAATGAGCTACTGAAAACAAAAAGAGGGTCTTCTA 

CTCAGCCTCTACCCCTAATATTTATATCAGAAGCAGAGATTAACTGTCCTTACTCATTCA 

CACGTTAATGGAAGAGAAGGAAGTTTCCTAGAAAAATCCTCCCGCTCCACCCTGCAAACT 

TTATGCCTTTCTGTTACATAATCAGGCAGGGGCMGACCTAAACTATTTTGAATTGGTGG 

TGTTGAGGCTAAATTCTCTGCTATTGACAGMTTGAGAATGTGATCAATTTCAGAGTAGC 

ATGTTACAAAT1TTGTCCCAATTTCAATGGGGAGAATTATAACAAATCAGTAGTGGTTGG 

CAGAATCTGTTCTTACTTTCCTCCCATCCAGTGACTAGGATTTGGAAAGCAGTGTTTTCT 

ATCAGGATGATATACCAACATCACATTTTTA 

Sequence 1988 

NACTCACTATAGGGGCGAATTGGGAGCTCCCCCGCGGTGGCGGCCCGCCCGGGCAGGTAC 

GCGGGTATCCAGTTATTTCATTGGTTGAAATAGCTGACCGCTTATAATAAAGTAGTCTGT 

GGACTGCTTAAATTTTGTAGTGAATTCCTGTGTGTGCATTTTTGTAATGGTCATTCTCAG 

TCGTAAATGTCTTTAGAAAGTCATCTAACACAAATACATTGAGAGAAGTTTTCCTAGCAT 

AAGCCTGGTCATAAACTCCACCTGATAACTTGGGGCTTGATTTAAATCAAGTGGAAGCTC 

TGTGCAAAMGATGGATATGTGACCAATATAATAATGGCTCCTTTTGCAAGAACAATACT 

TCTGAAATTAAGGGGAATGCATTAAGATGTGTATGTAATAMTAATTGAAMTAATTTTG 

GAGGCAGAAGTTATTGTAAGTGAAATTCTGTCATAAGGTGAAACAATCCT 

Sequence 1989 

CCGCGGTGGCGGCCGAGGTACTGGGGCCTTACTTCCCTAAAGACCCAGGGAGGGGAGAGA 
AAGTGCTCTGCTTGTATCCTAAGCACAAATGCAATACAGCAACATCAGTAATGGTAACTG 
AGTGCTTACTGCATGCCAGACCCGTGCTAAGCTCTCTCCATGCACTATCTTAATGTTCAC 
AACCTCACACAGTAAGCATTATTAACAACAGCACTTTACAAATAGAAATTTGTTGAGGCT 
CAGAGTTGCCATAGCTGGTGAGCATCCACAGCCTACATTCAAACCCAAGTTTCATTCCAA 
AGCTTTGCTTTCCTGCTGCTGTATGTCTTTTGCGTCATCTTCGCATATTTAACCTCCTTG 
GCTCAACTGGGAACTCTTTGAAGGCTAGGGCCATCAATTACAI I I I I GTTTCTTCAA 
Sequence 1990 

CGCCCGGGCAGGTACAGTGTCAGGGCTTATGCTACCAGACTAGGTGGAGGAGATGACATG 

CAGATTCAGGAGAGGATCCAGGGCTCATGAGGACGTGGAGGTTTTGTTCTAAGCATTGGG 

TTTTTGTTTGGC I I 1 1 ICTGTGTAAACTTCTCTTTCACCTTCCATTCAGAATGGCTCCAA 

ATTCTTTCTTCAATAGTGGAAGTATTATTGTTAGTCAGG I \ I I I CAATACTTAATAGTGG 

TCTGTATTTGTGGGTGCATGTGGTATTTTGGTGCATGCATACCATGTGGAATTATCAGAC 

CAGGGCNGCCGGGACATCAGTCACCCCAGACATCCACCCCTTCTTTATGTTGGGAACATT 

TCACTTNCTCTCTTTCAGCAATTTTGAACAATGCAACAAACTACTGTTAGCTTTAAACAC 

CCAACTGTGCTATCAAACAGTACATCTTACTTCACTAAATGNGGNTTTTATGCCCACGCC 

AN N CTTCCTGG ATAAAACTTTGTCT AAAGTGTTTC AAAG AGGCTGCTCAAAGG AAATGTG 

GCTTGTTTTGGGANGTCTGAAATGTTTTCAAAATTTCTGGCTTTGAAAAAA 

Sequence 1991 

CGGCCCCTCGTGGTGGCCAGGCGCGATGCGCGCGGCGTGGCAGAACTGCTGCGTCCGTAC 

ACCAAAGAGGCGCTGGCGCCCGGCGAATTCCTGCTCGAACTGACGCCGAAGGATGGCAAT 

TGGGTGCTGGTCAGCGATGCCTGGTTCTTCAAGGAAGGCGAGGCGACGCGCTGGGAAAAG 

GCCCGCTATGGCGAGTTCCGCGTGCTGCCCGATGGCCGCGCCTTGCTGGTGGGCATGCGC 

GGGGAAGATTTACAGGCACTGTAACCGCTTGATTGAACAAACCCAATG 

Sequence 1992 

CGGGCAGGTACTTTGGCACATGCTGGTAGCCAGGAGTCTGGGCCTGAAATTTGGTCCTGA 
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CTCCACCCTCATCTCTCTGCATGACTTCTCTGGGAGATACAGCCTeTCCATTTTACACTG 
AGAGAACTAAATGAGCTCTAAAGCTGCCCTGACAGCTGACAGTCAAGGTTAGCATATTTC 
TGTGTGGCTCTGGCAAAAGTACCT 
Sequence 1993 

CCGCGGTGGCGGCCGAGGTACACTTATAGTTGAGAGCCAAGTCTCCCTTATCATTGGTGA 

ATGAGAATGAGCTACTGAAAACAAAAAGAGGGTCTTCTACTCAGCCTCTACCCCTAATAT 

TTATATCAGAAGCAGAGATTAACTGTCCTTACTCATTCACACGTTAATGGAAGAGAAGGA 

AGTTTCCTAGAAAAATCCTCCCGCTCCACCCTGCAAACTTTATGCTTTTCTGTTACATAA 

TCAGGCAGGGGCMGACCTAMCTATTTTGAATTGGTGGTGTTGAGGCTAAATTCTCTGC 

TATTGACAGAATTGAGAATGTGATCAATTTCAGAGTAGCATGTTACAAATTTTGTCCCAA 

TTTCAATGGGGAGMTTATMCAAATCAGTAGTGGTTGGCAGAATCTGTTCTTACTTTCC 

TCCCATCCAGTGACTAGGA7TTGGAAAGCAGTGTTTTCTATCAGGATGATATACCAACAT 

CGCATTTT 

Sequence 1994 

CGGTGGCGGCCGAGGTACCGGTTGATGTAAGCACAGGGATGGTGGGGACAGGGTGACCAA 

AGTGAACTGGGCACGAGATGAGTAGCTGGTATATAGTGACTTGGAAGGCAAGGTGCTTGA 

AATTTCCTGGAAMTCTCTGCAAGTGCACAGTAGAMCTATTATCATTGGTTACGTGCTT 

CAAGAGG ACTGG GCAGATGGGGGGGCAGGAATAAGAGAGCCACTATTCACTTGACAAATT 

CTTGGACATTTTGATTTCTGAGCCATATGGATGTGTGTCCTATCAAAAAGAATAAGTGAA 

AATGTTCAATAGTAAGAGMCACTTTGTAAATCTCTGGCTGCTGCTCTTTGTGATTAGCC 

TCTCAGCACTCTTATTTGGMTAATCAGAAAAATACMCTATCTAMTTTTGGGAGGAGA 

GTATACTTGTGGAGaTTTGGGAAGAAAGAAAATAAGTATGGAAAGTTTCCCCCAATAATA 

AAATGAMTTCATTGGAAAGCCATTTCAAACATTTAGAAAGTTAACCCAGAAAAATAGAA 

AGG 

Sequence 1995 

GGCGGCCGCCCGGGCAGGTACTCGNNAAGCAGTGGTAACAACCCAGAGTACTCGGGAAGC 

AGTGGTAACAACGCAGAGTCCCGGGAAGCAGTGGTAACAACGCAGAGTCCCGGGAAGCAG 

TGGTAACAACGCAGAGTCCCGGGAAGCAGTGGTAACAACGCAGAGTCCCGGGAAGCAGTG 

GTACCT 

Sequence 1996 

CGGCCGCCCGGGC AGGTACCN N N N ACTN ATTNTGCTGTG ATCCTAAAACACTCTAAAAGA 

TAAAGCCTATTAAMCCATATTTTTAAAAAACTGACAGGTGTTCTTAAGATTAACTANTG 

AGTTCAGAGAATAAATNGAGGATGAGATGAATTTCTAAAAACAGCAGCACCATACATCAA 

AAATACCCAGCTAGAMTTATAAATTTTTAAAAGATCACAGAAGATAMCAAAMCAAAC 

CCCTCCCCATCTTATCTTTATTATTTNTTCTTATTTGTATAAAAATNAGGAGATATAAAT 

AAATAGAAAACACTACTGAMTATTTACAGGGATTCATGAAAACAGATTTTAAGAAATTA 

AGATACATATTGTCAGGTGTGGTGGGTCACGCCTGTANTCCCAGCACTNTGGAAGGCCAA 

GGTGGGTGGATCACTTGAGGTCAGGAGTTTGAGGACCAGCCTGGCCAACATGGCAAAACC 

CCATCTCTACTAAAAATACAAAA 

Sequence 1997 

CCGCGGTGGCGGCCGAGGTACGCGGGGGTAGATGGAAGGAAGAACTTGTGTGCTTAGACC 

TGACGCTGGGAGGAGATGCTGCCACCTAGGTTACTTGTAGGACCCTATACGGCAACCTCC 

TTTGCCAGGAACTATTTATAMCATCCTGCAGGAAAATGGCCTCTTTAAATATACACTTC 

TCTGTAGTGTATGCTAGAAATGGAGTGGCTGGAATAAAAGTGGCTGAATCATCTTCAACT 

CTAGTAGCTGAGACCAATGATGCTGACCTCCCTCAAAAGCTGCATTTCTGAATTTCTGAA 

GGCAAACTGTCTGCCTATATTGTACCTGCCCG 

Sequence 1998 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATTCTAATCCCAGT 
TTACATTTAAAACACTTCAAATGGAGCCACTAGGACTCAAAGTTGCTAAATACACCCATG 
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CCCTCTTATCGCCTTCCACTGGGGGTGGTTCTTCGGTATTTTAAGTCTTGGAAGTTAATT 

TTTGAGGAAAAAAAAGTAATAGACATGTGACAAAAACAGGAAGAAGGGACACAGTTCAAC 

TGGGTTGTAGCTCTMTCTCACTTTATCAGCTGCTTGAAAGAGGGAAATGAAGGACAGAG 

CGGTAGGAAGTCTGCACACACCAACCTCATGGAGACCAGCTCAGCCCCCAATGCAGTCAA 

GCTACCTAACCTCCTGGAAACATAAAGATCGGCGTGTTACTTG 

Sequence 1999 

NCCGGGCAGGTACTTGCTTAGTCAGTTCTGTGGGCCAGAGGACACTCATGGGGAGAGATC 

AAGGTTACTTTTTGCAGGTGGGGCATTTTGTAACTGAATAAAACCCAGTCAGTAAAAGTA 

ANICNACATGCNGAATCTANGCAAGTGTTGCAATGTATGTNTGCTCTTGATCTGAGGAAGA 

CCATGCATTTGACTCAGACTAGAAACCTACACCACAGAAATCCCACCCACAAGCAATGCT 

CTATTACTGTAGGTACCTCGGCC 

Sequence 2000 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTGGGGCTACAGGCAGG 

CACCACCTCACCCAGCTAAI MM IGTAI I I I I AGTAAAGACAGGGTTTCGCCATGTTGC 

CCAGGCTGGTCTCGAACTACTGGCCTCAAGTGATCAGCTCATCTTGGCTGCCCAAAGTGC 

TGGGGTTACAGGCGTGAGCCACCATGCCCGGCCCCCGCAGGACTTTCTGCTGTTTCTCTC 

AATTTTATGTAAGTAAAACAACAGCAACAATAAAATCCCTGTCACTTGAACACAGAATGA 

ATTCTGCTTTATATATTTTGTATAGTTTGTATAGATATTTTGTATAGTTTATGAGGC^ 

GAACTTTCCAGAACAGCAACTACTGTGTATACTGATGGGGCAA 

Sequence 2001 

AGGTACATTCTAATCCCAGTTTACATTTAAAACACTTCAAATGGAGCCACTAGGACTCAA 

AGTTGCTAAATACACCCATGCCCTCTTATCGCCTTCCACTGGGGGTGGTTCTTCGGTATT 

TTAAGTCTTGGAAGTTMTTTTTGAGGAAAAAAAAGTAATAGACATGTGACAAAAACAGG 

AAGMGGGACACAGTTCAACTGGGTTGTAGCTCTAATCTCACTTTATCAGCTGCTTGAAA 

GAGGGAAATGAAGGACAGAGCGGTAGGAAGTCTGCACACACCAACCTCATGGAGACCAGC 

TCAGCCCCCAATGCAGTCAAGCTACCTAACCTCCTGGAA 

Sequence 2002 

CCGGGCAGGTACTTTATATTACTTACTTGTTCACTAATAAACTTCTGGCATTGCATGTTT 
ATTATTCTAATTTGGATTAATAAATATGTCAA 

TTTTCAGCTTATAATnTGTCTACTTAGTATCCTCTAAGTAGCATATCCTAATCTATTCT 

TAAGATTCACATTGTTATATGCTTGACATAACTTTACAACCATCACCATGGAGAAAGAGA 

TCTATAATGTAGACAGATTTTAATTTAATCATAATATCATC 

ACCTACACAGGTGCCATTTAAATAGGTGACAAGCTAATATTAAAATTATGATGGGAGTTG 
GGGA 

Sequence 2003 

ATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGGTCCCCTCACACCTTCTGTTGATGCC 

TTATTACCACCAGGCAGATTAGAAGTTCTGGTTTCTCTAAATTTCAGTTGACTCTGTAAA 

CTTATAATACTTATCTTATAATTCTGTTGAGAATATAAAATGAAATGATGCATGTAATAT 

ATCAAGCCACTGTCTGATGTAGAATAATATTACTAACATTTATGATTTTGAAAATTAATA 

ACAAAGTTGGAGGAAATACACCACCAACTTCAAATGCTAATATGAAGTTACAACGATCAA 

GATAGTGTGGCATTGGTGTAAAMCAGACATGCTTATCAATGGCGCAGAATTAAAAGAAT 

AGAAATTAACTTCTGCTTTG 

Sequence 2004 

CCGCGGTGGCGGCCGAGGTACCTGCACTTCAGACCCTGGCTCCATGTACACCTGTGCTCC 

AAACCTGGCTCTGTGGCTACTGCACAGGTGCCAGCAGCTCAAACAGTCAAACAGTGCCAC 

CCTTGCAATAAGGAAACCTACAAGCTAGACCGGATATCAAGAAAGATTCCTTCAGACATG 

ACATCCCCTGTGGMGAAAMGACATTAGGGAGTCCCTAGCAGCCTTTGTCTTTGAAGAT 

CCCTTACTGCTGCTGTGGATGCCCACAGCCTTGTTTGTTGAAAACCCTTGCAATCTTCAC 

CAATTCTGATCTTCAGCTGACAAAAGATGCACAGGGGGAGGTTTGCCA 
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Sequence 2005 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTTGGAATGCTGTGTGGAGTCCTTTTTCTACTT 

TTGTTCCTGAGAAAGAGAGATTCACAAACTTCCCACATTCTTCCAGTGGCATCTAAGGAT 

CCCTTCCTAAATGGCCTTTCTCCCAGCCAATAGACTGAAGCAAACCATGTCCCTGATGCT 

TCCTGCCCACCACCATCCATTCTTTCAACCAACAGCTTCTCCGGGCTTTCACGTGCCATG 

CACTTTGCTAAGAGCTTGAGGGAMTACAGCAATGAAGAAGAGATGATTTCTGTTTGGAC 

CCAGT GCTCAGGGATTCCTCCTATTAGGAAATCAAAGGAGGCTTCCAAAAGCCCTACAAT 

TTTTTCCCTG NTAACTAAAG A 

Sequence 2006 

AGGTACGCGGGAGGAAAACACAACATGAACAGGCAGAGTGCACACGCCGTGGCCTCGGGA 

AGCCACAATCATGTGGCTGACATGGTTCCAGAGTGTGGCATTAGGGAGACCAACTTCTGA 

GCTGCAGATTCACAGCATGGCCATGCAAGCAGGGAAGATGTGCTCTACTCCAGGATCAAG 

GAGCAAGCCCATATTTCCTCTTGATTCTGGGACTCCCTCTTGAGACTGATTTGCTCTAAT 

GTTAGCTCTTGAGAGAACTGGGGTCCCTGCCTTCTTGGGGAGCCTGAAAAGMGTTTTGC 
T 

Sequence 2007 

GAGCTTCTGAACATTTCTTCTCAGACTAAGCTCTTACACACAGTTGCAGTTGAAAGAAAG 

AATTGCTTGACATGGCCACAGGAGCAGGCAGCTTCCTGCAGACATGACAGTCAACGCAAA 

CTCATGTCACTGTGGGCAGACACATGTTTGCAAAGAGACTCAGAGCCAAACAAGCACACT 

CMTGTGCTTTGCCCAAATTTACCCATTAGGTAAATCTTCCCTCCTCCCAAGAAGAAAGT 

GGAGAGAGCATGAGTCCTCACATGGAAACTTGAAGTCAGGGAAATGAAG 

Sequence 2008 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCCGGCCGCCCGGGCAGGTACGCGGGGAAAA 

ATCCAAGACAAGAAAAGACTGTAATAAAGCTAAATATAACAGTATAATGATACCAGGTTG 

CAGGCAGCAAATAGCAAACTTGAAACCATGGAAAAGGGAATTCATATAGTAGAGGACAAA 

TTTAAGGAGGGTTTCAAAAGTGTMGAACAAAAGACTTAAACTGATGAGCTAATAGTTGG 

ATGACAGGTCTTGGAGATCCCAATTCTTTGGGAATAACAGTTTCTGAAGGAAAAAACAAC 

A MGA GAACAACCATGTTCAAAGGGAACAAAGGAAAAATAAATCTTTCAATTAAAGGAAA 

GTTTTCTGCTTGNAAAATGAATGACCATTGGGTTAAGAGGGTAAAACTTATGAA 

Sequence 2009 

TCACAAAAAATCGACGCCTCAAGTTCANNAGGTGGCNGAAACCNCGTACAGGAACTTATT 

MNAGCATACNCANGGCGGTTTTCCNNCCCTGGNAAAGCTTCCCCTNCGTGGCGCCTCTT 

CCCTGTTATCACGACNCCTGGCCCGCNTTTACNCGGNATACCTTGTNCCGGCNCTTTTCC 

TCCCCTTTCCGGGGNAAAGNCCGGT 

Sequence 2010 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAACGATGCAC . 

ACAGATGGGAGGAGCTGCCTTGAGCGAGAGGACACTGTCCTGGAGGTGACAGAGAGCAAC 

ACCACATCAGTGGTGGATGGGGATAAACGGGTGAAACGGCGGCTGCTCATGGAAACGTGT 

GCTGTCAACAATGGAGGCTGTGACCGCACCTGTAAGGATACTTCGACAGGTGTCCACTGC 

AGTTGTCCTGTTGGATTCACTCTCCAGTTGGATGGGAAGACATGTAAAGATATTGATGAG 

TGCCAGACCCGCAATGGAGGTTGTGATCATTTCTGCAAAAACATCTGTGGGCANGTTTTT 

GACTGCGGCTTGCAAAGAAAGGGATTTT 

Sequence 2011 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCCCAGCACTTTAGCCC 

TGTGTGGCCAGCGCCCCATGCGCAAGCGTCTTTCTGCCCCAGAGTTGCGGCTGAGTCTGA 

CTAAGGGGCCTGGAAATGATGGAGCTTCACCCACCCAGTCTGCACCTTCCTCTCCTGATG 

GCAGTTCTGACCTGGAGATAGACGAATTGGAGACACCTTCAGACTCGGAGCAGCTGGACA 

GTGGACATGAATTTGAATGGGAAGATGAACTACCCCGGGCAGAGGGTCTGGGCACCAGTG 

AGACAGCTGAAAGGCTGGGCCGAGGTTGTATGTGGGGATGTGACTGGAGAAGATGGACAT 
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CACTGGGAGGGGGTGTTCCGAATGGGGACCC 
Sequence 2012 

GGGGACCGCCAGGGGCCTCGAGAATCGGTATCCTGAGTCCTCTTGAAGAGCAGTAGAGGT 
TGTTTCATTA AGTGC AAACACATTGTTCTTAATTTGAAAACTGTGGGCAGAAACAGAAGC 
CCGAG ACTAA I < I I ICCATTGCTAACTCTAGATTCTCGGCCACTGGAGTCTGAAGATACT 
CTCTTTGAGAATGCATATTATTTTGCTC ACAGCTAAMCATTTAAGTATC ATAG C TGATC 
AGTGGAGTGAGATTAAAAGGTTTCTTTTTTGAATCATCAGCTAGAGAT 
Sequence 2013 

CCGGGCAGGTACGCGGGATGATTAATCTTCCCTTATCCACAAACTTAACTGTGGAGAAAC 
AGGAGAAAGTGTGTGCCTAGAAGCTGGATTCAGGGACCCATGGTTACCCACTTCTACTTA 
TGATTC C GTCATTACTGTTGTTTTAGGAAMTAMCAGATCTTGATTCTTTC ATAAAAGT 
CGATTCTTCAACAAGCAAATGGGAAAATCGGCAGGCCCAGATATGTGTGTATAGCAGCTA 
CTGGTTGGAAATTTGGACACAAAAGTCTTTATCTACCCAGCCTGTGAGCCACAAATCTGG 
AC TGAG TATAAAAAAGAATAAAACTAAATACCAC CATGGTTCTCACTTATTAAGTGGGGA 
ACTAAA 
Sequence 2014 

CCGGGCAGGTACCCTGGATCGTTTCAGAGATCTAACGTGTTGCCACACCATANCTCTAAG 

AAGCTGTGGG GCACAG TTCAAATTCTTANTGGAGTGTATGATTGCACAANGAAATACTTT 

AAATATGTTTACTTTTTMTTTCTAATNGACTAAGAAAATCAGCNAGCATAAAAA 

nrm ianaaggactccccaaaaattttgcatg i rrm i ccaggtn i i i < ia ttttta 

ATTTAN NTNTG NTANGGGGCATTAAATTAAATNTN GTTGGCC AN AATTTTAAAACNTTAN 

TTGNNTTACCCNTTCGNGCCCCGCCTTTCTTATGNAAAACCTTNGATTNGGGAAATTCAC 

CCCCCCNGGGGGGGNNTTGGGCMNGGGGTAAAAATTTTCCNGMTTTATTTNCAAAAGG 

ACCCTTTTAAATTCCGGTNAATNAACCACNGGTTCNNNAATCCCCTTTCTATGAAANGNG 

NGGGGGGNGGGGNCCNCCCCNGGTGTTAACCCCCCAAAAACC 

Sequence 2015 

CCGCGGTGGCGGCCGCTTTCTCtTACTGATAGTAGGATATTTCTGCTTTAGTTATTGTCA 

CCTTAAATATATTTTCAATGTTGAMTCCTCACAGCATGTTTGATGAMTCTAG 

AATTTTCTTAGGTATATTTCTGTCACGTTGGCATGATAACAAATGCAATAACCCAAAAGA 

CCCCAAMGCTAGTGTAATCCCTTTTGCAATCCAAGCATGAGGATTCATCTTCATGTTGA 

CAGTGCGTGAATGTTCGGTAGGCTTTGTCAAGCTTGCATACAATAAATTATATTATGTCC 

CTTTTTCTTTTAGGGGTCTCCTGTTTAAAGGATGGGTCTTCTGGAAGGGCTAACCTGCGG 

GAATGGAAAAGTTT 

Sequence 2016 

AGGTACMTTCCTTTATTCCAAGGCCTGCTAGAATCTTACATGTTTATAMTTGCCTTTA 

TAAGCTGGCATGCCTCTTGGAGTGGGGACAGACATTTG 1 1 I 1 1 CATGTGGTATGTCGGGA 

GTATGAACTGGAAAGACCGAAATCAGTTATAATATGTCAGCATGGAATTGATCGTCGGTT 

CTGTGAATCCAAGTTGAGTTGTATTCCTGGGCCTTGTCCMCTTCAGATGATTTGAAATA 

TACTATGATTCGTTTAGCAGTAGATGGAGCCCGATA7TTACATTGTTTGAGCATGGGTTG 

TGCCTACAAATATAAAGATGGGTGCAATTCGAGTTGCCAAAACTGTAATCTC 

Sequence 2017 

CCGCGGTGGCGGCCGAGGTACGCGGGCGGTCTGAATGGAGGAAGTTGGTAATACTTGGCT' 

rTTTTTTTI CTTCTCTCATGCAAGTTTCCAGGATATAGCTTCTTTTTTTGCTGTATGTAA 

TTGAGACCAAACTCTAAGCTTGGGGCCCTGGAGAGAACAGTTGTGGCCTGCAGGTCATTA 

GTGAGGTGAGCAGAAGGGAATCCTGGACTTCCCAGAGAAAGAGGAGACCATGCCAGCCTC 

GTGGAACACAGCTACCAGGAAATACAGAAGCCAGGTTCTATTCCTTGCCGAACTCCCGGA 

AGTCCATGGAAAAAAACCGGGACCCACAGTGGGGGTGGGCCTAAAAATGTCCCAGCCGCC 

AA 

Sequence 2018 



Page 286 of 635 



WO 01/51628 



PCT/US01/00798 



Table 1 



CCGCGGTGGCGGCCGCCCGGGCAGGTACCAGGCTAGGCAGCTCTGGAGAAAGCAGAAGTG 

GATAAATAAGGTGTGGACTCACCAAAGACAGTTCCAAAGTCAATTTCACTCTGACACACT 

CTCTGTGATCTTCCACAGTCAGCACAATGCCTGCCCCCTGTGGGTGTTGTATAAATATTT 

GTTGAATGAATGAATCAATCATTCAACAGACCAAGGCCAAATCAGAACCCCAAACCCTAA 

GGTCTTTATACTCTCACTGTCCATCCATCGATCTTCCTGTCAGAAATCAGAATATACCTT 

TGCAATACCCTTTGCTAGCCTTTCAGTTATCTTTTGAATAGAGGCTCTGAGCCTTGAAAA 

TATTGCCTGGGAAAATATTAACACCCATT 

Sequence 2019 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATGCATACTCAATGTTTAGTTCC 

CACTTATAAGTGAGMCATGTGGTATTTAGTTTTATTCTTTmATACTCAG 

TGTGGCTCACAGGCTGGGTAGATAMGACTTTTGTGTCCAAATTTCCAACCAGTAGCTGC 

TATACACACATATCTGGGCCTGCCGATTTTCCCATTTGCTTGTTGAAGMTCGACTTTTA 

TGAAAGAATCAAGATCTGTTTATTTTCCTAAAACAACAGTAATGACGGAATCATAAGTAG 

AAGTGGGTAACCATGGGTCCCTGMTCCAGCTTCTAGGCACACACTTTCTCCTGTTTCTC 

CACAGTTAAAGNTTTGTGGGATAAGGGG 

Sequence 2020 

NTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTGGTAGTCCCAGCTACTC 

AGGGGGCTGAGGTGGGAGGATTGCTTGAGTCTGGCAGGTCGTAGCTGCAGTGAGCCCAGA 

TGGCACCACTTCATTCCACCCTGNGTAACANGAGTGAGACCCTGTCTCAAAAAAAAGAAA 

AAAACATACACACAGACATATATGCATACATACAAGGAGCAGCCACTACCCTTAGGGCTA 

AGACAGTGTGTCCAAGAATGAGTCCCCCATTTCCCTACCACCACCCCAGGGCTTGATNAT 

TCATACCCTGGTTTAGNAAAGGGGCCTCAGTTGTGGGCTCACTGGAATGGCCAGAGNTAA 

TTGCNTGNATGTGCCTTCTCTGGGGGAAAGCGTTG 

Sequence 2021 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTAAAAMTTTCAATTAACCCAAA 

AAAGGCAGGAATGGAGGAACAAAGGAACAAAAAAATGACAAATAAAAAACAAATGACAAG 

ATAAGACACTTAGAAAACAAGTAGCAAGATGGGGGATCTACAGCAACCTCATCAATCATT 

ACTGGAATTGCAAATGGAGAAACTCCATGTGCAAAGCAGGGATTGTCACCAAGACCCAAC 

TCCATNTACAGACACCCACGTAAGATACAAAGACAGGTTGGGTGTCTAAAGGATAGAAAA 

AGATCCACAGTGGTCCAGGAGTGGTCCCAGCATTGTGGTAGGCACAAGAGTGAGGATCAC 

TTGAGTCTTAGAQTTTTGACCTGTAATCCA 

Sequence 2022 

CACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGNCNGGCCNNGTNCTTCTN 

TTTTNCATGCTTGATTTGTGAGGCTTATTAGAAAACAATGGCATTTGACGTTTCAT^ 

GACTCCTTGA7TTCCATTGTTTTCAGGAAGTTTTCATCTGCTGAAGGCAACTAATCCACA 

ATGAGCAAATTATCCTGCCCTGNAGGGTAAGCTATAATATCGGCCCCATGCTTCAGGAAG 

AGAAAATCATGTGTTCAGGGGTGACTTTGAGTCAGCTCAGCATTCTAAATCAAACCGCAA 

A CAGT ANCAATCATTATTGGCATAGAACACCTTAAGCAAAATTGGGTCATTTTACNTAAN 

GTTTTCTGGCTTTCGTGGGAGAATTAATTCTGCCC 

Sequence 2023 

CTATAGGGCG AATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTAC I HIIUIH 

m i t i i ii m iGGAGACGAGGTCTTGCTATGTTGTTCAGGCTAGACTTGGACTCCCAGCC 

TCAAGCAGT CCTAC TGCTTCAGCCTTCCAAGTAGCTGGAACCACAGGCACACACCACTAT 

AACCAGTTTATTTTACATTGNTCTTGTAATTCTGGTATCAGTGATTCTTATAACAATATG 

CTCTCAATTTATTTTMGCATGTTGAGACCTTAATGATAACTATTCAGAMC^ 

ACATTTTGATCTTGAAATCTTCCATCGTTTTCCATC 

CCATTCGCATG 

Sequence 2024 

CCGCGGTGGCGGCCGAGGTACCCTGCTGAAAGATTATTTCTAACAGGCTTGTAGAGAAAC 
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GTCGGTTCATGTAMTTAGAAATTATGGGGCCACTTTGCCATTCTTCACACCTGCAATGA 

ACAGGTGTTTATCTGCAGTTCTGACTTATCTCTTGAACTCCATTTGCATGTTATAGTGGG 

ATGCAGCTGATGCCCTGTCCAGATCTTCTTCAGGCCACTACATCTATATATGCATTCATA 

TTCCAGTGGCTGTGAGTGTTGGCTGTTGGTTGACAGAGGAGCTGCATCCTCCTGGGAGGG 

AAACTGGAACTCAGCCTGGATGAAAAGCCACCCTGGTCCTGGGAGGTGAAAGNCATCTTC 

CAAATGACAGC 

Sequence 2025 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGAGACCTGACGC 

TGGGAGGAGATGCTGCCACCTAGGTTACTTGTAGGACCCTATACGGCAACCTCCTTTGCC 

AGGAACTATTTATAAACATCCTGCAGGAAAATGTGAAGTAGAAGAGACAGGGATATCCCA 

GAAGGTTATGCAAAACATCAAGAGAAGATGAGAGGAGTCTATATGTCAGAATACACATTT 

CCCACCTTGCCCAACAGTAGAAAAACATAAAANAAAAAAAAAAAAAAAAAAAAAGTACCT 

GCCNGGGCGGCCGCTCGA 

Sequence 2026 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGACGCGGGAGTTTTTCAGAAT 

CAGGCCTGCTTTGTCCCTAAATGCTTTCTTATTTCCCCAGAGCCCTTATTGGATTTAAGC 

CCCATATCATATCTCACATACCTGAGACAGACATACCACAGTGAACCCTGCATGTCCTTG 

T MGT TCTTTTCCGTGCCTAGMGTTGCAGAAGACCAAGGAACCTTGGAAGTGACTTACT 

CTTTTGCCTCATAAAGTTGCTTTAAMCA 

ACACAI I I I IAAATTTTAAGGTTAATTTAGTCATTAATTCTGTCAAACAAGTGTTCTTTG 
GAAGCTAGTTGTCTTCAATGTCTTTTACTTGGAGTTACTTGGMGGTTTGCTTAA 
Sequence 2027 

GAGCTCCCCGCGGTGGCCGGCCGCCCGGGCAGGTACAGATGTAACTGGGAGATACCTGAA 

TGTGCAGTCAGTGCCAAGTGGAGGCCTAGGTAGGTTTATTTGGTTATTGGGAAAAGGAGT 

GGCACCAGAAAATTGGAGGACTAGAGGACAGTTGGGTGAGAGCAGTTTAGTTTCTTTGAT 

TCTGTGCTAACTTTTTTGGATATTTGCTGGAAAATGCAATTTATAGAGGAT^ 

TGG CTATGGAATGCATTTG CTGTTTCTTCTCTTTTATACGTAAGATAC ATC TGTG AG ACC 

CTCTACAGGAGATGAATTCCTGGTGTAAAAGAGTCATGTGAATATTGTGGAGTAATTATT 

CTGAGCCAGGGGAGCAGGCTAATTAGCCTTCTGGGGAATG 

Sequence 2028 

AGGTACAGAGAAAATAAAACTGACATAmGAGTAGCTTTAAAAAAAAAAC I I HH1IA 

AAAAAAGGCTGCATGATGAATCAGTTATGTGGCTTTGGACCAGTAAGGAGTCAATTTCAA 

CCCAGACTGTTCTGACTTGGAAGGCCATTGTGATTCCATGTCTAACTCCAGAATATTGTT 

TTCACAATACCATMCATCTTCTAAACTGMCAAAATACTTGCACGAAATAGTTGATTCT 

CTTCTTCCAATTTCTTTTCATTCATGGAAAGGGAAAGMGACACATTGMTTAAGCATCC 

AACAAATATAAATACAGCATTAACATTTAMTGCTATCTCAGCAAATATTTTAAAGGGAT 

AAACCAAAAG 

Sequence 2029 

CCNCNGCGGTGGCGGGCGCNGNCNCGGGGATCTCACTCAATCTTACTCCCTTGTAGAACA 

GGCGATATCTTCACCATGCGCACAATGAAATCAATACTCANAGAAGGCAGATNATTCTCC 

ACGAANCCNGANAACTAATAAATGAACAACTTGGGTGAAATGTCCCACCAGACGGTGTGA 

TATTTAGTAGCCCAGAAAGCTGCCAAGGGGTTGAATGACACTATCTGAAGATATGAACCA 

GTTTGCTCTCCATAGGGAGGATTTCATCAACAGGAAACAGATGCCTGGAAGGCATTGGAT 

TTGCTAAGTGTCTATGCCATATGATTCTGCTGTTTGCGTTTGATTTAGAATGCTGAGCTG 

ACTCAAAGTC 

Sequence 2030 

CTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCGTTCCT 

CCAGTGCCCAGAGATGCTCTCCGCACCAAGCCACAGATGTGGAGGAGGCAGATGTGATAC 

TCTAACAATGTCTGAAGCTCTGTCACTTGGGTGCTGAATCCTTTTCTATACTTTCTCAAT 
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ACTGGTCAGGATTTTACGGTGTTGTCTAGGAGAGAGCAACCATGATGGGCATGCAATGCT 
C AAGTTC ACAGAAAMCATGAATAGCAAGACTCAAATCTGGCTGCCAGAmCTGCAAGG 
TA fill \ CTCTC CAMCG AGGTAATGTGTTTCTTATAACAGTCTTTATAATAAGG AG ATG 
TGTGTGGAAAAAAATCAAGTGTTAAACGGAGTCATGCAAATTAGCCATCTGGATAAAAC 
Sequence 2031 

CCGCGGTGGCGGCCGAGGTACGCGGGGGCCCTCAACTTCTGAGAGCTTAAGGTATTTGTT 
CATGTTAGGCAGGTAATGTCTATAAACCAAAAGCTGCCACATACTATCTCTCTGCTCAAC 
ACTTTACTGGATACTGGAGGATTTAAGGGAAGGAAAAAAGTGACCTTT^ 
AGAGCTTGTAATCGGTAAAGGAGATAAAACTCACACAAAGGTCAGGGTATATTTTGTGCT 
AAGCATATGACGTAGACATTTAAGTGCTGCCGTAATGCAGAAGAGGAAGAAGTGGTGTGA 
TCTAGAGTMCATAGGTTTTGTTTTGTTTTGTTTTTGTTTTT1 TCI M M C I I I I ICTTT 
TTC I I I I ICTTTATCTAGA 
Sequence 2032 

CCGGGCAGGTACCCTTCACTCTTTAGGAGAAGGTTTGAAAAAAAACTGTGTTTCCTGTTG 

AAGGCCAGTGATACCTGGCAGCTTACAGTTAGAAAACTCTATAACTTTATTATTTTCATT 

TGCACTTAAAACTGCTATGAGTTTTTTMTGCAAAAAATAGTGATTGAGAMGAAA 

TGAACATATT TATAT TCTTCCCCCTCAGAGAGAAACACTGTTAACATTTrGTGTATTTCG 

GTCCAGGTTCTTTTCTGTGTGTGTAAGCACACATATGTGTTTATATTGTATAAAATAAGA 

GATTCATGCTGGACATACATTGTTTGTAACTTCCTTCCTTTCCTTCCTTTCTTTCCTGAG 

CTAGGGTCTCGCTCTGT 

Sequence 2033 

CCGCGGTGGCGGCCGAGGTACGCGGGAAGGTTAAGCATTTGTTTATTAAATAGCACAGAT 

TTCCACCCCCATTTGGTTTATCTAGTTTCCATTTGTAGTAACTAATAGAAAAAACTCTAA 

AATGCATCGGGAGGGAAACACATTTAAAGCTCTGGAGGCATAAAAACCTGCATATAAATT 

CA GGCT GCACCGATTGATAAGTGTGTTACTTTCTCAATTTACTTATTCTTTCTTAACCTT 

AGTTTTCTCATCAGCAAACTATCTTAATTACATGACTTTGTAATCCATAAAGCTTGCTTC 

AAGCTCTTAGCATGCACACAGCACATCTGCAAAAGTAAGCATTTGCCACTTTCGGTTCTC 

AGGATAACATATTCACCCTTCCCTACCTACTAGCAAG 

Sequence 2034 

CGAATACGACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCAAT 

GAAGAGTGGCTACTTTTATTAGATCATAAACAGTTTCCATGTTACTAACCAGACTTTACG 

ACTATTAAGATAGCATCTTAACCTACGACTGTATAAATATTTTATGATCAATAAATCTTA 

ATTGGTAGACTCTGAGAATAAAGATGGTGTTTTTCCAACCCCTCAATATTTGCATACGTA 

AAAGTATTTAGCTACTATTTTAAAGCACCTAACCGGTATCTGATGTTAAGTGCATGCAAT 

CATGAAACATCTTTATGAAACACCTAACAGGATATGGCATTACATGAAGCTTTTATAAAA 

GCAATTAAACCAAAATAGCCTTTACCTCCTGGCAACTTAAACTCCAGCCAAATATTGATG 

CAGACTA 

Sequence 2035 

G CTTTG CTCCAGGCCTCAGGAGTGTAAGTTTAAGGGCGCGAACCTGGGTGGGACGGGGCA 

Gi I I I iCCAGCGGGCCATACGGGAAAATTTTTGGTTCGAGGAAGACMCCACCCTTTTTA 

AGGAGAAAACTGCATCTTGCCCTGCGTTATTCCTACGCGGTGCCCAGGTGGGGTGTGTGT 

GGACCAGTCAATGACCGCCCAAGCTCTCCGAGTAGAAAACCCAAACATGGTTTTGTGGGG 

TGTGTGCCTTTGACCCCGGACTTAAGCAAAAAGCGTGGTCTTGGGCGTAGCTACGAGGTG 

GTTGGTGGGCTCCAGGGCCTGCGTGCCCCTCGACGTGAGCGATGATTGG 

Sequence 2036 

AGAAGCATTCAAGTCAGGTAACGGGACAAACAAGAAAGAATGTAGGCAGGTAATTCTGGA 
GACTCTGAGAAGAACAGTTGAGTGAACTCACTTCAGATGCATGTAGATTATTTGCACTCC 
ANAATGTAGACTGTGGGCACTCCAGAGACTACCAGAATTGGGGGAACCACAGTGACTCAT 
TAGGTTTCTCTGTTACAATCAGGCATGAGAAATATATATATTTTGTTGATGTTAGCTATT 
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GAAATGAGATATATTGGAAACTGAGAACATTGGCTTTATGTTTAAGGAAGTGTGTTGTTT 

TGGTAAAATTGCCATTCCACAAAAATTT 

Sequence 2037 

TCACTNCTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGGCGCCAGGAAGCCGCCAT 

CGCGGATATTCAAAATGGACCAGGGAGCCGCGAAGTATGGTTCGTGCCAGCGCAGCACGA 

ACGCCAGGCGCGCCGCCAGCAGGCCGGCGATCAGCAGGTCGGAAACGAGGGGGCCGGCGT 

CGGGCAGCTGGCGGTAGCGCCGCAGCCACCACGCCACGCCGTAGGCGACGCACAGGGAAA 

GGAACAGCAGTACCAGGCCCGTGGGCAAAGCCAGGGGGCCGATTTGCAGGGACAGCATGG 

GCGCTCTCTATCACGTCGAATGGACGGCCATTGTGGCCCATGCG 

Sequence 2038 

CCGCGGTGGCGGCCGAGGTACAACTTGACAGTTTATAAAAATGCTAACCTATATCAACAT 

TTTTCCCCACCAAAGTGTTCCCMGGGCAAAGGTAAGTGAATACATGTTTCCATGCTTGT 

GTCAGGGGCACACAGCAAGGGAAGGCAGAATGGGCATGACCTAGATTAGATCCCAGAAGC 

CTTCAAATTCCTTATAAGACTTGGGACTGCATCTTAAAATAATATTG 1 1 I 1 1 1 1 [ I A TTC 

TTTGATAATAAAAGMTGGAACCAAATCCTGGTTGTCTGATTCTTTGCCCAGTGCTCTTT 

CTTATGTCTAACACTGCCTCGTCTTATGAATAACtCTGACATTACAGAGAAAGACTATTG 

TTTGTTCAAGTTCAACACT 

Sequence 2039 

CGACTCCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC 1 rTTTTTTTTTI 
rTTTTTTTTTTTTTTn I AAGCATATCCTCACTTATCTGATGTAAATGACCAAATACCTA 
ATTCATGCAAGGCTTAAMCCTAGGNGACGGGTTGGCTTAAAACCTAGGTGACGGGTTGA 
TAGGNGCAGCAAACCACCATGACACATGAATGCTTATGTAACAAATCTGCACGTTCAACA 
C ATGTNTTTAAN AACTTAAAGTAAAATAA I I II 1 1 1 I AAAAATGCAAAGAAACCATAGAT 
TAMGAGTCAACAAAATAAAAAGAAAACA^ . 

G AAAG AAGG AAC AG AG ATTAAAAAATNTGGG ATAAATN CAAAACTAATAGAAAATTG AT N 
TAAI Mill 
Sequence 2040 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGGGGGCCNTNGAGACTGNCNTGGN 

AGACTGGNNAGGTCACGTNTCTNAGACTTCTGGNGAGACCATNCANGGCTTTTGGCTCTT 

GACANAGATNGACCACTGGAACACTGACNAANGAGAGAATTCTACTGGTCACANACAAGA 

CTCTCTTGATCTGCAAATACCACTTCATCATGCTGAGTTGTGTGCAGCTGCTGCGGATTC 

CTCTGAGCGCTGTCTATCGCATCTGCCTGGGCAAGTTCACCTTCCCTGGGATGTCCCTGG 

ACAAGAGACAAGGAGAAGGCCTTAGGATCTACTGGGGGAGTCCGGAGGAGCAGTCTCTTC 

TGTCCCGCTGGAACCCATGGTCCACTTGAAGTTCCTTATGCTACTTTCACTGAGCATCCT 

ATGAAA 

Sequence 2041 

GGGGCCATTGAGACTGCCATGGAAGACTTGAAAGGTCACGTAGCTGAGACTTCTGGAGAG 

ACCATTCAAGGCTTCTGGCTCTTGACAAAGATAGACCACTGGAACAATGAGAAGGAGAGA 

ATTCTACTGGTCACAGACAAGACTCTCTTGATCTGCAAATACGACTTCATCATGCTGAGT 

TGTGTGCAGCTGCTGCGGATTCCTCTGAGCGCTGTCTATCGCATCTGCCTGGGCAAGTTC 

ACCTTCCCTGGGGATGTCCCTGGACAAGAGACAAGGAGAAGGCCTTAGGATCTACTGGGG 

GGAGTCCGGAGGAGCAGTCTCTTCTGTCCCGCTGGAACCCATGGTCCCTGAAGTTCCTTA 

TGCT 

Sequence 2042 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGTAGCTTGCAGGGGTGCTATG 

TGAAGATGGCGGAATTCCCMCGTGCTTCTGCAAAATTTTATGTATTCATCCCCCTCTGC 

TCTC AGTAGGGTGTTACCACCTGCGGATCCTTTGCTTTCCCCGCTGGATACTAACAGCCT 

TTTTCATAATTGCTAGCACCTATGGAATTGAGTGTTGAACTGAACAAACAATAGTCTTTC 

TCTGTAATGTCAGAGTTATTCATAAGACGAGGCAGTGTTAGACATAAGAAAGAGCACTGG 
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GCAAAGAA TCAG ACAACCAGGATTTGGTTCCATTCTTTTATTATCAAAGAATAAAAAA^ 

ACAATATTATTTTAAGATGCAGTCCCAAGTCTTATMGGAATTTGAAGGCTTCTGGGGAT 

CT. 

Sequence 2043 

CACTACTTAGGGCGAA7TGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGG 

GCACAATGAAATCAATACTCAGAGAAGGCAGATAATTCTCCACGAAGCCAGAAAACTAAT 

AAATGAACAGTATGGCTCCACATAAGGTAGAGCGTGTCTGGTGCCCTGCCGTTTGGAAAT 

GACCCAAAGGCAGGAGTGGAAGATGGAGTTTCAGCAGGAGAGACTGAGGATCTCAGAGGA 

TTTGCTCTGGGATGCAGAGACGCTTGCCTATTTGCTAAGTGTCTATGCCATATGATTCTG 

CTGTTTGCGTTTGATTTAGAATGCTGAGCTGACTCAMGTCAAACTATTTGCTAAGTGTC 

TATGCCATATGATTCTGCTGTTTGCGTTTGATTTAGAATGCTGAGCTGACTCAAAGTCAA 
ACTGTAAGTA 

Sequence 2044 

ACTACTTAGGGCGAATT GGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTTTTT 

imtnmimm IGGGACAGCTTTCTATTCTGTCACCCAGGCTGGAGTGCAGTGG 

CACAATCTCAGCTCACTGCAGCCTCCACCTGTTGGGCTAAATCCATCTTCCCACTTCAGC 

CTCCTGAGTAGCTGGGACCACAGGCACNCATCACCACTCCCGGCTAATTTTGTTTTTCAT 

A GCAC ATTACATTACCGATTTCCCAGTTTATAAGTAACCAAGATGCTCAAACAAGCAAAT 

ATTTT CCTTGA GTATGACTTACTACAAAAAAGAGAAAGAACAGAGAAATTAGTTCTGAAA 

GTGCAi i i I iNTAMGCCCATTTCAGTATTTGATAGCTCANATCATATANGAAGTTTTAA 

TCAAAATGCTAAAG 

Sequence 2045 

TCACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTCCTCCCTATG 

CATCTGCTGTGGGAAGTGTGGGTAACTCACAGATGATGCCACAGGGCATGTATTCAGGCA 

CCAC AGG CAGCAGTGAATGAATGAAGTGAAATGGTATTTATCCATTTCCTGG AAAGG TC C 

AGGGTTTGGCTCTGCAGGGCCAAGAGAACAGCTTTAGCTGTGCCTTAACCCAGTCCTGGA 

GAAGCCAGCAGGCCGTAATCACGGGGAGGAAACCCATCTTTTAAGGGCTCCTCGCTCAGG 

TGGTGACAAGGTGAGGTGGTCATCTATGCTGTCTTTATCAGTATCTGTCCTAAATACTGT 

GCTCTGACATTTGATGCTAATATTCCATATTATCAGGGCTTCTGTGGTGTTTAGGCCTCT 
AATTTTTCTCTTC 

Sequence 2046 

CGACTACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGTACCGGCGCCGCTGGCG 

CTCAGGACCGCACCCGCGTTCAGCGTGACGTCGGCGCCGATGCCGCGATAACCGAGGTCG 

AAGCCGCAGCAGTTGGCGGCCAGGTTCGGCAGGTCTCCCACCGTGCCGGCGCCATTGCCC 

TGATTGATCAGTTTGATGGTGGCGCCGGTGATGCTGTTGACGCTGATGGAGCCGGCCATA 

TGGTTGGTCTTGTCGCTGCCGTCCGCCATCAGCAGCGACACGTTGCCGGTGCCGGCGTCG 

ATCTTCGCCGCCACGCCCTGCTTGTCGACGCCCATGCTGAAGGGTGGGCGATGCCCGGCG 

TCGCGGTAGGCCGTCNTTGAACNCCCGTTTGGCCAAANCTTGTNCGTTTGGGCGGGT 

Sequence 2047 

ANGTACCGTTCCTNCAGTGCCCAGAGATGCTNTCCGCACCAAGCCACAGATGTGGAGGAG 

GCAGGTCTGGACTTNAGGCAGACAAGGAACTTCAGAAAATCCTGGGGAGAGACAGCACAG 

GGTAAGAGATGGCCTGTGAAMCTGAAGGCAATTTGGAAGGCATTTTTAAATTCCGTGGA 

GCCACTGGAGAAATGTTTACAGGAATGATGCCTTAAAATTCCTCAGTGGTAAGATCTGAN 

TACCTGCCCGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCGGGCTGCAAGGNAATTTCG 

ATATCAAGCTTATTCGATACACGTGCGACCTCGAGGGGGGGGGCCCCGGGNACCCANGCT 
TTTGG 

Sequence 2048 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGAGACCTGACGCTGGGAGGAGATGCTG 
CCACCTAGGTTACTTGTAGGACCCTATACGGCAACCTCCTTTGCCAGGAACTATTTATAA 
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ACATCCTGCAGGAAAATGCAGTGAAGTAGAAGAGACAGGGATATCCCAGAAGGTTATGCA 
AAACATCAAGAGAAGATGAGAGGTCAGAGATGGGAAGAAACAAGAACTTTGACATGCTTG 
GTGTTCTTGCCCAAGCTTTGAAGAAGTTTACAAAGTCTATATGTCAGAATACACATTTCC 

CACCTTGCCCAACAGTAGAAAAACATAAGAAGAGAAAAACATTAAAAAATGACAAGGGAA 
GTTTAATGGGAA 

Sequence 2049 

CCGCGGTGGCGGCCGAGGTACTTACAGTTTGACTTTGAGTCAGCTCAGCATTCTAAATCA 

AACGCAAACAGCAGAATCATATGGCATAGACACTTAACAAATCCAATGCCTTCCAGGCAT 

CTGTTTCCTGTTGATGAAATCCTCCCTATGGAGAGCAAACTGGTTCATATCTTCAGATAG 

TGTCATTCAACCCCTTGGCAGCTTTCTGGGCTACTAAATATCACACCGTCTGGNGGGACA 

TTTCACCCAAGTTGTTCATTTATTAGTTTTCTGGCTTCGTGGAGMTTATCTGCCTTCTC 

TGAGTATTGATTTCATTGTGCCCCCCGCGTACCTGCCCGGGCCGGCCCGCTCGAAATTGT 

TATC 

Sequence 2050 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACAATAAATACTTTC 

AACTATTTTGTTGCTAGCTTTACATATTTrrTTGCGATAAAM 

AATAAAAACTAAAACAAAAAATTCAAACATCCAAAAGTTAACATCCAAAAGTGTTG 

TTCTTCTCAGAMTAGCAATTTGTTCTGCTTCTCATCAGTCCCACTTAGGCTCTCTCTAG 

GACTTAACTGAGCAACTCCTAAGATCCAAGTTCCCTTAAATTCCTACAGGCAGTGGTCCC 

ACCAACAAAGACTTCTCTACTCTTCCTTTGCCTCCAGATGCTCAGACTTGGGCTTCTGGT 

ATTCTGAAAGGCATCCTTGTTTATTTTCCTACCAGAGATTGTTGGATTTGCCC 

Sequence 2051 

AGGTGGCACGGTGTATGAAGGCCTGTGTTTGAAGACTTAGGTCAGACCTTTTGCCCGCCT 

TTCCCCTATGCAGCTCTATGCAACTGTCTGTGCAAGGTCCTGTAATTTGCGTAGGACGCC 

TTGACACAGTCTCTGCTTATGGACGCTTTACAAAACTATCTTGCAAAGAAATAAGTACCT 

GCCCG 

Sequence 2052 

ATATATTCCACCCTCATGGCAAATATGAAAGATTTCCATTGGATGAGACACTGAACTCAG 
TAACCACAACAATGNGGGGCATAATGCAATGAAACTAAAAACATATTCTTAGGAAAGAAA 
AATGAAAGNGAGGAAGTAAGAAA A I I 1 I I I I 
Sequence 2053 

GTCCGAAAATTTTGGGGTAAGGCTNCTCAACTCCGTCGNGTTGGTGTCCGGGTCTCCGTA 

AGGGGTTACCTCCATTTTGGTGAAGGTNACTTTANTGTGATCCGTCNATTNCTATCNGGT 

GGTCTCTTNACCGGTGTGCNAATTCAANGGGTCTCCTTTNANAAGNCNCTCTTGCAATAT 

AGGNNAAAACCAAATCCAGTCTANTTCNAGNGGGGGGNCCTNCCTGGCTGGGCAAGGNAA 

GNCNCCGTCTCCACCNCCTTGNAAGCCCCTTCAACCCCNGGCTTTTTAMGTTTTTNCAA 

TTMNANAAMNCAANGGCNNCCCTTTGGGTGGNTGCTC^ 

ANGTCCCCAAGGGNTTTCCGGTTGGNTTATTTTNTNCTTTAACTCCCCATCCAMGNTTG 

TTTCGAATGNGGGAAACCTTNCCCCAAGGNCCGGNTTTATGGTANCCCAAGGTCCTANAN 

AANCCNAACCAAGAAAATAACCATTTTGGNTTATCTANCCCC I rTTTTTTTI I GGNAANA 

ANATCCNAAAANAACATCCCAANGGTTCGTNTTCAAGGGTGGATNTAAAAAGGNCCAAAA 

ATAANAACCAANAAAAACCAAAAANCCCCAANCCCAANAATTTCCCTTCAAAAAAACCCT 

CAAAGNCCTCCCGGNCCNGGTAAGGGGGGAAATAACCCCCCCI I t 1 III INCCCCTTTTC 

CCCCGGCNGGGTNNAACCCCCTTTGGGCCCCCCCCGGGGGGGGNCCCGGGGGGGCTCCCG 

GTTCNTTTCNTTTAAAGGGNAAMNCCTT^ 

NC 

Sequence 2054 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGA 
CAATGAAATCAATACTCAGAGAAGGCAGATAATTCTCCACGAAGCCAGAAAACTAATAAA 



Page 292 of 63 5 



WO 01/51628 



Table 1 - 



PCTYUS01/00798 



TGAACAACTTGGGTGAATTGTCCCACCAGACGGTGTGATATTTAGTAGCCCAGAAAGCTG 

CCAAGGGGTTGAATGACACTATCTGAAGATATGAACCAGTTTGCTCTCCATAGGGAGGAT 

TTCATCAACAGGAAACAGATGCCTGGAAGGCATTGGATTTGCTAAGTGTCTATGCCATAT 

GATTCTGCTGTTTGCGTTTGATTTAGAATGCTGAGCTGACTCAAAGTCAAACTAGATGGA 

AGCATTCGACTGCCGGCACAAAGATGTTTGGTG 

Sequence 2055 

TCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGTAGCTTGCAGGGGTGCTATGT 
GMGATGGCGGAATTCCCAACGTGCTTCTGCAAAATTTTATGTATTCATCCCCCTCTGCT 
CTCAGTAGGGTGTTACCACCTGCGGATCCTTTGC7TTCCCCGCTGGATACTAACAGCCTT 
TTTCATAATTGCTAGCACCTATGGMTTGAGTGTTGAACTGAACAAACAATAGTCTTTCT 

CTGTAATGTCAGAGTTATTCATAAGACGAGGCAGTGTTAGACATAAGAAAGAGCACTGGG 
CAMGAATCAGACAACCAGGATTTGGTTCCATTCTT™ 

CMTATTATTTTAAGATGCAGTCCCAAGTCTTATAAGGAATTTGAAGG 
Sequence 2056 

CCGCGGTGGCGGCCGAGGTACTTCTGTGACATGTATCACCAACTCACACAGTGATTCTTA 

CGTTCTTAGGTCACTGCTTGGATTATGCAGCAAGAACAAGGCCATGTAAGGGGTAGGGGT 

GGGGCCAGGATTAGACCTGATCATTGAGAAATGGCAGATGGTAAGGGAAGGTCAGTCGCA 

GATACCTACACTGGTAGGAAATAAAAAGCATATGAGACAGAACANAGTATTACAAATGAA 

GTGTAACAGACCACAGGTCCTGGGGTGTACCTGCCCGGGC 

Sequence 2057 

CCGCGGTGGCGGCCGAGGTACTTACATGGGGACCGCCAGGGGCCTCGAGAATCGGTATCC 

TGAGTCCTCTTGAAGAGCAGTAGAGGTTGTTTCATTMGTGCAAACACATTGTTCTTAAT 

TTGAAAACTGTGGGCAGAAACAGMGCCCGAGACTAATTTTTCCATTGCTAACTCTAGAT 
TCT 

Sequence 2058 

ATAG GGCG AATNGGAGCTCCCCGCGGTGGCGGCCCGCCGGGCAGGTACAATCTCTGGCCC 

TACATTTTCTAAACGTTATGCCACCCTGACCAAGGGGCAACTCCTACAAAGCCAGGCAAA 

ATMTAAAATCATATn*GTCTCTAGTGGAATGGATAACTATGCCTAAAACTGCCCCTTTG 

AAMGCAACTAGAGAGATAATTTCTGAAGTGTTTGTCCCTACCTGAATGTGTGGCAAAAT 

TCTAAACTCCCTGAAGTGTGAAAGTGGTTTNCAAGCCACATGCACATCCAGTTGTGGTAA 

AGGGTGAAAATCTAACTGGCTMGAGGGCTTCATAGCAACATTAACCAAAAAGTGGTTAT 

GTAGACTTTGCCTGCTTCATAATTCCCTAGGGCATTCTATGCTATTCTGTACCTTNGGCC 
GCTCTAGAACTAGTG 

Sequence 2059 

TCTCCGGCTTCTATTTTGCCCACCCGGAGTCGAAATACTTCGTCGTCGGCAAGATCGGCA 

TGGACCAGGTGGAAGACATGGCCAAGCGCCGCGGCGCCAGCATCGAGGACGTGGAACGCT 
GGCT 

Sequence 2060 

TCTCCGGCTTCTATTTTGCCCACCCGGAGTCGAAATACTTCGTCGTCGGCAAGATCGGCA 

TGGACCAGGTGGAAGACATGGCCAAGCGCCGCGGCGCCAGCATCGAGGACGTGGAACGCT 

GGCTGGCACCGAATCTGTCGTAGTGCCATAATATAAGCACAAGGCGCGCGGAACTTATCG 

CGCGCCGNAGCGCACTAACGCAGTATGCCGGTCGGNATGCTGNATGCCCGNTCCGGCCCG 

TTCCCTGACTGGCTAAGGAGGATATATGGCAAGCAATTTCAAACTGAAACGCTGGCAAGA 

TCAAGTGATCCTGTTGCTGGGCCTGTGG 

Sequence 2061 

CTATAGGGCGAAT TGGA GCTCACCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGGTT 
TGTCATACTTTAACTTTTGTCCCTTTCCCAGGTGGCCCATGCTTCTAGGCCAGCCTCCCA 
ATTCCTAGCTTGCTTCATGCTCCTGGTCTTTGCCTACTCAAAACAATCTGTCCTCCCCAC 
CCTGTCATCTCTGTTCAGAGTAMTACCTCCCTCCCTCTGCTTTTGCCCTAAGTTATTTT 
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TTCMCTCACCTAACATCTAACTTGGCTTTGCAGCTCTGTTTCTCCAAAACCCTCTCTGC 

CTGTTTCCTCCCAAGCTATCTATGACCAGTGCCAGAAACCAGAAAATCTTCTTGGAAGTT 

GCACTCTTTGCTTCTCTTCCCTCTCCTTTATCCTTCCACTTGG GAATCTCTCTC CTTCTA 

CCTGTCTCTATCTGCTGCAATCCTGTTCATCTTTTCCAGGTACCTnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnGCTTTTGTTCCCTTTAANTGAGGGGTTAATTGGCGCCCTTG 

GCGTAAAT 

Sequence 2062 

CGAGGTACAGCACTGGGCTTACAGCGGCGCTCMCAGATTTTTTTCTTTTTMGAAAATG 

AATTCACTGAGGTTTAGAGATGTTAAAATACTTGCCCAAGTTCCTGCTTGCCCAGATGTG 

CCTTTGGGAATGCATCTTCTTTCTCAAGCCTCCTTACAGGGGTTGTAGTGAAGATTGGAA 

NGGGATAANGCAGGTG 

Sequence 2063 

GAATTGGAGCTCCCCGCGGTGGCGGCCCGCCCGGGCAGGTACGCGGGGGAGATGCTGCCA 

CCTAGGTTACTTGTAGGACCCTATACGGCMCCTCCTTTGCCAGGAACTATTTATAAACA 

TCCTGCAGGAAAATGTGAAGTAGAAGAGACAGGGATATCCCAGAAGGTTATGCNAAACAT 

CAAGAGAAGATGAGAGGAGTCTATATGTCAANAATACACATTTCCCACCTTGCCCAACAG 

TNGAAAAACATAAGAAGAGAAAAACATTAAAAAATGACAAGGAAGTTAATGGAAGTCAGC 

AATGTGATGGTGTTTGGA 

Sequence 2064 

CGAATTGGAGCTCCACCCGCGGTGGCGGCCGAGGTACCTCTCCTGGGAAGCTCTTCTTCA 

MCATTGGTGACTCAGGGATTTGGGCTGCTTTCATCTTATAGCTCTGTCATCTCTATCAT 

GGAGCTTTCAAGGCCTTGTTTGCGCCAATATCCTCTCCTGGATGCTTGGGAAACACATCG 

ATGCATCCCGTCAGTCAGCAGCTGTTTACGGAACACCAGCTGCAGGCCAGGCACCGTGCT 

GCAAGGCGTAGATATGTTGCTGAACACGACTCCCTTCCAGATGTTCAGGATTTAACAGGC 

AAGTAAACAGAGGGTCCAGAGAAGGCACTCAGCACTGCCTCATGACATCAGGGAGGATTG 

CTCAAAGGAAGTGATGCCTAAACCCAGCTGGATCGNAGTGTAATGGATAGAGGGGTAAAC 

AAGAGCTGATTTCCTAGCGGTCCTGAGTCTCGACAGGTTGCCACCGTCAGGAGTATTGAT 

CTTTTTCTCCCCGC 

Sequence 2065 

CCGCGGTGGCGGCCGCCCGGGCAGGTACI I I I 11 I I I I II I I I I I II I I INAGGNNATAT 

GTATACATAGTAAGTGTATATATTTTTGGGGAGAAGATCATGCATTTCCTAGGACAGGAC 

TAATACATTTAGCTGTGGCTATTATCAAAGAGCACCAAGACCTTAGGTGGGGCAACCAGA 

CTGGCTATGGGGCTTGATGTAAGCCAGACCCATGACCCATAAACTATTTCAGGTAGGAAG 

TAAGCAATACTATAACTTGGACTCGAGGGGGAGCAAAAGGGCACCCCTCAATGGTGATGG 

CTGCCCACTTGGTCTCAACACTGTAAAACTTGGGACTTGCTGCCAAGCTCCACCAGGCAG 

GGGGCAGAGGACACTGAGTCTCCTCCAACTGAGTCAGGAGTGGCTCAGGAGCTGACACTG 

GCCTAAGCTTGCAGCAAAGGGCCCCTACGTACCTCGGC 

Sequence 2066 

AGGTACGCGGGGGGGACAGGCCATCTCGCTATAGGAAAGGAAAGTGGAACAGCATTCATC 

CTCAACATTTTTACGAAGACAAAATGAAGACTGGAGTAGAAGACTGATCAGTGCAGGTGT 

AGCATAAAAGTGTAATCCTGGAAGATGTGGTGTGAGAAGGTAGCACAAGTGAAGCAAAGA 

TACAGGAGATAGGGAAGGAAAGCTGGAAGCAGAGGTCACTGGAGGGAGAGGGAGATGGAC 

ACATTCAGGGCTACAAAGCAAGTTCTATGTGATTTGCTCACCTCTTCAATTGTGGGACCC 

CTCAAAATGTGTACCTGCCCG 

Sequence 2067 

GTACCCGGGACGCTGGGAGGAGATGCTACCACCTAGGTTACTTGTAGGACCCTATACGGC 

MCCTNCTTTGCCAGGAACTATTTATAAACATCCTGCAGGAAAATGTGAAGTNGAAGAGA 

CAGGGATATCCCAGAAGGTTATGCAAAACATCAAGAGAAGATGAGAGGAGTCTATATGTC 
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AGAATACACATTTCCCACCTTGCCCAACAGTAGAAAAACATAANAAGAGAAAAACATTAA 
AAAATGACAAGGAAGTTAATGGAAGTCAGCAATGTGATGGTGTTTGGAGGTGGAGCCTTC 
ATANGGTAATTAATGC 
Sequence 2068 

AGGTACTGACTAGACAATTGAGTCTCAAAGACGATMTTTATCCAACATAATTGTGTGAA 

TAATGACAAAAAAAC ACCC AGTTTTGGGAAAATTACAAATGCAATTTCTAGCAGACACTG 

CTTGGTTATCTAGTGTTTTGAMTAMCMCTCCAAMCTTCGTGGCTTGAAATAAGCAT 

TATTTATCTTATTACAGTTGTTTGGGTTGATAATTTAGACAGAGCTTAGCTGGGATGGCT 

TGTCTTTGCTCCATGTTATGCTGACTTGACTCACTCATCTAGTTGCAGTTAACTGGCAGG 

GCAGCTGGAGCCTGGCTGGTCCCAGATGGCCACACACTTTGTGGCTGGTAGTTGCCTGGC 

ACTCATAAGGTGCCTTCATTTTCCTCCACCTAGCCTATCCAGCAGACTATCTTGGGCATC 

CTGGGCTTCAAGAAGAAAACCGCGATGTACCTGCCCG 

Sequence 2069 

AGGTACGCGGGGGTTTCCTGCGTTTGTAGATGGAAGGAAGAACTTGTGTGCTTAGACCTG 

ACGCTGGGAGGAGATGCTGCCACCTAGGTTACTTGTAGGACCCTATACGGCAACCTNCTT 

TGCCAGGAACTATTTATAAACATCCTGCAGGAAAATGTCAGAGATGGGAAGAAACAAGAA 

^^^^ACATGCTTGGTGTTCTTGCCCAAGCTTTGAAGAAGTTTACAAAGTCTATATGTCA 

GAATACACATTTCCCACCTTGCCCMCANTTNAAAMCATMGAAGAGAAAMCATTAM 

AAATGACANGGAAGTTAATGGGAAGTCAGCAATGTGATGGTGTTTGGAGGTGGAGCCTTN 

ACAAGGTAATTAATGCCCTTGT 

Sequence 2070 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGAGTGACTAAATTTGTAGGAATGATATT 

TTCATGGGATTACTCAATCTCACCCACCATTAGTTGCGGGTGACAAGAAAGCTAAGTTGG 

CAGATGTTTGTGCTAGAAGCTGTGGGTTTACGTCTCCTTTGTACATGTGTTCCAGACATA 

CCAGTGGCTTGGTATTTAAACATCATGCTCAGGTGTGCAGGGTAGTTTTTGAGTTATAAT 

AGGTATGCAGGCGCTGTGGGATTACTTGGTTGTTTATGTAAAAATTATTTTGCACTCACT 

TCTGAAATGAGTGTTAGTAGAATCATCTTTAGAGGAGGTTCCAAGGCATTGAACTGAGAT 

ACCTGCACTGTTTGCTGTAAATTTAAGCTTAAAATTGAAACCAGGTTATCANCATTTCAT 

GCCAAGGANAGAGTGGGCATGAATGATTTC 

Sequence 2071 

CCGCGGTGGCCGAGCGGCCGCCCGGGCAGGTACCTTAAAGTTTGAGTCTCACTTTCCTCT 

TATATATAAGGCTTCCTGAAAGCATTAACAGTGATGTATATGAACTGGCCAATACAAAGT 

AAGACCTCMTAAATGGTGGGCACCTTCTTTTATATGCCAGCTTGTATTCTTATTGGACA 

AGGTATGCMCCTCTTTGGGCCCCAATTTCTTTCTGAMGAATCAATTCTAGTGCTCAAA 

GATTCAGAACTTAGTGCTTACACTTTGGGAAGGAGGACTGATGTAAAGAATCAGGATTTT 

CTGGACAAGTTGATGAATCTCCTCCCTCTCTTTGGTGTCCCTGAATGAGTGATAGATGGA 

TAAAAGTCCAGTGTCCACATATGCACAGATGCTAATAAAATGAGGTGGAATGATGGTAAG 

GCCTTAAAGTAGAAAGTATGAAACCTCCACTCTTGTCTTGGCTTTCTCTCTTTCCCTTGC 

TTACTCACTCTATCTCCTTCTCTCTCTNCCTTTCTNCAAATGTCAAGATGTAATCTCTTG 

CCTTAAAATTTCN N NAAATCATCAACACCCTTGTN AAACACCTN CC AACCTTAGTTN 

Sequence 2072 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGACTCAATCTTACTCCCTTGTAGAACA 

GGCGATATCTTCACCATGCGCACAATGAAATCAATACTCAGAGAAGGCAGATAATTCTCC 

ACGAAGCCAGAAAACTAATAAATGAACAGATGGCTCCACATAAGGTAGAGCGTGTCTGGT 

GCCCTGCCGTTTGGAAATGACCCAAAGGCAGGAGTGGAAGATGAAGTTTCAGCAGGAGAG 

ACTGAGGATCTCAGAGGATTTGCTCTGGGATGCAGAGACGCTTGCCTATTTGCTAAGTGT 

CTATGCCATATGATTCTGCTGTTTGCGTTTGATTTAGAATGCTGAGCTGACTCAAAGTCA 
AACTGTAAGTACCT 

Sequence 2073 
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CCGGGC AGGTC AGCAGCTTTG CATGTTGACAAATCC GCTCTCTGCTGCAGATGCCTAGGG 

GAAGTTGCAGACTTAAATTTTCTTTTGTAAAATGGGGGAACACAAACAGATCTTATGTCA 

CACTGGTTACTCCAGAAAAAAAAAAAGAAAAAGAAGGAAGGAAGAAAAGAAAAAGAAGGA 

AGGAAGAAAAGAAAAAGAAGGAAGGAAGAAAAAACAGAAGGAAGCNNAAAAAAAAAAAAA 

AAAAAAAAAAGGTTCCT 

Sequence 2074 

CGAGGTACTTCCGTGAGATATAAGGCTGATGATAAGGATTGGGGTAAAGAGTTGATTCGG 

TCACACTTGACTGCCTTGAAATCTGTCTTGGCATCGGGTCCTCTTCCTCCATCGATAGGT 

GGTCAGTCTCAGAGGGTTGGTCATCTCCCTCTTGCTTCAGAGACACAGCCTGGAATCCTT 

CAAMCTCCCAATTGTTGACATTTCTTCAGCCTGACTAGGCTGGTCCTCAAGGCTCTTCT 

TTCTCTTTCTGCACAAAGGGGGCTAGACCACAGATAGTGGCCAGGAGAGG I GTGACTGGC 

TCAGATAAAGCCAGGCAAGACAGTGCCCTGTGCTGTGACCCCACTCAGTCTGAATCTCAC 

TTCACCAAGTAGGCTGAGAGTTACACTGCCACCCTCAAATAGTCAAAAAAAAAAGTACCT" 

GCCCGGGCGGCCGCTCTAGAACTA 

Sequence 2075 

CGACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGG 

GGATTATAATGTATGTTGGAATGTCCTCGTTTGAGTAAATTTTACTTAAGGGTCTTTGGG 

CTGCATAAATTTGTATGTGCATATCCCTATCAACATTTGGGAAGTTCTCAGCCAGTATTT 

CTTTAAGTAAGATGTTTATCTTTTCTCTCTCTCTTCTCCTACTGGAAATTCCATAAATTA 

TATGTTCATATGCTTGATAGTGTGCCATAGGTCCCTCATACTTTGTTCACTTATTTCATT 

CTTTTTTTCTTTTGTTTCCCCAAGTGGCTAATTTTAAATAACATGTC 

ATTCCTTCTTCTGCTGAATGAACTTTAAAAGA I I I \ I AATAGAAG I I 1 1 I I GGTAATCAA 

AAAGAAAAACATATAGTAGAAATGCTTTTTAAAGTTCAGGCATGGTGGCTCATGCCTGTA 

ATCCCAGCACTTTGGCCAGCCTAAAGTAGGAAGATCATTTGAGGCCAGGAGTTTGATCCC 

AGTCTGAGGAACATGGCAAAACCCCATCTCTACTAAAAAAACGAAA 

Sequence 2076 

GCCCNGCAGGTNCCCTNGGGACACCCTTTTTTACCATGATCCATTTTTTGAATACAAAGA 

CACAGTACCCCGCTTTTGGGAAACAACTGAAAAATAGGGGAATAGATAAGAAAGTTTT^ 

TGCTTACTTGI I I I IGTGI I M I i I I ATATTAAACTAAAATTTCATTC ACTANTAAAGTT 

TTTTTTAAAGAATAAATTGCTTGCTGGATGTATTGAGGTAATAAAAGTTT^ 

TTATATGAAGCAATATAGGATATTAGCTTTATGAGGCACTTGCCAGTAAA 

Sequence 2077 

CCGCGGTGGCGGCCGAGGTACAAACCAGGGCTGTTACTCCTCAACAAGGAGGAATAACGC 

CTCTTAGAAGCATGGCAAGCAGGTGCCATAAAACATGTCTACTACAATTTCTCACTGCCT 

GGMTAATTGGAAAAGTTTCATAAATGCTCTGGCTTTCAGGTTCTTCACTTTTGCMTAA 

AGCTGCTAAGTCTGGTTGCAAAGCTTTTATGATGATTAGAAATAATATATGTAAAATGCC 

TGATGCACAAGAGTCACTCAATAAGTGACAGCAGTGTGATTAATCTATGACATANTAAAA 

AGTTCTAACTTGAAACACGAGAGAAAAAAATAACTTCCAGTGCGNAAAGAGATAATTTCA 

TGATCTCAGGGACACTAA 

Sequence 2078 

CGAGGTACTCAAATCTCTGCCTCAGGAAAACATCCAACTTTCAAAGTTTCTGGTAAGGGT 

TGTAAAGGAAGCAATTAGTGTCATTAGAAAGAACTATCGATGTCACTATGCCAAGGGAGA 

AAAGGGGTAGCAAAAATGAGAAAGAAGGCATGAGGCACAGACATTGAAAACTAACCTGTA 

TTTAAGCTTAGTGTATTTTCGCTTTGCCTCTACTCCAAGTGGGGAGTTTTCATTGCAGGC 

ACTACAAACCCCAGGTATGACAATGACATAAGAATCAGGGAAGTGGCTGGAAAGGGAAAG 

CTATATAGTTAGACTGGCCTGCCTTTGAGTCCTGCCCTGCCATTTGTACCTGCCCG 

Sequence 2079 

CCGCGGTGGCGGCCCGAGGTACTGCAGGAAACCATGTTGCCAGCACTCTCCTAGCCCAAA 
AAAATTACATCTAGGAAGCAGGGGCCATTAACCTACTGGAATCATTGACTATGTCATAGA 
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AGTCCAGTCACCCCGCAGGAACCAGAATCTTCCCATGGATATTTAAACCCAATTAGAAAT 

GGCC ACCAG GTGGTGCCAAACT GCAGC TAAGGTCCAGGTGAATTTGAACCATGCAATCTT 

GGATCTTTTGGCTCTCCTCGTGA7TTTTATAGAGTTTTTTTCA 

AATAAATGCTGTCAGTTTTGATAACTCTTTCTTATTGTCTATAGGTAAAACAGAAAACCT 
TTAGCATTGGATAAAAGTCTTCCTTATrrTTTACTGNGACCTCTTTCTTTC 

CTMTATTTAGTCGCATGGGAACACCAACTGTTAACTGMTTTCCATTTTTCCAT 
Sequence 2080 

GGAGCTCCCCGCGGTGGCCGGCCGCCCGGGCAGGTACGCGGGGGTTTGTAGATGGAAGGA 

AGAACTTGTGTGCTTAGACCTGACGCTGGGAGGAGATGCTGCCACCTAGGTTACTTGTAG 

GACCCTATACGGCNNCCTCCTTTGCCAGGAACTTTTATAAACATCCTGCAGGAAAATGCA 

GTGAAGTAGAAGAGACAGGGATATCCCAGAAGGTTATGCAAAACATCAAGAGAAGATGAG 

AGGTCAGAGATGGGMGAMCMGMCTTTGACATGCTTGGTGTTCTTGCCCAAGCTTTG 

AAGAAGTTTACAAAGTCTATATGTCAGAATACACATTTCCCACCTTGCCCAACAGTAGAA 

AAACATAAGAAGAGAAAAACATTAAAAAATGACAAGGAAGTTAATGGAAGTCAGCAATGT 

GATGGTGTTTGGAG GTGGA GCCTTCAGAAGGTAATTNATGCCCTTGTAAGAAGAGGCCAN 

AGAGCTTGCGCACCTTTTTCCTTGCCATGTGANGAGCCANGAAGCCGGCTGGCTGCAACC 
TGCA 

Sequence 2081 

GGAG CTCC CC CGCG GTGGCGGCCGAGGTACACTTGCCAATTGAGATAATATGAAATTGTA 

TTCCATTTTAGTTTTAATTACATTTCCATGATTACTGCTGAGGTTTA 

TATTACGGAA TTTAGTTTAC TCCTACGTGCATTACATTrrCATAATmG 

CC TTGGG TTG \ I 1 i I ATTTTCTTACTGATTCCAAAGGTATCTGTAAATTATTGAAATGAA 

TTATT TTAGT ATTGTGCTTTTCAGCTTTTCTTTTCACTTTAATTGTGCAT^ 

AAAAGTTTTCATTTTAATGTAATCTTAGTTGNAA 

CTG NT ATTTTTTA 

Sequence 2082 

CCGGGCAGGTACAGGAGAGTTCCCATAGACTNCTGCCCCCTNCCACAGACAGACTACCCT 

AGAATATTGTAAAAGGATCTATCACTCATTTTCCTCACCTCTCTTATAGATGTCTCTCTT 

CAACAGTGAGATTNTTCCGTAACTCTCTGTNNCAGGACTGGGCTTGTTTGCACTCAAATC 

CATGACTCATTGTTTCTCTGCCTTTCCGTGTGTTACAGGTGGGCTGATCCCCCTGCAGCC 

AGTTTCCCATAAGCAACTGACTTCCAACTGGGAATGTCTCGGGGGATAATGGGGGTGGG 

Sequence 2083 

AGGTACTTGCAAACCAAATTG G C AGCATATTAAAAGAATTACAC ACC AGGG AAAATAACC 

CTAGACCTTGCATTCTAGTGCTCTGCTTATCTGTGACTTCAGACAAAGATGGAGAGAGCT 

GACCATCAGTATMTTTAGCTGMTGAAATAATGAGATATTTTAGGAAGAAGAACATTGA 

ATGTCAGAAGTTAAATGTGAGATCCAAGAAGTAACAGTGAACAGACAAGTTTGTGGGTAT 

GTGGATATGTCTTAAATAAGTCAGGTTGTGTCAAATTATGATGGATGATGATGACTGCGA 

GGCAGGAAGGAAGATGAAAAGGAGGAGGCATTGAACACTAATGGTCACAGGGAGCACCCG 

GGAGCGTGGTGACCCCAGGGTGAGCTGCTTQTTCTGTGAGGGTGGAGGGAA 

Sequence 2084 

CGAGGTACTCTGACATCATCTGACATGGCAGCTCCCCAGCCATGGTGGTGGTGATAGAAA 

TATAAATGGTGACAGCAGCAGCAGTGTGATAGTCACGGTGGAGGGAGATTGCTTTTCTGC 

AGCCAGGCTTCTGGATCTCCCAACCTGGGTGGGACCCAGCTGAAGATCTGCAGAGAAACC 

AGCCATTTCTGAGAGCAAGGCAATCAGTGGGCATTTACTGGAGGGTGTCTAGGTTCTTAA 

CATCTCACTGTTATTTGGGCTCATGGGATCATAAGAATGACAAAATAAAGAGAAGTCCTN 

ATTTTNTTACCACAANAAGCTTGACACAATTTCCAGTAAANTCTTGGATTGAAAGGAGAG 

ACATGTTTTCATGTGTTTTATATTCCCTTAACAACCCAATA 

Sequence 2085 

CCGCGGTGGCGGCCCGAGGTACTCCACTTGTAACTGCTTCTCCCCTGATGCCACCACTGA 
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CCTCTGTAGCCATGGTTCTGCAAGTTCTCTAATTTTCTGAGCCTCTGAGTCACTGACGAA 

GTCATGGTAGAGAGCAATGTAGGGCTCCAGGTGGATGACCTCCTTCCGGATGGGCTGGAG 

CAGCAGGTAGGCGTTGGAATTGGTCTCATAGGAACAGTAGAGGCTAGGGATCTGGTAGAG 

AGTGGGCTGGGAACCCAGGGTCTGACATAGCCCCTCGTAGGTGTCTCTGGTCTGCAGGTG 

GGGTATATTGGGCCTCTGGATGACGGCCTCAGCTACCACGTGGTTGGGGCTCTCTGCCAA 

GAGCCTTTCATATTTCAAGACATTCCTGGCCATCCTCTTATTATCTGGGCTGTAGAGAAG 

AAACTCCCGCCGCGTACCTGCCCG 

Sequence 2086 

GCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACTTACAGTTTGACTTTGAGTCAGC 

TCAGCATTCTAAATCAAACGCAAACAGCAGAATCATATGGCATAGACACTTAGCAAATCC 

AATGCCTTCCAGGCATCTGTTTCCTGTTGATGAAATCCTCCCTATGGAGAGCAAACTGGT 

TCATATCTTCAGATAGTGTCATTCAACCCCTTGGCAGCTTTCTGGGCTACTAAATATCAC 

ACCGTCTGGTGGGACATTTCACCCAAGTTGTTCATTTATTAGTTTTCTGGCTTCGTGGAG 

MTTATCTGCCTTCTCTGAGTATTGATTTCATTGTGCGCATGGTGAAGATATCGCCTGTT 

CTACCCCCGCGTACCTGCCCGGGCGGnCGCTCTAGAACTA 

Sequence 2087 

GCTTTTAGAGGGATGGGGCACGGCATCTCAATACCATTTCCTATTTTTGAGATGGTTGCT 

TCATTCACAAAGAAGCTACATTCCACAAGCAGGTGTCTCTTCCTACTTTATCAACTCATC 

CATTCTTTTCCCACAAATACTTACTGAACACTCTCTATGCCTGTCGTATTGCAGACATGC 

GGAATATATCGACAAAACTGACTGAACTTNCTGCCCTTGGGGTGCTTCCTTCTTTATCTG 

GCTCAAMCTCTGTTCGCTGCAGGTGCTAGCTGCAGGTTGTTGCTCATTAAAAAGGAAAA 

TCTATTCTACAATTAGAAAMTACCTTGGGAGAACATTGCAAAACAGTATTTGAATNAGT 

NAAAGCTGGTGTCACCCCATAGGAGAAGNTTGTACCTGCCCGGGCGGACGCNCTAGACTA 

NTGGANCCCCCGGGCTNNAGNGAATTCCATATCAAANTAANNGAACNCGTCTACCTANGN 

GGGGGGGCCC 

Sequence 2088 

CCGGGCAGGTACGCGGGGTCTTACCTCAGAGCTGAGCTGGGCATGAGTAGATGCTCAGTA 

AGTGGTGCACAGGGTTGGTCCCTATGGTGGAGGCCCCCTAACACCGCCCAACCCCCCTCC 

ATGTTCTCACAGCTCCACGCACTGAGCACGGGCATGAAGGCCATGATGTCAGAATTCTGC 

ACCCAGGGAGCTGAGATGTGCCGCAGGGCCTGTGGCGGACATGGCTACTCAAAGCTGAGT 

GGCCTGCCATCACTGGTCACCAAATTGTCGGCCTTCTGTACCT 

Sequence 2089 

GATCAGCTTGGATATACACTTAGGTCATAATACTAAATATGAATATTCG MINI NCCAT 

AACCATAGGGTGTAATCATATGCCTATGANTCTGATCCTCTACTTTATCCATCTGAATGC 

TTCTCGATCCACTCATGCAAGATGTGTAAAGCTCATCATGGAAATAATCTATACTAAGCT 

TTCAATATTGACATATTTATATAAAAATTTATNGTMTANGATTGAATAAACANTAAGTA 

ACTTTTTTTATATTGATAAGNCACCACAATNTTTATCTTAAAAAGCAATCTT ATTAIN 

AGTATAAATTCAGTATTTTTTCAGATCCTACAATTCGCCTGGTTNGGTGTATCATTAGNC 

TATAATAAATNNNNGTGGAATGGCAATTTTACCAAAACAACACACTTGCTTAAACATAAA 

GCCAATGTTCTCAGTTTCCAATATATCTCATTTCAATAGCTAACATCAACAAAATATATA 

TAT 

Sequence 2090 

AGGTACAATATAGGCAGACAGTTTGCCTTCAGAAATTCAGAAATGCAGCTTTTGAGGGAG 
GTCAACATCATTGGTCTCAGCTACCATTTTCCTGCAGGATGTTTATAAATAGTTCCTGGC 
AAAGGAGGTTGCCGTATAGGGTCCTACAAGTAACCTAGGTGGCAGCATCTCCTCCCAGCG 
TCAGGTCTAAGCACACAAGTTCTTCCTTCCATCTACCCCCGCGTACCTGCCCG 
Sequence 2091 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACGCGGGAGGAAGGATTTAGGACA 
GCATCCTAATCAAAAGGGCCTGGAAGCACTTTATAAAAGAGAGAGACAAGATCGCATGTC 
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AAACTAGAAGGAAGGAGGTGAGAGGAGATAGGGCTCCAGAGTGGAGCAAGCCCCTTCTGT 

CCCCGTGAACTTCCTGCCGGTGCATGGGTTACCTCTCATTAAATTTAATAGTACCTGCCC 
G 

Sequence 2092 

AGGTACTGTATATAAATACTGAAACAAAACAATGGAAAAAGCAAATAGGAAATAGAGTGA 

GMTCAGGGGAACTGAGGGGMGCACCAAGTTTATCTTCTTTAGATAGTTCTTATTTTGG 

AGACAGGATATTTATTTCTTGCGAGCTCTAAGGACTGTGCATTTAGCAAGGACCGTTCAT 

TTTTCATCTTCCTCACTAATGTCTTAAGGGAGGGAAAGATAAATCTGAGTGTCATCAGGA 

TACATATGGAGCTGAGGGCCACAGTTGCTMTTMGTCCTCTTCAAAGACCCATTTGGAT 

GCCTCAGCATACGTGATAAAAMGATTACCCCCAATGATGTCATGTTTTTCCAGGCATCA 

AMGMCCCCAMTTACCTTGTGTTAACAAGAGTGGTTGGAGAAAGCTAAGATACCACTA 

CAATCTCCCCTTACCCTCTGCATAGTGCTGGCTMGTGGTATGGTCCTGATTTTCTGATA 

ATTACTACTTTTAAGGAGA TGGTA TTTTGTTGCAGTGTCACTGAGGAAACGGGATTCCAA 

TCATMCAGCCCCGAACAATTTTMGTGCTGCAAACCCGACCCAAATANAATCACTTGAA 

CCTTGNCNGAAATTGGMTNTACTTTCTTGAACCTCMGGGGAGAGAAAAAGTAATTTGG 

GAACAAACAGTTCCCTTGCCCGGC 

Sequence 2093 

TTTCAAAACCCGCTTATTTCACGCGAGGACTCCTCTGCGAAGACAGGAAGGATGGGGTCC 

CACACCTGAGCCAGGCGCGGTCCCCACTGGGTGCTCCGCAGCCCGGAGTGGCTCCGGACG 

ACAGCCCGCTCAGCGGACCTTTCGCCAGGCTCGGTGGACCTTCTCGGCTGTTCTCCTCAG 

GGCGCAGTCAGGTTTCTTCGACACACACCAACCGAAGTTGAAGCGCGTGTCCCGCGTACC 
TCGGCCG NTCTAG AACTAN 
Sequence 2094 

GGAGCTCCCCG CGGTG GCGGCCGAGGTACATGAACTAATAATGCATTTCTAAAGGTGAAA 

AANGAATAGGTA I i J 1 I CTGTTTAATATTCAATTTTATAGAGAGTAGTACGTTMTTTTT 

TTAAACCCCAGAAGCTCAGGATCTTATCATTTTAAAAGAAATTATCACCAGTTCTGTGTG 

AGTAAA TAAAGTA TTATAACACTTTGTGTTTTTCATCCATGATACCTTGTATTTACTTAC 

CTGAGC \ Mill ICTAGGGAAAGAAAAATGCTCAGGTAATAACAGAGCCTTGAAAAATTT 

GGATTTTCAAAACTACCTATTTATGTATAGGCCTTTAGATCATCTGATGTTGAATACTCT 

TTMGTGATCTAAAGGCCTACATATAAAAAGGTATTTTTATTAAATTCTGGAATTAAACA 
TT 

Sequence 2095 

AGGAGCTCCACCGCGGNGGCGGCCCGAGGTACTTN iTTTTTTTTTTTTTTTTTTTI i GTG 

CAATACCAGCTTTCACCCTGTCCTCTGCACTGCTTAGCCTTGGCTCTGCTGAGTTGTGCA 

ACA CATT CTGAGAGATGCGATACAATGCTCCAGGCAAACTTATAGAAAAACAGGAATGAG 

TGATTTTATACGGGATGTGTTTCAGCTGTCCAATTCAAAATAAATCACGTCCAGGTGCAT 

TCTTTCCTMmGTGACCCAAACTGCTTGCACTTTATACATTAATGAATTATT^^ 

AAAGAAATTAGCTCTATCATGMGACAGTGCTATCTAACCCTTTGTGGTCTTTCTCCCTT 
AAAAA 

Sequence 2096 

CGACTACTATAGGGGCGAAATTGGAGCTCACCGCGGTGGCGGCCCGTCACGGTGATGTTG 

TTGTCGCGCCGGGGCACGCTCAGCGAGCGCACGCCGGCGTAGCCGGCATTGACATTCACG 

CTGGTGCGCGCACTGGCTTGCCAGTTCCAGCCCACGTTGGCGCCGATCTGGCGCGTGCGG 

TCTTCGCTGCCGAACTCCATGCCGGGCAGCAGGGCGCTGTCGATGCCGCTGGCCGTTTGC 

GCCGTACGGTCGA 

Sequence 2097 

CCGCGGTGGCGGCCCGAGGTACGCGGGTCCCTGAGTTCAGAACATAGGAATTAGATTGAT 

AGACATCAACATACCCGCTTTATTGCTGACTCATGACAACTAATGGGAAGACATGGCTCA 

GATGTGCAGCCACAGTGAGCTTCTGAACATTTCTTCTCAGACTAAGCTCTTACACACAGT 
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TGCAGTTGAAAGAAAGAATTGCTTGACATGGCCACAGGAGCAGGCAGCTTCCTGCAGACA 

TGACAGTCAACGCAAACTCATGTCACTGTGGGCAGACACATGTTTGCAAAGAGACTCAGA 

GCCAAACAAGCACACTCAATGTGCTTTGCCCAAATTTACCCATTAGGTAAATCTTCCTCC 

TCCAAGAAGAAAGTGGAGAGAGCATGAGTCCTCACATGGAAACTTGAAGTCAGGGAAATG 

AAGGCT 

Sequence 2098 

CCGCGGTGGCGGCCGAGGTACATCTCTAGCTGATGATTCAAAAAAGAAACCTTTTAATCT 

CACTCCACTGATCAGCTATGATACTTAAATGTTTTAGCTGTGAGCAAAATAATATGCATT . 

CTCAAAGAGAGTATCTTCAGACTCCAGTGGCCGAGAATCTAGAGTTAGCAATGGAAAAAT 

TAGTCTCGGGCTTCTGTTTCTGCCCACANTTTTCAAATTAAGMCAATGTGTTTGCACTT 

AATGAAACAA 

Sequence 2099 

CCGGGCAGGTACATGTCAAAGGAAAAACACGTGAAAGATGAATTCAGCCAAACCCACCAG 

TGTTCAACCTCAGTCTAATCAATCTCATACTCCTAGAGGCTTAAGTATCAGCAGGTAAGA 

TCGTGATGACCTGTCTCTGAGGCTCCAGACAATAATTTCTAACTGCCAACTGGAAATCCT 

TATATGGTTAGGCTGCCAACATCCCAGGGAACAGGACCAAAATAAAAAGCATCACTCATT 

ATCCTACTGCAATTTTCCTCTTCCCTTTGTCAAATGGGAATGATCTTTACGATCATGATC 

CTTTATTGCAACCAGGACAGAAATCATGAAGTCATCTATGGCCCCTTCCTCTCACTCCGC 

CTCCAATTAGTTGTCCTATCTGCCCCTTCCGTTCTCCTTCCATCTCCATAATGCCCAAGC 

TAGTCTATCACCTCTCCATTCATCTTTCACTCAACCTCCAGAATTATCTTCCTAAAACCC 

AGACTGATTCTGATTTTTCCTCCTCAAGAAACGCTCTCTAACACTAMGCACTTTGCTTC 

TACCTCTATCGAGGCCAANGGTATTCATCCTTGNAATCTCTCTAAATGAGTTCCCAGCTG 

N CC ACCCCACG AG ACTAGGAGTTTGGTAAGG AC AGAATCACTCATTTC ATCCTTG N ACCT 

Sequence 2100 

CACTATAGGGGCGAATTGGAGCTCCCCGCGGTGGCGGGCAGGTACTGTGTGGTTTAAAGA 

TAAGCAGCCTGTGCTGTCTTTTATTTTCAGAAATAACATAAATAAACAACMTTACTGAA 

CGCAATGTGCMTCTCTATCTTCTGTGTTTCGGGGTCAATTGTCTAACAGGAGCCACAGG 

AAGAGCTTCAGAGGCACCCAGATGCCAAGCCTCAATTCCCACTGTGACTCTACTCTGGCA 

GAGGTGAGACTTCACCTTCCAGAAACTATGCTGGGGGCCTCTGTGTCTGTGGCATATTGT 

TTATATTGTGTGTTTATAGGCAATTTTAACAATCACTGTTTCTAGATTGCMCCCTCCTG 

AGGCACTGCGCTGACACAGCAAAGTTATTCCCCTTACTGAAGAGAAAAATTAGAGGCCTA 

AACACCACAGAAGCCCTGATAATATGGAATATTAGCATCAATGTCAGAGCACAGTATTTA 

GGACAGATACTGATAAAGACAGCATAGATGACCACCTCACCTTGTCACCACCTGAGCGAG 

GAGCCCTTAAAAGATGGGTTTCC 

Sequence 2101 

CTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAGGTTCAAATAGT 

CAGCAGCTCATCATAATCAATGAGCGAGGACATAAAGTAGGAAAAATGCATCACCATGGT 

GAGCAAGGAAAGCAAGTTATTGGAGGCACATGTTAACACATAAAATATAAAATTATATGA 

TCACACTGGAAAGGCTTGCCTGAGCCCACAGTTTGAATGCCTACAATAAGATGAGATGCA 

CAACAAAAAGCAAGAGAACCTGATCAAGTGGGTGACCTGGCCATGGTGCTCTCATCAGTG 

GGGACCCAAATGCTTATGTGGACTCACCAGGTATCGAATTAGACATGAATAGGAGTGTTT 

GTTGTGATCCAAGAAACTATATAATCAAATGAATACAATGAAACTTTAAAA 

Sequence 2102 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCTACTTACTATTTTATGAGAATTGAI I H I AC 

TAAGCCTGATTTGAATGGGTTGCATTTACACCAACTGCTATTTATTAATGAAAGTGGTAG 

CCTGCTTTTTAAGTAATATGTGTTTTGCAACAGGAGAGGGTTTTCTGAGCTTT^ 

AGTAACCCTGAACTTCTTCCAAAATCCACACCTTTTATTCCCCCTTTTCCTTGTCTCCTT 

CTAGGTGCCTCAGTCCTGGCATCTCGGGAGGACTCTGCCATTACTCATCTGTTACTGATT 
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C TG AATCTTGTGCTC TAGC TCTAC AG CC C AG TG AAG AAG C AATATATCTTTAATTC AGGG 

CTTGCCTCTTGCTTTCCAACTCTAACCTGAGAGCAGTGCTACTTTTGTAATTGCAGGTCT 

AAATAACTTAAGCGTATATACAGAAGCCAATCACTGATCACTTCCATTGCAATGAGGCAT 

CTGTTGTCAMGMGGAAACCATTTGTCAAGTCCCTTATAGATATTGAATCTTCAAACCC 

AAATCTGGGGATGTTTACATTTCT 

Sequence 2103 

GATGGCGGCAAGCGATGCGCCCTCCTCGCACACTGNNCCGGCCAGCATGNNCGTGCGCAG 

TTCGGCCAGCACGGCGCGCGCGCAGGCCAGGGTTTGCACGCGGCCGTANTGTGCAATCAG 

CTCAAGGCAAGCCGCGTCGCCCAGGATGCGGTCGACCGACGGCAGGCGCACGGTTTGCCC 

CGTCGTCATGCTGGCTGGCCCTCGCCGCTGTCGTTCTTCCCCAGCCACAGCAGCGGATTG ' 

CCGCTGGCGCGCTGGTAGCCCGCTTCGCCACCAACAGGTCGAGCATAAGCTGGCCAGATC 

GTCGGCAGCGGTTCGACGTTGTAATCGNGTTCCTGGTTGAAGACTTTGCGATAGGTATGG 

CMTCGTCGCANGTTTCCGCGCGCGCCACCTTCTTCGGATCAtTCGCCTTGTTGGGCAAT 

TTGTCGTCCTTCGCCGCACGGGGTCGAACGGGTTTGCGTNGGCGCTCTAAAACTAGTGGA 
TCCCCGG 

Sequence 2104 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGTAGATGGAAG 

GAAGAACTTGTGTGCTTAGACCTGACGCTGGGAGGAGATGCTGCCACCTAGGTTACTTGT 

AGGACCCTATACGGCAACCTCCTTTGCCAGGAACTATTTATAAACATCCTGCAGGAAAAT 

GGTA AGCCCTTGGTTAAATTTATTGACTTCATTTATGTATCTCACAACACTTCTTTTCTG 

TTTTAGTTCTAGGAATAGAAACTTACTTTTGGAAATACAGTACCTGCCCG 

Sequence 2105 

CACCGCGGTGGCGGCCGAGGTACAGGGCTCTTGATATATCCTGGCCTTAAGTCTCAGGTT 

CTCCAACAAAGCTGGCAGCCTGTGGTATGCCTTCTGGGGCCAGAGAGTGTGGCATGATCC 

GAGGTCCTGAGTCTCTCTCTCAATGAGCCCATCATATTGGCTTAACCAAAGCCAGTGGTC 

TTCAGGACACGGAAAAGGGAGTCCCTTAGGCTCCAGGTTTTTTTCCCTTCAGTTCACACT 

GGGAGAAGGGTCACAGAGCTCCAGTGAAATCTGGTCCTGTTAGATCATTTGTGGTTGGCT 

GCAGATGCACATGTGGGCATGTGATAGTTTTGATTTAGATAAAAAGCCCACAACTGAGTG 

GAGTGTTGAAACCTCAGGGTTATAGACTGGTGTATCTGTGAGAAAACCACAACAAGTATG 

GTATTTGCCCGCGTACCTGCCCG 

Sequence 2106 

CCGCGGTGGCGGCCCGCCCGGGCAGGTACATGCCAGGCACAAACTGGAGTCACAGATGCC 

ACACTG ACTAT CACATTGTCATCATACTCAGAGAGAGAAAGGTGTATACACCCAGTATTT 

TCTTATGTTTTCTCTATACTTCCATATCCCCACCCCCATTATCCCCCGAGACATTCCCAG 

TTGGAAGTCAGTTGCTTATGGGAAACTGGCTGCAGGGGGATCAGCCCACCTGTAACACAC 

GGAAAGGCAGAGAAACAATGAGTCATGGATTTGAGTGCAAACAAGCCCAGTCCTGCTACA 

GAGAGTTACGGAAAAAATCTCACTGTTGAAGAGAGACATCTATAAGAGAGGTGAGGAAAA 

TGACTGATAGATCCTTTTACAATATTCTAGGGTAGTCTGTCTGTGGGAGGGGGCAAGGAG 

TCTATGGGAACTCTCTGTACCT 

Sequence 2107 

CCGCGGTGGCGGCCGCCGGGCAGGTACCATGGCCAAAGCTGAGGCAGCCAGGGTGGGACA 

GGGGCCTCCAAGCAACACAGCAGTGTTAGCAGTTCACCTGGACCTGTTGATGAGCTGTCT 

TAGATGCACGAAGGCAAGAGATGTCTCATCTCTCTCTAAGATCTTAGATCTTAAGTGCAG 

AGGGAAGGCAGCTGGGAGTGTGGACGGCAGCCACAAGGGAGAAAAGTGCAAGAGCATTTA 

CAACCAGAAGAGAAGCCGAGGGGACATGATGGCCCATGGACAATGTAGCCATGGACAGTG 

CCCTGGGAGGCTGGTAAGGGGGGTGCTGGGACTGGTGGTGAAGAGATCATTTTTAAGAAA 

AAGAATCCCGCGTACCT 

Sequence 2108 

CCGCGGTGGCGGCCCGCCCGGGCAGGTACATGCCAGGCACAAACTGGAGTCACAGATGCC 
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ACACTGACTATCACATTGTCATCATACTCAGAGAGAGAAAGGTGTATACACCCAGTATTT 

TCTTATGTTTTCTCTATACTTCCATATCCCCACCCCCATTATCCCCCGAGACATTCCCAG 

TTGGAAGTCAGTTGCTTATGGGAAACTGGCTGCAGGGGGATCAGCCCACCTGTAACACAC 

GGAAAGGCAGAGAAACAATGAGTCATGGATTTGAGTGCAAACAAGCCCAGTCCTGCTACA 

GAGAGTTACGGAAAAAATCTCACTGTTGAAGAGAGACATCTATAAGAGAGGTGAGGAAAA 

TGACTGATAGATCCTTTTACAATATTCTAGGGTAAGTCTGTCTGTGGGAGGGGGCANGAA 

GTCTATTGGGAACTCTNTTGTCCTNGGCCGCTTTTAAACTAATGGATCCCCCNGGCTTGC 

AG 

Sequence 2109 

T I I I I I I I I II I I I I I I I I i I GGGGAACAAGGTGAGGTTCTeTGAGGTAACATTCCCTAA 
GACAGGMCCCCAGGACTTTCCMCTCTMGGATTTCCCACATTCAGGATCCAGAAGTTT 
TCAAAATTACCTCTTAAGTTTTCCTACTAAGTTTATGGCCCCAGAGGCTTCTACTCCAGG 
TAAGCAGTTCTCTGCAACTCTGAATTTGCTTGTA^ 

CCCTGTAATATCAGTTCTCTGACGGGTCCAAGACAAGCCATCAATTTTMGGTTCTTTAA 
CTATTTCTTATTGNAAGGGATGGGAATTGGAMTGCTAAAGCTCTTTAAAGATTTGGGC 
Sequence 21 10 

AGGTACACACAGACCCCTGAAAAGCACCATGGCAGTAAGACCTTCCCCACATGAGTTGGT 

MCCTGTAAATTATCTTAGCCTTTCCTTATTCAACTTATCAAACCCTCCTCAGAACTCCA 

CACTATTACTTCCCCTTTGGTGACGTGAAGGGTTGGAACCCCAGCATCTCCCATTCCCGA 

TGAGTCATAACTCACACACTGCACAAAACGCAGTCCACATAACTGA 

Sequence 2111 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACNCGGGGGTTTGTAGATG 

GAAGGAAGAACTTGTGTGCTTAGACCTGACGCTGGGAGGAGATGCTGCCACCTAGGTTAC 

TTGTAGGACCCTATACGGCAACCTCCTTTGCCAGGAACTATTTATAAACATCCTGCAGGA 

AAATGGTAAGCTGAGACCAATGATGCTGACCTCCCTCAAAAGCTGCATTTCTGAATTTCT 

GAAGGCAAACTGTCTGCCTATATTGTACCTTGCCC N NATCAAATTTT 

Sequence 2112 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGTGAAGGGCTCTCTGTAT 

ACTGCTG AAG ATTCTATAGTTTCTCCTGAAATCCATG AGGG AG CC CCCTGAAGGATTTTG 

GCCAGGAAAGTGTCATGAATCAAATTTCTATTTTAAGAATAGTATCGAGGCAGCAATGTC 

AMGTTTGTTTAGAAAATAGGATGGTTCCTGTAATCCCAGAACTTTTGGAGACCAAGGCG 

GGAGGATTGCTTGAGTCCAGGAATTTGAGTCCAGCCTGGGCAACATGGGAGAGGACCTCA 

TC TNTNCTAAAATTTN ANTNNTTNTAATAAAAGGTTCCCT 

Sequence 2113 

CCACTCCACTGAAATTTGCATTGGCTTATTGAAAGTCAATCACAAGTCAAAACCACAGAC 
TCACATAGGAGAACACTGTTGTTTGGGCCCAGCAGCTCTGAAATCAATACCTTTAATGGT 
AGAACCTAAAACTAAAATAAATAGGAAGGAAGGAAGGAGAAAGAAAGAAAGAGAANGAAG 
AAAGAAAGAGAAAGGGAGAGAGTGAANNANAAAAAATGGAAAAATAAGTACCT 
Sequence 2114 

GGCTAGGCAGCTCTGGAGAAAGCAGAAGTGGATAAATAAGGTGTGGACTCACCAAAGACA 

GTTCCAAAGTCAATTTCACTCTGACACACTCTCTGTGATCTTCCACAGTCAGCACAATGC 

CTGCCCCCTGCTAGGCCTGATGGATGATCTGATAGAAAACTGGCTTCAGCAAGGAGCTTC 

CACAGGTAACCTTGGTTCTGCAGTGATAACAGCTCCTATGAAGGAGACCCCATCAGACTA 

ACAGTGGATTTCTCAGCAGAMTCATAAAGGCCAGAAAACAGTGGAATGGCATTTTCAAA 

ATGCTCTTTCCATTGCTTTTCCATACCACTGCTTT 

Sequence 2115 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCAGCGGCCCGCCCGGGCAGGTACCAAAAGAAT 

GTTCATAGCAGCATTATTTGTCATAGTCAAGAGCTGGAAGCAACCCAGTTGTGCATCAAT 

GGTAGAATGGATGACTAAATTGTAGTGCATTCTTACAATAAAATACTATAGAGCAATGAA 



Page 302 of 635 



WO 01/51628 



PCT/US01/00798 



Table 1 



AATAAATAAACTATGGCTATATGCAACAACATGGATAAATCTTACCAACAATGTTAGACA 

AATGTAGCCAAACCAAAAAATACACATGTTCTATGAATCCATATATGCAAGGTTCACAAT 

AGGTGATAGTAGCTACTCTTTTGGGAAGGAGTGAGGAAAGGACTGCCAGAGTATTGCAGA 

GGGGGCTCCGGGTGTTTGTAATTTCTATTTCCCAGTGCANGTGTGGTTTGCATGAACATG 

TTCCTTTGN GGAAATTC ATTGN N GCTGTACCTAAATATA 

Sequence 2116 

CCGGGCAGGTACGCGGGGGTAGAACAGGCGATATCTTCACCATGCGCACAATGAAATCAA 

TACTCAGAGAAGGCAGATMTTCTCCACGMGCCAGAAAACTAATAAATGAACAACTTGG 

GTGAAATGTCCCACCAGACGGTGTGATATTTAGTAGTCCAGAAAGCTGCCAAGGGGTTGA 

ATGACACTATCTGAAGATATGAACCAGTTTGCTCTCCATAGGGAGGATTTCATCAACAGG 

AAACAGATGCCTGGMGGCATTGGATTTGCTAAGTGTCTATGCCATATGATTCTGCTGTT 

TGCGTTTGATTTAAAAT 

Sequence 2117 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAAATTTGGGCATAGTCAGAAACTCTGCTGACT 

CTCTCTTCCCATTTAACCTAATCTTACCATAGCACTCCTCACCCTATGACTTGCCCATTG 

CAGTAAACCAGAMGCAAAAAAAGGTCAAGCACGTGTGTTTGAGATCCAACTCTAGTGTT 

TGTAGGAGCTCTGCAGAACCACAGTTTGCACATCTGTAAATTGAGCAATAACAACTGCCC 

TGCCTGTTTCACAGGGTTGCTTAMGAGTTATATGAGCTATTGCATGTGAAAGTGTTCCA 

TAAGCTGC CATT TGCTCTGAAGCTCACGGGTATAATTATTCTAGTCTAACCTACTTATGT 

TTGCCCTCTTTTCTGCAGTTGCATCTTTGTGCATAAAGATTCTCTCTCCAACCTGTGCAA 

TTGATTCATCTTTAAAGACCMCTCAAATTCGACCTTCTAAMGAAAATCTTCTCTGACC 

TTCMCTCTAAACTAGAGTAATTTACTAACTGGTATTTATCACTTTCAAATTCTTCATT 

GGTCTTGAATCATAGAATATTGGTTACGCATTG 

Sequence 21 18 

C GACTC CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCGCG 
GTTrTTAMTCTGTCACTGTTCTTCCCC^^ 

TTCACATTGCAATTGCTGCAGGTTAATAATATTGTTTTATCAGGAMCATTTATCTTGAG 

GAAGAAAGTTATCCTAATACAGCAGATAACTTATCCAAGGACAGATAGAAAGGGGACTGT 

GTCTAGAGGGTAATGCTAATGCCTGGCATGCTATAAAAATAAATGAAATCTGATTAACAC 

TATTG GAGGG GAAGAATMGTCATCGTTTCTATGAAATATTATTTTCCTGATTGGAACCA 

TTTCCATTTTCGCAGCTTAAAACATTGCATTAAAGATGAGATGACTTTAAAACCCGGGGA 

CAGCTATCTCAAGCCCTGTGGATTATNAATGGGGTATAGGCTTGTAATCCTATGGGTTTA 

GACTAAAGTATTTCTTCATAAATTGAAAAGAAAACNCACCTTGGMTTCAGAAGATTTTA 

ATTAAAGGTGGTCAAAAMTTTAATGGGGGAATTTNAANGGCTGACTCCCTAAGTAGTTT 

TGGTTAATGATTTAATTTATAGAAGAAAGGTCTTNGGCNAATTTGGTATGCCCAAAAGAG 

CCTACTATGNCCCAANNNTTTTTCTNGGAGCNTTATTATTAAGNGGCCCGCTTTTCCNGG 

TGGCNTCCCTNAGTCATTTCNGCTTCATCCAAATTTGTGGAATTAANCCTACCCCNTTTA 

AATGGGGACCATTGGCTTTTTAAGGGGNNAAGACCCCAAAGGGTANATANCCTGGCNTTA 

TGATAANTATTTTTTTT 

Sequence 21 19 

GGGCGAATTGGAGCTCCCCNTGGTGGCGGCCGCCCGGGCAGGTACCAAAAGAATGTTCAT 

AGCAGCATTATTTGTCATAGTCAAGAGCTGNAAGCAACCCAGTTGTGCATCAATGGTAGA 

ATGGATGACTAAATTGTAGTGTATTCTTACAATAAAATCTATAGAGCAATGAAAATAAAT 

AAACTATGGCTATATGCAACAACATGGATAAATCTTACCAACAATGTTAGACAAATGTAG 

CCAAACCAAAA AATA CACATGTTCTATGAATCCATATATGCAAGGTTCACAATAGGTGAT 

AGTAGCTACTCTTTTGGGAAGGAGTGAGGAAAGGACTGAGAGAGTATTGCAGAGGGGGCT 

CCGGGTGTTTGTAATTTCTATTTCCCAGTGCAGGTGTGGTTTGCATGAGCATGTTCACTT 

TGTGGAAATTCATTGAGCTGTAACTAAATATACAGAATTATTTATGAATGAAATGATATA 

TAATTCAATGGGAACCTGGGAGGATAGGGAGGAGGTGGGGGGANCTNAAAAAACATGAGT 
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GGCCCGAATGCTTAACAATTGGTATTGTTAGGTTATGGGATCCCGGGGGGTGGTTGTCCT 

0 

Sequence 2120 

AGGGCGAATTGGAGCTCCACCCGCNGTGGCGGCCGGGGGCCATTGAGACTGCCATGGAAG 

ACTTGAAAGGTCACGTAGCTGATACTTCTGGAGAGACCATTCAAGGCTTCTGGCTCTTGA 

CAAAGATAGACCACTGGAACAATGAGAAGGAGAGAATTCTACTGGTCACAGACAAGACTC 

TCTTGATCTGCAAATACGACTTCATCATGCTGAGTTGTGTGCAGCTGCAGCGGATTCCTC 

TGA GCGC TGTCTATCGCATCTGCCTGGGCAAGTTCACCTTCCCTGGGATGTCCCTGGACA 

AGATTTTACTCCCATAACCCAGGCAGAAGTACCT 

Sequence 2121 

CGAGGTACTTCATACTACTTTAAGTTACCTCCTATTGGGGGCATTATAAATGGGTAAGCA 

GAGAACATCATGGAAAGACATGGAGCTTAGTATATGCAAATGTTGAGTTACTCAGTGTAA 

TGTGTGAAAAAGGAGTTTCATAAGTTTCGGTCAGGGAAGAAGGCAGGGTCAAAATTTCTG 

CTTGAGAGTTTGGGAGCTTGGGGAGACTTTAAACAGGGAGCAACACAGCGCCTCTGTACC 

TGCCCG 

Sequence 2122 

CGCCCGGGCAGGTACTAAMCTATTCCAGGAAGACTTTAACTTTTGTTGATTCAAAAGGA 

ACATGGTGCATCATTTAGCACCGTAGGGTCATCGTTATGATAATCCATTT7TATTACAGT 

CCTGTGGTTCTCATCTACCCTAATATCCAGCAAGTGGTTCATTGCAGATTCTGATTTATT 

CCATCCTACCTCAACCCTATTACCAACTATCACCACCCACCACCCGCCACCCAAACTGGA 

AGACAACCANAGATAAATCCTCAAGAAA 

Sequence 2123 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACCTGTCCATGTGATGAAT 

TAGAACATGCTTTTCCACCTATAAACTACAACTTMTTTGCATAACTTAATATAAGTAGT 

ATTMTGTAAGTGTTATACATATTTCAGGTTATMTTTTGATTACAATTTG 

ATACAATTGGTTTAGTTATAAAATTACACTACTTACTTGCTACTTTATAACTGCTCACTA 

CCTCCTTAGTGGAACCCAGATTTTCTCTATGATTTGAGAAAAATTACTCCACATGTTGCC 

TAGTTATCAAACTATATTGGGAAGCAGAAAAAGCTATGAAAGGCTTCAAAGCTTCTGAGA 

ACCTTAGCACGAAAAACTCTTCCAAAAAGCTTTTTTC^ 

ATCACAT7TATAATCACTCACCAGCAATCTCTTTTCCATGCTTG ATTTG TGAGG CTTATT 

GAAAACAATGGCATTGACCGTTCATTCTGACTCCTTGATTTCC 

Sequence 2124 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTAAAACTATTC 

CAGGAAGACTTTAACTTTTGTTGATTCAAMGGMCATGGTGCATCATTTAGCACCGTAG 

GGTCATCGTTATGATAATCCATTTTTATTACAGTCCTGTGGTTCTCATCTACCCTAATAT 

CCAGCMGTGGTTCATTGCAGATTCTGATTTATTCCATCCTACCTCAACCCTATTACCAA 

CTATCACCACCCACCACCCGCCACCCAAACTGGAAGACAACCAGAGATAAATCCTCAAGA 

AACTTC l l l l l AAGAGGGTGGTTAACAAAGTTAAATTGGGTAGAAAACAGATGAAGCAAG 

TTTTAATCTCTTTGCTGAGTAAGAATAGTGCCTACATGGTGTGGGTGATCATGTGTAAGA 

CTCTGATGGGCAAACCTGGGGCCCATTAGGAG 

Sequence 2125 

AGGGCGAATTGGAGTTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGAATGAGAGC 

CTACAGTTGAAAGCACACTTGGATGAGGCTCGGACCCTGCTTCATGGCACCAGAGGAACC 

CACCAGCACCAGGTTGAGCTTATTGAGCGAGATGAGGTTAGTCTTCATAAGAAGCTGAGG 

ACTGAAGTAATTCAGCTAGAAGATACATTGGCCCAGGTCCGCAAGGAGTATGAAATGCTG 

AGGATAGAATTTGAGCAGACCCTTGCTGCCAATGAACAAGCAGGCCCTATAAACAGGGAG 

ATGCGCCACCTCATCAGTAGCCTCCAGAATCACAATCACCAGCTGAAA 

Sequence 2126 

CCGGGCAGGTACGCGGGAATGAGAGCCTACAGTTGAAAGCACACTTGGATGAGGCTCGGA 
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C CCTGCTTCATGGCACCAGAGGAACCCACCAGCACCAGGTTGAGCTTATTG AG CGAG ATG 

AGGTTAGTCTTCATAAGAAGCTGAGGACTGAAGTAATTCAGCTAGAAGATACATTGGCCC 

AGGTNCGCAAGGAGTATGAAATGCTGAGGATAGAATTTGAGGAGACCCTTGCTGCCAATG 

AACAAGCAGGCCCTATAAACAGGGAGATGCGCCACCTCATCAGTAGCCTNCAGAATCACA 

ATCACCANCTGAAAGGGGAGGTCCTGAGATATAAGCGGAAATTGAGAGAAGCCCANTCTG 

ACCTGAACAAGNACACGCCTGCGTTANT 

Sequence 2127 

GCCCCGCGTGACGAGCGGACTTGCGTCCGGCGTATCGGCGATTTCGCGGTTCTCGTTCAG 
CAGGGAAMCAGGCGTTCCATGTCGGCCAGGCTTTGCTTGATTTCGCGGTAAATGACGCC 
CAGGAAATTGAGAGGGATGTACAGCTGGATCATGAAGGCGTTCACCAGCACCAGGTCGCC 
CAGGGTCATGCTGCCGTTGATCACGCCCACGGTGGCGCGCCACAGGATCAA 
Sequence 2128 

GGGGCCATTGAGACTGCCATGGAAGACTTGAAAGGTCACGTAGCTGAGACTTCTGGAGAG 

ACCATTCAAGGCTTCTGGCTCTTGACAAAGATAGACCACTGGAACAATGAGAAGGAGAGA 

ATTCTACTGGTCACAGACAAGACTCTC7TGATCTGCAAATACGACTTCATCATGCTGAGT 

TGTGTGCAGCTGCAGCGGATTCCTCTGAGCQCTGTCTATCGCATCTGCCTGGGCAAGTTC 

ACCTTCCCTGGGATGTCCCTGGACAAGAGACAAGGAGAAGGCCTTAGGATCTACTGGGGG 

AGTCCGGAGGAGCAGTCTCTTCTGTCCCGCTGGAACCCATGGTC 

Sequence 2129 

CCGGGCAGGTACGAGATGCCACGGCACGAAGTCTACGTTCTCCTGATCCGAAACATCTTT 

TTGAAAATATCAATCATTGGCATTCTTTGTTACTATTGGCTCAACACCGTGGCCCTGTCT 

GGTGAAGAGTGTTGGGAAACCCTCATTGGCCAGGACATCTACCGGCTCCTTCTGATGGAT 

TTTGTGTTCTCTTTAGTCMTTCCTTCCTGGGGGAGTTTTTGAGGAGAATCATTGGGATG 

CAACTGATCACAAGTCTTGGCCTTCAGGAGTTTGACATTGCCAGGAACGTTCTAGAACTG 

ATTTATGCACAAACTCTGGTGTNGGATTGGCATCTTCTTCTGCC 

Sequence 2130 

AGGTACAAGGACTGCATCGAGTCCACTGGAGACTATTTTCTTCTCTGTGACGCCGAGGGG 

CCATGGGGCATCATTCTGGAGTCCCTGGCCATACTTGGCATCGTGGTCACAATTCTGCTA 

CTCTTAGCATTTCTCTTCCTCATGCGAAAGATCCAAGACTGCAGCCAGTGGAATGTCCTC 

CCCACCCAGCTCCTCTTCCTCCTGAGTGTCCTGGGGCTCTTCGGACTCGCTTTTGCCTTC 

ATCATCGAGCTCAATCAACAAACTGCCCCCGTACCTGCCCG 

Sequence 2131 

ACACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCGTTCCTCCAGT 

GCCCAGAGATGCTCTCCGCACCAAGCCACAGATGTGGAGGAGGCAGGTAGGGGGTCAAAG 

AGGGGTGGTATCGGTTATTNAGGAC I I (TTTTTn I CTTAAATATCCTGTGCTTCTTTCA 

ATCATTTGMGGTAAAAACCAGCNTCCCTGTGAGTGGTAANCTGATTTTTAGGTTNTTAT 

GAAGGTATTATTTTCTGNGTAGATAGTNGTTAACTTGGTGTCCTTGCANGGTAAGACGAT 

CAGCGAAGCTTTCTGTTCCACCATCTTGCTCCTCTCATTTTANACCTAATA 

Sequence 2132 

CCGCGGTGGCGGCCCGCCCGGGCAGGTACCAAAGAACAAACGGAAGCCTAAAGATGGACC 

ACAAACCTTTGGATATCCAGGATAACTGAGTTAAAAACCTACCACACGGTTAATAAAATA 

ATGATGTCCACTGAGTCCATGACCACAAAACAAATCACCAGCAGCAGGATGGCATGGGCG 

GCCCTTTTCGCTGGGGAGGGACTGGAGCAGAGGCTGGGGCTGTGAAGGTGATGGGATCAC 

CTCTGATGGTTAAACAAGAGTACCT 

Sequence 2133 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCTCGGTTGCAAGCACAAGC 
AAATGTGCCAGGGTGGTTGATGCAGCTGTGGTCACAGGTCCTATCCAAAGAGCACTCATC 
CACATCTTGGCMGACTTCTCATCTGTTAATAATTTAAATCCTTTCTTGCAGCCCGCAGT 
CAAAACTGCCCACCGATGTTTTTGCAGAAATGATCACAACCTCCATTGCGGGTCCCCGCG 
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TACCTGCCCG 
Sequence 2134 

GGTACGCGGGGACCCGCAATGGAGGTTGTGATCATTTCTGCAAAAACATCGTGGGCAGTT 

TTGACTGCGGCTGCAAGAAAGGATTTAAATTATTAACAGATGAGAAGTCTTGCCAAGATG 

TGGATGAGTGCTCTTTGGATAGGACCTGTGACCACAGCTGCATCAACCACCCTGGCACAT 

TTGCTTGTGCTTGCAACCGAGGGTACCT 

Sequence 2135 

CCGCGGTGGCGGCCGAGGTACGCGGGATGGAAAAATCACACTTCACAAGAGATCCATCCC 

AGCTTAAAGGTGTCCTTGTTCGAGCATCACTGAAAAAAAGCACAATGGGATTTGGTTTTA 

CTATTATTGGTGGAGATAGACCTGATGAGTTCCTACAAGTGAAAAATGTGCTGAAAGATG 

GTCCCGCAGCTCAGGATGGGAAAATTGCACCAGGCGATGTTATTGTAGACATCAATGGCA 

ACTGTGTCCTCGGTCACACTCATGCAGATGTTGTCCAAGATGTTTCAATTGGTACCTGCC 

CGGGCGGCCGCTCTAGAACTAGTTGGATCC 

Sequence 2136 

CCGCGGTGGCGGCCGGGGGCCATTGAGACTGCCATGGAAGACTTGAAAGGTCACGTAGCT 

GAGACTTCTGGAGAGACCATTCAAGGCTTCTGGCTCTTGACAAAGATAGACCACTGGAAC 

AATGAGAAGGAGAGAATTCTACTGGTCACAGACAAGACTCTCTTGATCTGCAAATACGAC 

TTCATCATGCTGAGTTGTGTGCAGCTGCAGCGGATTCCTCTGAGCGCTGTCTATCGCATC 

TGCCTGGGCAAGTTCACCTTCCCTGGGATGTCCCTGGACAAGAGACAAGGAGAAGGCCTT 

AGGATCTACTGGGGGAGTCCGGAGGAGCAGTCTCTTCTGTCCCGCTGGAACCCATGGTCC 

ACTGAAGTTCCTTATGCTACTTTCAC 

Sequence 2137 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCAAGCAGAGGAGTTCACTCTTCTTCCCTGTTC 

TGAATGCAGCCATTTTCAGGTCAAAGTCAAAGAAACCCTTCAGTTCCTGCACATGGGTGA 

GATCATAGACGGTGTAGACATGAGAGCTGAGGTTGGAATTCTTACCCGGAATATTGTGAT 

CCAAGGAGAAGTGGAGGACTCATGCTACGCAGAAMTCAGTGCCAATTTTTTGATTATGA 

TACCTTTGGGGGACACATTATGATAATGAAAAATTTTACTTCAGTCCATCTTTCTTATGT 

GGAATTGAAACACATGGGTCAGNAGCAGATGGGGCGATACCCTGTTTCATTTTCACCTGT 

GTGGGGGGACCTTGGGATTATAAAGGGAGG 

Sequence 2138 

CCGCGGTGGCGGCCGAGGTACAACAGGCTTGTAGTTCTTTGTGAAGGATGCAGGCAAGAG 

GGATGTCCAGTTTAGACGGGCCCTGACAGCAGCAGCCTCTTCCACATCAAGTGAAGTGTG 

ATTGTTMCGATCATTGCCTGCACAAAGTATAGTTGCCTTTGAAGTAGTCCACCCAATAC 

TTCACCATGACATAGCGGGGCAGGATTGGGGTGTTCACATTTTGCTGATAAGATGGTGTG 

TTAAAATCTAAAGCTTGAAGAACTGGATAAATTAAAGCTTGTAGTTTGAGCTTA I I I I II 

AGGCTGGCATCTTGAGTAAACTGTTGTCCAAGGGCAGCAGCCAGATTTCCACCAGCACTG 

TCACCAGAAATGCAAATTCTGCCTGGATCAACCATATACTTCTGTAAGACTTCTGGCTTC 

AGGAAATACTTTGTGGCCCGTC 

Sequence 2139 

ATAGGGCGAATTGNTNCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGACAGGCCATCTC. 

GCTATAGGAAAGGAAAGTGGAACAGCATTCATCCTCAACATTTTTACGAAGACAAAATGA 

AGACTGGAGTAGAAGACTGATCAGTGCAGGTGTAGCATAAAAGTGTAATCCTGGAAGATG 

TGGTGTGAGAAGCATCATCAGCATCATGCTATTACAATCCCAAACCATGGGGGTTTCTCA 

CAGCTTTACACCAAAGGGCATCACTATCCCTCAAAGAGAGAAACCTGGACACATGTACCT 

GCCCG 

Sequence 2140 

GTAATTTAATCTTCACATCACTTTTACTTAGACCTTGATGAATGCTCCCAGTCCCCGAAA 

CCATGCAACTACATCTGCAAGAACACTGAGGGGAGTTATCAGTGTTCATGTCCGAGGGGG 

TATGTCCTGCAAGAGGATGGAAAGACATGCAAAGGTGAGTGAAGAGTCTCTGACAATCGT 
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GACCCTAGTCTGTATCTTTTCTTTCAAATTTAAATATTAGATTATCAGGAATTATGCCAT 
GTGCATCACCTTCATTAAAAGGTCCCATCCTAGTGGCCATGTTATTGCTCATAAGAAAAA 
CTCTACCAATATGCTCATTGCAGGAAGCAGAGAGGATTAAAAC 
Sequence 2141 

GTAATTTAATCTTCACATCACTTTTACTTAGACCTTGATGAATGCTCCCAGTCCCCGAAA 

CCATGCAACTACATCTGCAAGAACACTGAGGGGAGTTATCAGTGTTCATGTCCGAGGGGG 

TATGTCCTGCAAGAGGATGGAAAGACATGCAAAGGTGAGTGAAGAGTCTCTGACAATCGT 

GACCCTAGTCTGTATCTTTTCTTTCAAATTTAAATATTAGATTATCAGGAATTATGCCAT 

GTGCATCACCTTCATTAAAAGGTCCCATCCTAGTGGCCATGTTATTGCTCATAGAAAAAC 

TCTACCAATATGCTCATTGCAGGMGCAGAGAGGATTAAMCCCMGTTAAAACAGTTTG 

CTTTC TTATAGAAAACTG CC ATAAATTT 

Sequence 2142 

TATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGAGTCAAGGACGTCTTTAA 
CGTCATGGACGGCTCCTAGGACAGCCACAGAAAAAAATGGGGTAAAGGGGTAGGAAGAAA 
AAAGGAAGGGAAAGAGAGCCAGGGAATTGGGGAAGGAGAGGTAGGGGAAG 
Sequence 2143 

AATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTAGACGGGGCTGTAATCTAGTAACTGT 

ATGTATTTTAGTTCTTCAGTCCCTGGGAAGGAGACAGGAGAAGGTGGGAGGGAGGAAGGG 

GCCAGCTGAAATGGAAACAGATCCCTGATCCGGGGCGGTCAGTGGAACCCTTTCTTGGTG 

TGCCGAGAGCCTGTGTATTTCAGAGGCAGCAAAAAAGTAAAAAAAAAAAAAAATTGATCT 

TTGTTTAGATTAACAGACCCCTGACTATGAAGAAGGAAGGCATCCATACCAGAAACCGAN 

AAATGTCTAGCAAATCCAAAAAGTGCAAAAAAGTGCATGACTCACTGGAGGACTTCCCCA 

AGAACAGCTCGTTTTAACCCGGCCGCTCTAAGAACTAGTGGATCCCCCGGGC 

Sequence 2144 

CGAATTGTTCTCCACCGCGGCGGCGGCCGCCCGGGCAGGTACTTTTCTTGAACTCAGCAT 

TGCTGCCCTNAACCCTCCCCCAGCTGCCTTCCCTACTTTGGGAGCCTCTATTTCTAGAAC 

AGGAAAACGCTAGGCTAAATATATGTCCTAATGGTCTCTTCCATCGGTGTGAAGGGAAAN 

NAAAAATGGGCCCAATGTTGTTGCAAGGAATTAAGGGAAGGTCCNTGCCTTGCCACCCAG 

TGGGGNTTAGGAAATCCATCTTTTACCGGGNCTCCAGTAGTAAGCCTGGTTGAACANGGG 

TTCCCAGGACCATCAAGACAAATTCCTGGTTTTCCCTGGAGGGAGGCNTTTGNTGGGGGG 

GCTCAAATATTGGNGGGGAATGAATNCTTGGGGTTCCCTTAGGACAAAAAGGGAGGCCCC 

NCCTAAACCTTGGAATGGGGGGGGNGGTNCACNTGGGNGNATTANGCCAAGGGGGGGGGC 

CCCAATGGGGCTTGGAACCTATTTTTTTGNTTCCCTNGGGGGA 

Sequence 2145 

GCGGCCGCTCCGGGCAGGTACGCGGGATACTTGGGTTGACCCTAACCAAGGATTGCAAAT 
TGGATGCTATCAAGGTATTCTGNAATATGGAAACTGGGGAAACATGCATAAGTGCCAATC 
CTTTGAATGTTCCACGGAAACACTGGTGNGACAGATTTCTAAGTGCTGAAGAAAGAAAAA 
CAACGTTTTTGGGTTrTGGGAGNAGTCCCATTGN^ 

AGNCTTACCGNGCCAAATTCCTTGTAAACTTTNCTGNAATNCATNGTNNCTNTGGAATTG 

T TGCT ANNCTTGGGNCAATTTTNCTNTTCGGAACTNTCCTNCTTNCCANGCTCNGNAGAC 

NTTTTNTCNNNAAGGAAANCCAATTNTACCAATTANTTCNAACCTTGGCCAAAAAAAAAA 

TTATGGTCAATTTTNGCCCATTAACCAATTNGGGNAATTCCANNGGGGCTCCAAGGTTGG 

GGGAAAAAATTGGTTTANANAAGAAAAAGGGGCCACNCTTGGGAAAGGCCTTTGGAATTG 

GGGGGGGNTTACAAAAAAATGGTAAAAGNGGGTNGNAAAATTTTNNAACAGGGGGCNTTN 

NNAAAGGGGGAA 

Sequence 2146 

AGGTACGCGGGAAGCACATATTTGGTAGCCTTTGCATACAAGAATGTGAAATTTGTTCTC 

AAGCACAAAGTAGCACAGAAGAGGGAGGATGCTGTTTCCAAAGAAGTGACTCGAAAACTT 

TCTGAAGCTGATAATAGAAAGGATGTCCTCGGTAAGGGGAGAAAGATTGAAAAGNAATCC 
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TTGTTGTNAAGAAAGAMTGAAAGTTC 

TTTNCTATAAACAANCACTCTTGTTTCCCTGGGTCCGATGGGCTCATNTGATTGCTTTCC 

TTCTTTCATTANTTGNAAGGAACCTTTNAACTCCCACAAGTTNNAAACCTMCANNATTT 

GGTTCCATTAAAGGTNGNCTTTTCAATTCCAGGGGAACCTTCAATNCGGNCCCCTTCCCT 

NGATNCTAACNTTGGGGCCTTCCCCAAAAANTTAGGTAACTCAATTGGTTCAAGANCTTT 

TCNACCCCCCCCCTNGNGCCTNTTTTGATTGGTCCTTAATTGGGG 

Sequence 2147 

AGGTACAGAGGCGCTGTCTTGGCTCACTGCAACCTNTCCCTACCAGCTTCAAGTGGCCCT 

CTCATCTCAGCCTCCCAAGTAGATTGGGATTACAGACACGTGCCATTATGACACGGCTAA 

MM IGTAI I I II IGTAGAGACCGGTGGTTTTAATACATGTTGACCCAATGCTGGGTTCT 

GAGNCCACACATNATGCCCGGCCCTTGATAGTCCTTATTTTCTATTCAMGTCTTTCTTG 

GGNGTATTNGCCN (MINI NTCTTTAATTAAAAAAAAAANAAACTTTTCCMGGTTTTG 

GAAGNGTGCCTTCNCANAAAAAANGGTNGNAAATTTACCGAANAATAAGNTTACANTAAT 

TTTTTAAAAAAAATTTTTTAATTAATTTCTTAA 

TTAAANANGTGNNCTTTGNTNTTNGGGGAAAAGGAAATTNTCCCTCGGTTNATTTTTGGG 
CCAAAAAAATTAATTTTNTTNCCTTGGCCTNAATTAAANTNTNGGTTATNGGGGTTGG 
CTTTN NGTAAAAAATTTTTATTTACC N CC AATTAAT ATAN AAATAAATTAACCCCTTTCT 
Mill! IGGGAACCTTANAANAAAGNGGAACCNTTTGG'I Mill 
Sequence 2148 

CGGCAGGTACCACACACACACAGGCACACACAGGCACACGCAGGCACACACGCAGACACA 

CACACATAAACACACAGAGTTCACTAGTCCGAGTTACTGATTTTCTTAGGATTCTCAAAG 

TGACAACACCGGAAACAAGGTAATTCATGTTAAAACACAAGGGTTATATCAGTAAGAGAT 

GGGATCCCCGAAGTAAACCGTGGAATTTGAATCAAGCTTCGAAGAGCTAAAAAAAGAAAT 

TGGAGTTTCAACATTCACCTTCTTGAATCCTTAAGAAATACAGAAGTTCAAAATAGAAAA 

CATTACAGTTTCAGGATACAAAAGTAGAAACATCTGAGA7TA 

Sequence 2149 

CGTGGTCGCGGCCGAGGTAC n ITTTTTTTTTTTTTTTn I GAACC AAAATCATAG ATTT 

AGAGTTGGAAGTGACCTTTGTGATCATTATTTCAATCCTTTCAGAGAATTCTCTCTGCAG 

GAATATCTATGTAGGACTTGGATTTGACCCAAGCACATACAATGTCTCAGAGGGCTCTCT 

GCTCCTGGCCAAGAGCATCAGGTCTTGTTTTTAATATAATCAGTGTTGTAAATTATGGCA 

CAGAGAACAGGCTCATTCAGCTGTGGAATGAACTGAGTTAAATGGACTTGTAAACATCCC 

TAAATGAAAGAATTAAAAAGTCAAATGAGGAATTATATAGCCGTCCCTTGG 

Sequence 2150 

CCGGGCAGGTAC I M I II M M M I M M I M I I I I CACAACACCACTGTAAGTCTTTGC 

CATTCCCTTGTTCTTAAAAACATTGCTGTTTCTTTTAATGCATGGAAATTCTAAAT^ 

CAGATTGGGAATTTAAAATAGTAAATAGGGG AAATAA I » I I I I I M I M I I I I IGANACA 

GTCTTGCTCTGTCACCCAGGCTGGAGTGCAGTGGCNCAATCTCGGCTCACTGTAACCTNT 

GCCTCCTGGGTTCAAGTGATTCTCCTGCCTNAGCCTCCCGAGTAGCTGGGATTACAGGCG 

TGTGCCACCAGGCCTGGCTAATTT 

Sequence 2151 

AGGTAC I I I I I I I I I I I I I t I I I I I I U 1 1 I i I I 1 AN AAATTGCCTCANACAGGG ATTGA 

TGTCAGGAGGCTTCAAGGAGAGAAATATACATGAATGGGAGACCCCAGGTTGCAGGCAAC 

AAGAGTCACAGGTATGGGAGAAACCAGGAAGTCTGTGCCATGTTAACTTTGCTCATCTCA 

TCGTGCTTTGTACCTGCCCGGA 

Sequence 2152 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGCG 
GTCCCAGATTTGGTGGGACTCTCCTCTGGTATCAGTTTTCAACGCCACTTCTTGTTTGGA 
GTCATTCCCAGTGCAGTCATGGACTCTATGMCAAATACGGTTATTTAMTGTATTTGTC 
CAGTATGAGAATCAGAATGAACTAGTAGAGGCTTTACAGATCAGTTGGTCTGACCCATTT 
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TACAGATGAGGAAACAGGCCTGAGAAGATGGAGGGAGTTACCCACAATCTGAAGGGGCTC 
TACCATGACTAGACCCAGGTCACCTGCCTCCCAGGCCTGGCTCTTTCCACTCCGAGGTGC 
TGGCTCACCACAGACTTATTCTTTAATGGAATTTTGGAAAGGCCTCACTCCAGTGGACTC 
TTTGGGATCT 

Sequence 2153 

CTNCTTAGGGCAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGTT 

ACTAAGGCAGCAGAGTAGAAAGATGAAGTGCAAGGACCTGGAAATCAGACACACCTGGGT 

TCCTGGTAAGCTTCACTGCTTCCTGGTCTCTGATATCAGCTCTGCCCAAATTTCAGCTGT 

CACAGAATGAGTGATCCTCATAGTATAATGGAAATGCATCATCTCAGTTTCCCCATGGCC 

TGAATATCCATACAAMTGAGAGCCCGGTAGAGAAAMCAGAGGACGTGAGGAGTTTTTA 

AAAATTGAGGAAAAAGTGGGAAGGGAAAAAGTAAGGGATGAAAGGTGAAAGGATGAAAGG 

GAATATGTTCAGGAGATGCCTAGGCACTTGGCATCCCCCACCCACACAG 

Sequence 2154 

ACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTAGAA 

AAATGGTTATTTCTAAGCCTCAGTGAAACGTGTCCAATGAAAGAAAAGCCTGAATTACCA 

ACATGGGTTAGCATCACTCACACGCAGGCAGAGATAACAGCAAGTGTCATTTAAGACTTT 

TCTAAAACTTCATGCCTTAAGGAGTAAAGATCCAGTTCATATAGGGTTAGCAACTGCCTT 

CAGCGAAGAGGAAAACAGTAAATAATTAACAAAAAGATCAAGTCAGGAGATAAGGCCTAA 

GAGAGACAGGACTGAAAAGGTTAGAGGGAGGATAGGTGAGGGCTTGGGAGACCTTGGAAA 

AAACAGNAGGGAGCCGGTGGAAGAATTAAGGTGGGGAGGACAAAACGGGCACCAC 

Sequence 2155 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCGCGGGGGAGATGCTGCCACCTAGGTTACTTG 
TAGGACCCTATACGGCAACCTCCTTTGCCAGGAACTATTTATAAACATCCTGCAGGAAAA 
TGAGTCAAGGAAGCTTTTCTTTTGAGCTATTTACAGCTTTTAGCAATTGAGTAAAG 
CTCCTGTGAACAAAATTTGGAACATATTTGTTTCTCTC 

GAACTAAGTTGTAGCTGAGACCAATGATGCTGGACCTCCCTCAAAAGCTGCATTTTCTGG 
AATTTCTGGAAGGGCAAAACTGTCTGCCCTATTATTTGTACCCTCGGCCCGCTCTAAGAA 
CTAGGTGGGGATCCCCCCCGGG 
Sequence 2156 

CCGCGGTGGCGGCCGCCCGGGCAGGACGACCAGTGTGAATGAATGGAGCAACCAAGAGGT 

TAGGCAGTAGGAGATGAGGCCAGAGAAGAAAGAGGGTGTGGTAGCCCCCTAATGCTGGCA 

CTTACTTCCACTGGATCTAAAATCCAGCACGGAATTATGCTGACCACATCAGCCTAACTA 

ATATTGATCTAAGTAACATTTATGTTCTAGCACCTAAATCAGGAGCCTGCAAAGGATAGC 

CCACTGCCAAAATCTGGCCCACTCTCCAGTTTTTGTAACTAAAAATGTTATCAGGAACTT 

CMGCCCAAGGTCCATTTCATTTTACCAAANCTGGTCTTACGGGTGGGCAGGAGACCAAT 

TACCGGCTCACAAAAGGNCTTAAAAATATTTTACCTCTCTG 

Sequence 2157 

CCCGCGGTGGCCGGCCGCCCGGGCAGGTACCTGTTTAAAAAATTCTAGAGAGAAAACTAT 

AGAGAGTGTAGTGTGTGATTCAGACATACTAACCATCCTCACATGGACTCACTCTCTAAA 

AACTCCAGACATAGCCAGAAAGAGGCAGAACAGCCACAGCAGGAGAGATCACTGATGCTC 

ACTGAGGCAGAGGACTTTGTCATAACCTGTAGAAAACATTTTCCCAATGTCTCTTTACCA 

CATTGGGATTATAAATTTTCTACATTCTTCTTCTTCTTTTACTGGGGCATTNTATANC^ 

NCTTGGAAATACANAANATATGGNGAANAATTAATCNTTAAAAATNTTAAATTTTCACCC 

ATCCCTTGTNCAGGACAAAATTAACATTCATTTATGGAATC 

Sequence 2158 

TACACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCCAGCTCTGTCTCTCCTCGGAG 
CAGGATTCCATCTTTTCCATGGCTGGGGCCCTTGACACATGTCACTGTGCACTGCTCTTT 
TCGCTGGGATTTCCAAAAGACCTTGGCTCAGGTGTCACTTTTGCAGAAAAGCATTTACTG 
ACTGTTGACTTGGCCAAATCCCTGTGCTGTGTTTCCTAGCACGGTGAGTTGCATTTTCAC 
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ATGMCTCATGCCTACCTTCTCTTGTTTTATAAGCTTCATGAGGCAGGTACCTCGGNCCG 
CTCTAGAACTAAGTGGGATCCCCCGGGCTGCAGGNAATTCNATATTNAAGCNTATTGAAT 
NCCCNCNAACCTTAAGGGGGGGG 
Sequence 2159 

C CGCG GTGGCGGCCGAGGTACATTTTACAAAATTAAAGTCACTTTGTAGGTATATAAAAA 
GTTTTAAAAGTTGGTTTATTAATGAGAGTAGTGTACT 

TAAATATTAACCAGTTTAC T I I I I I f [ AAAGTCAATTTACTTTTCCAGCCTGGACAACAT 

GGTGAAACCTCGTATCTACAAAAAATAGAAAAATTAGCTGGGCGTTGGTGGCTCACGCCG 

GTAGCCGTGTTTCTTTTGGTGGTGATACAAATGGTTCTAAACTGACTGTGGCACTGGGTT 

TATTACAACCTCTGTGGACTATTACCTAAAAAATTCACCTGGAATTTGTACCTGGCCCCG 

GGGCCGGCCCGCTCTAGGAACTAGTGGGATCCCCCCCG 

Sequence 2160 

CCGCGGTGGCGGCCGAGGTACGCGGGTTTCCTCAACATGGCTGCGCCCTTGTCAGTGGAG 

GCGGAGTTCGGGTGAGTCACAGAGCTGGGGCGCCGTGGGGATGGATTGAAGTCGTCGGGC 

CCAGAATTCCTTTCCTTCTGCCGTGGGGCCTGACAnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnAGGGGTTAAATTGCCGCCGCTTGGGCCGTAAT 

CMTGGTCATAGGCTGTTTTCCTGTGGTGAAAATTG7TATCCCGCTTCAACCAATTTCCC 

ACCACCAACATTACCGAAGCCCGGGAA 

Sequence 2161 

CCGCGGTGGCGGCCGAGGTACGCGGGGGAAAATGTCAGAGATGGGAAGAAACAAGAACTT 

TGACATGCTTGGTGTTCTTGCCCAAGCTTTGAAGMGTTTACAAAGTCTATATGTCAGAA 

TACACATTTCCCACCTTGCCCAACAGTAGAAAAACATAAGAAGAGAAAAACATTAAAAAA 

TGACAAGGAAGTTAATGGAAGTCAGCAATGTGATGGTGTTTGGAGGTGGAGCCTTCAGAA 

GGTAATTAATGCCCTTGTAAGAAGAGGCCAGAGAGCTTGCGCACCTTCTTCCTGCCATGT 

GAGGAGCCAAGAAGCCGGCTGTCTGCAACCTGCAAGAGGACCCTCACTAGAAGCTAGCCA 

TACTGGCATCCTCATCTTGGCTTTCCAAGTTTCCAGAACTGTGAAGAAGTATATGTTT 

Sequence 2162 

CCGCGGTGGCGGCCGAGGTAC I I I I I I I I I I I I I I I I I I I I I M I GAACCAAAATCATAG 

ATTTAGAGTTGGAAGTGACCTTTGTGATCATTATTTCAATCCTTTCANANAATTCTNTNT 

GCAGGAATATCTATGTAGGACTTGGATTTGACCCAAGCACATACAATGTCTCANAGGGCT 

NTNTGCTCCTGGCCAANAGCATCAGGTCTTGTTTTTAATATAATCAGNGTTGTAAATTAT 

GGCACANANAACAGGCTCATTCAGCTGTGGAATGAACTGAGTTAAATGGACTTGTAAACA 

TCCCTAAATGAAAGAATTAAAAAGTCAAATGAGGGAATTNTATAGCCG N CCCTTGGTATC 

CTCAAGGGATTGGGTTCTAGGGACCCCTGGGGGATACC 

Sequence 2163 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGCAGTGGGAAGCTCGCAGCAGCTGGG 

GAGGAGCCAAAGCCTCGGCGCTCACCTAAGCCGCAGGGAGATACACCCAACTGGGAGATG 

AGGAAACAGCAACCCAGAGAGGAGAACTAACCCACACAGGATCATTTCGCGAAGGAGCAA 

GGCTGAAGAACCAGACCTGGACTTTCTTAGGACAAACTTACTGCAGCTTGAAGGAGCCAA 

C 

Sequence 2164 

NCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGTGAGTATTGCTTAT 

ATGTCCTTATGGGG ATGG GTGCCACAAATAG AAAATATGACC AG ATCAGG G ACTTG AATG 

CACTTTTGCTCATGGTGAATATAGATGAACAGAGAGGAAAATGTATTTAAAAGAAATACG 

AGAAAAGAAAATGTGAAAGTTTTACAAGTTAGAGGGATGGAANGGTATTGTTNAATNNTT 

GAATGCCATGGANNGGCCNAAATNGN I I I I It I I NGGCCNCCAAAAACCCCCCCCTTTAA 

ANNTTNTTTTTGGGGACNTTTTGGNAAANGGTTTTTAAAAGTTT^ 

CCTTTTTTTTTTn ICCNCNCCNCCAAAAGGGGGGGTNTTNTTTTNNAAAAAAAAAAAAA 
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AAAANANGGGGGGGGCCCCCNCCCTNTTTTTNGGGGGGGGGGGGGGNGGNAGN ft II I TT 
111)1 I AA 

Sequence 2165 

GGTTAAACGAGCTGTTCTTCGGGAAGTCCT 

TGGATTTGCTAGACATTTTTCGGTTTCTGGTCTGGATGCCTTCCTTCTTCATAGTCAGGG 

GTCTGTTAATCTAAACAAAGATCAA I I I I I M I I I ( I I I M I I ACTTTTTTGCTGCCTCT 

GAAATGCACAGGCTCTCGCACACCAAGAAGGGTTCCACTGACCGCCCCGGATCAGGGATC 

TGTTTCCA7TTCAGCTGGCCCCTTCCTCCCTCCCACCTTCTCCTGTCTCCTTCCCAGGGA 

CTGAAGMCTAAAATACATACAGTTACCAGATTACAGCCCCGTCTAGTACCTTCGGCCGC 

TCTAGAACTAGTGGGATCCCCC 

Sequence 2166 

AGGTACGCGGGAAGGAGATGTTGCCACCTAGGTTACTTGTAGGACCCTATACGGCAACCT 

CCTTTGCCAGGAACTATTTATAAACATCCTGCAGGAAAATGCAGTGAAGTAGAAGAGACA 

GGGATATCCCAGAAGGTTATGCAAAACATCAAGAGAAGATGAGAGGAGTCTATATGTCAG 

AATACACATTTCCCACCTTGCCCAACAGTAGAAAAACATAAGAAGAGAAAAACATTAAAA 

AATGACAAGGGAAGTTAATGGAAGTCAGCAATGTGATGGTGTTTGGGAGGTGGAGCCTTC 

AGNAAGGGTAATTAATGCCCTTGTAAGGAAGAGGCCAGAGAGCCTTGTGGCACCCTTCT 

Sequence 2167 

CCGGGCAGGTAC I I I I II I I I i I ! I i i i I [ i I I I I I { 1 1 GGGCAGCTAAAATCTAGGATG 
GNGTTAAGTTTCTTCATTTTGTCMTTATATA 

ATTTCTTGGATGAGATAGTTAATMCAAACTATTTCTCAATATTTGTATACTAAAAACTA 

GTGAAGGTGTTATGTGTTTCAGTATCTTATCTCTTATTTGAACATGGGTTTCTGAAAGGA 

GCCTATATAATAATATAAATGGTATGTANTAAATGAGGCACTGTCTTGGCTGGGACTGCT 

ATAAAAAAATTACCATAGACCATTGGACTA 

Sequence 2168 

GACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCCGGCCGCCCGGGCAGGTACCCTG 

CTGAAAGATTATTTCTAACAGGCTTGTAGAGAAACGTCGGTTCATGTAAATTAGAAATTA 

TGGGGCCACTTTGCCATTCTTCACACCTGCAATGAACAGGTGTTTATCTGCAGTTCTGAC 

TTATCTCTTGAACTCCATTTGCATGTTATAGTGGGATGCAGCTGATGCCCTGTCCAGATC 

TTCTTCAGGCCACTACATCTATATATGCATTCATATTCCAGTGGCTGTGAGTGTTGGCTG 

TTGGTTGACAGAGGAGCTGCATCCTTCTGGAGGAAACTGAACTCAGCTGATGAAAGCCAC 

CTGGTCCTGGAGGTGAAGC 

Sequence 2169 

CGAATACGACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGCAG 

CCATTGTTGCTCCCTTTGCCCTTCAGGGTGTCTTTCCCCAGCATCTGTCCCTGCAGCACC 

CCCACCCCCTCACTCTAATTCTGAGCACTCAAATTGCCCATCAGTCCAAGCCTTCTCCTA 

ATCACAGCCCTCCTACCTCTGAGCCCCTTCATGGAGCCCTGTTCATGGTGCTTGGCATTT 

TATGACAACTAAACCAAAMGCAGTGAGGTTTGAAAAAAAAGAGTCAAAAGTCAATCCTG 

GCAGGGGAGGATTTGCTCTTTGTCTTAAAAAACCAAGAGTACCTGCCCGGGGCGGCCGCT 

CTAAGAACTAGNTGGGATCCCCCGGGCTGCAGGAAT 

Sequence 2170 

GGTGGAATCCGGTNGTTCTTTCCTCACATTTGGGATGAATTGG 

GCATGGTTTCCCTCCCTTTTCTTTCACCCCTCCCCCTTTTTTTCTTTCT 

AGAAGAAAACCTTTCAAAAGGTTTAATGGGGGAATGGGTCCGGGGATTCCTCACCAGGGC 

CTTGGAGGAAACCTTCGGTTTCCACCCTCCCAANGCCANTTTTCATGGAAAAAAAANGCC 

TNGCTTTCCTTTATTTAAATTCAATTACCAAAAACCTTCTTCMCCCCATGGM 

AAAAGMGGTTTTTCAACCAAAAAATTCNTTTTTCAAAAAAATA 

TGG AACCTTTAAN N AAAAACCTTGGCC N NG N NTTN N N ATN ANTG AN AATAN AN N N N N CCC 
AAAAAGGTTACCCCT 
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Sequence 2171 

GTGGGCCGGGCCCGCCCCGGGGGCCAGGGGTTACGGCCGGGGGCCGGCCTCTTGGCAACA 
AGCAAGTCAAAAAGCCCAATCNTCTTTCCGGNCGGGTTGGCCGCCAANGGGCCGGGGGCC 
ATTTCGGCCGCCACCTTCTTAATGGGGCMTTTGGCTTGGGTTTCTTACCCAAACCGTTG 
GMCCAGGGGTNGAATCCAAAGGTTTTAAAAGAAAMGCCTTC 

^ ^ C G AAC C C C TT G G G N TTTAATTG AAAC C AN TG TTTAAAAAAG G TTC AATTCCC T 

Mil IGCCCCACCCGTTGGTTGGTTTCCTCCGTTGGTTCCAAGGAAAAGGAAAAGGAATA 
AAAAACCAACCGGCCCCAATTCCAATTAAGGTTTGGGGAAAACCCCCGGGGAAGGAAAAA 
AMGGG GGGG GGTTAAAAAATTAATTGGTTGGGGGTCCTTTGG I I I I M I I I GGAATTCC 
CCCCCCTTTTGGAATTGGGGGGAATTCCTTTTTCCCAAAACC 

GCCCC MM I GGTTNGTTTNCCCCGGNTTTTGGGGGAAAAACCCCAAATTTT"t I I I I I I I 

GGGGGCCAATTTCCTTNAATTAAGGGAAAAAM^ 

GGAATTGGGAAGGNCCCCCTTTTTTCCTTTGGNAMGAAAA^ 

TCCCCCGGCCAAAAAACCCCCAAGGGGGNCCCCCGGGGG 

Sequence 21 72 

CCG GGCAG GTACTTGTTGCTTTGGTGTAGTGAAATGAATGCCTTGATGAMTTGCATTGC 

ACCATTTTTGAAAGAGAGAATACTCAAACGTGTCACTTCTGTTTCTTGCAAGCAACTGTG 

ATCCTGANCTGTGCCACACTTTCTTGGTTGGGATTATTTCTCGGTTTCTACTTNCCTNGT 

TCTG AN AGTATGTTGGCCAN NGTGTAGTAGTTGAN ACATAGTC N TGN ATGCTCCAC NCAC 

TCTAGGCAATCAACNAGCACCNCAGTAATCTTCAAACGTAATTTTTCCCATTGGGGTACN 

CACCAAGNCA7TCTTCCAATTCTTNTTGGNGACTTGGAATNCTCACGNAANATAGAAACC 

CCCNCTTTACCTTGGANTAGNTTCNCNANNNAGGGCCACCC I UMM f NTTTGAAAAAA 

GGGGCCTTCCTTGGCCTTTTTTGGAACCACTTGNANAGGNTTAATTTTTT 

Sequence 2173 

AGGTACTTTAi I I I l I l I I 1 1 I I I I I I I i I i GGCAGTGGGAGAGTTTmATTCAATTTG 

TGAAGGACAGTTTTAAGAACAAAATGTTANTANACACCTCTTAGAAACCACTGGGTTNGT 

TCATrTGGACCATTTTTATCTGCCACCCCAA 

AAAAAAGGTTAAAAAAATTTNCATTTCCCAAATTAATTTT^ 

NACCCCACNTTAAGTTNTNGCCCNTTAAGNAACCAAAAAAAA 

TTAACCAAAAAAAATTTCCANAAAAAANACCTTTTAAAANTNGGCCAAGG I I I I i l I I TA 

AATTTTAAAAGAAAAGGAAGGTTCCANAAAAAATTTTTCCTTC 

CCTTGGGGAATTTATTAACCAATTAAGGGTGGGGGNTAATTAN 

AGGCCANGGAAAAAAAACCCCCCCCAANAAAAAANAAANCCAAAAAANNAACCCAGANGG 
GGGAAAAAAAANAAAAGGGGAAAAAAAATTTACCCATTTGGTTTNCAAAACCN 
C AAGG NTT ATAAAAAATTAA I Mi l l) I GGAA 
Sequence 2174 

AGGTACGCGGGGACGCTGGGAGGAGATGCTGCCACCTAGGTTACTTGTAGGACCCTATAC 

GGCAACCTCCTTTGCCAGGAACTATTTATAAACATCCTGCAGGAAAATGAGTCTATATGT 

CAGAATACACATTTCCCACCTTGCCCAACAGTAGAAAAACATAAGAAGAGAAAAACATTA 

AAAAATGACAAGGAAGTTAATGGAAGTCAGCAATGTGATGGTGTTTGGAGGTGGAGCCTT 

CAGAAGGTAATTMTGCCCTTGTAAGAAGAGGCCAGAGAGCTTGCGCACCTTCTTCCTGC 

CATGTGAGGAGCCAAGAAGCC 

Sequence 2175 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGTGGCGTAGCATTCCATGT 

TCAGCrTTGACATTTATmCTCATATCMCCCTTTTTACACGTGAA^ 

CTTGAATTCTTAACTGCATGATCTGTGAAACCTGTATTTATATTTTCTTCAGTGGTATTC 

TTfGTCGGTGTGTGTGTCCTAAACAAACCAAAAGAAAACTTTCCAAATCTAAAGTATTCA 

TTCTCCAATTGGAGCAAGAGGAGTCAGTTAGATACTATCACGGCATTCATTTGTGGCTGG 

CTTGTCATATTTACTTATGATTGATAATAAATCTATTTTGCTrTTTAGAGCCTCCCGAGA 
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AGCCATCTGCCTTCGAGCCTGCCATTGAAATGCAAAAGTCTG 
Sequence 2176 

AGGTACGTGGCGTAGCATTCCATGTTCAGCTTTGACATTTATTTTCTCATATCAACCCTT 

TTTACA CGTGA AACACAATCCCGCTCTTGAATTCTTAACTGCATGATCTGTGAAACCTGT 

ATTTATATTTTCTTCAGTGTATTCTTGTCGTGTGTGTGTCCTAAACAAACCAAAAGAA^ 

CTTTCCAAATCTAMGTATTCATTCTCCMTTGGAGCMGAGGAGTCAGTTAGATACTAT 

CAC GGCAT TCATTTGTGGCTGGCTTGTCATATTTACTTATGATTGATAATAMTCTATTT 

TGCTTTTTAGAGCCTCCCGAGAAGCCATCTGCCTTCGAGCCTGCCATT 

Sequence 2177 

AGGTAC I I II II I I I ! M I M I I I I I I II I I I I NACTTGTTAAAGTTCTTTTTACTGNGG 

CTGTTATAAAATTTAAAATTATATGTGTGACTTGCATTTCATTTTTA 

CTCCANA AGGTT GATGCCTGAMCTAGANATCCATTACCTTTCCTCCTCTCTAGGGCTTC 

AAATTAGGTTTTAATTTAANAACACCTTTGCTATATATMTCAATACAAAAA^ 

CACTTATTGTACTTTGCCCTGTTGACACTGACCTCCCATTCTCCACCANAANAAATAATC 

AAACCCATCACCAACAAATCACCCATCCATTATTCCCCGCGTACCTGCCC 

Sequence 2178 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCGCATTCCTACTTCATTGCCCCTGATGTAACT 

GGACTCCCAACAATACCCGAGAGTAGAAATCTTACAGAATATTTTGTTGCCGTGGATGTG 

AACAACATGCTGCAGCTGTATGCCAGTATGCTGCATGAAAGGCGCATCGTGATTATCTCG 

AGCAAATTAAGCACTTTAACTGCCTGTATCCATGGATCAGCTGCTCTTCTATACCCAATG 

TATTGGCAACACATATACATCCCAGTGCTTCCTCCACACCTGCTGGACTACTGCTGTGCC 

CCAATGCCATACCTGATTGGAATACACTCCAGCCTCATAGAGAGAGTGAAAAACAAATCA 

TTGGAAGATGTTGTTATGTTAMTGTTGATACAAACACATTAGAATCACCATTTAGTGAC 

TTGAACAACCTACCAAGTGATGTGGTCTCGGCCTTGAAAAATAAACTGAAGAAGCAGTCT 

ACAGCTACGGGTGATGGAGTAGCTAGGGCCTTTCTTAGAGCACAGGCTGCTTTGTTTGGA 

TCCTACAGAGATGCACTGAGATACAAACCTGGTGAGCCCATCACTTTCTGTGAGGAGAGT 

T TTGTA AAGCACCGCTCAAGCCGTGATGAAACAGTTCCTGGAAACTGCCATTAACCTNCA 

CTTTTTAAGCAGTTTATCGATGGTCGACTGGCAAAACTAAATGCAGGA 

Sequence 21 79 

CGAATNGGAGCTCCCCGCNGTGGCGGCCGACGTACNGTGATCATTACTCCCCAGCGCAGC 

TCAATCCACTTNCAGACTGNTCTAATTGTNGGAATAATTNTCCTTATGTAATTCAATNTC 

CCATCTATTAATTNATGAGATATTTATTTAAAACTGACCGTATTCCAGGAACTGGGCTAA 

ACGAAGAAGAGTAGTGAATTAATACAGAAATCACCCTNGATTTGGCAGAATATTNTCCCT 

GGTAGTGTAGGTCACCCCTATTGTGGTGATAATCCCCCANATCATACAGCANAAGAAACA 

ACACACAGACAAAAACCAAGTCATNCTTCTAGGATGTTTTCATCCMGGATTTGAAAAGA 

TTTAGGTGATCTTCCATTNTTCATCCTATGCAATCTGGGCTATGCTACTACTTTACAGAT 

TGAAAATATATTTAAAAATCACTGAACTTTCAATAGTCACCAATGATCACTGTTCTAGTT 

GCAGTTCAATTAGGTGTTCAAGGAAACACTTATAACTTAMTTTAATGTCTAAATCTTAC 

ACCTTTAATTGGAAAAATAATGTAATMGTGAGCAATTAGATTATCTAGTATTAGAAAAA 

TAACTGAGGATTGGCTGGGCATGGGGGCTTATGTTTGGTATCCCAGCACTTTGGGANGCA 

GAGGCAAAANGGATTGGTTTAAGTCANGAGTTCCNAAAACAACCTGGGCAAAACCGGCAA 

GAACTNCATCTNTTNAAAAANNANNNNAANNNNNNANGGTCCTTGCCGGGCGGGCCNTTT 

ANAACTTGTNGGATCCCCCGGCTNGAAGAATTCCANTTTAANCNTATNGATACCGNGNAC 

CTGANGGGGGGCCCN 

Sequence 2180 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCTGTGAACTGAGGAATTATAGATAAACCTTAG 
GTCAAATCATTTCA CAATTG CATTGGTGGTATTGAAAAATGATGAGATTTCTCTGACAGA 
GAGCTTTGTCCTAGTTTTTGTCTTCATAGGTCAAAACTGGCAATATTCTCTTGTCTGCAA 
GATAAAGTGTTTGTGCTTCTATCACCATATGCATGAACATGTAAGAATCAGATACAATTT 
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CTGCTTCATCAGTTTCACATGTTCATGTTGTCACTGAAAAAATGCATCTACTGTTTATAG 

CTCCCAAGGAGACCCCAAATCC! I i I I I I I I CTTTTG AGATGGAGTCTTGCTCTTGTTGC 

CCAGGCTGGAGAGCAGTAGCGCGATCTCAGCTCACTGCAACCCCCACCTCCTGGGTTCAA 

GTGATTCTCCTGCCTCAGCCTCCCCAGTAGCTGGGATTTACAGGTGCCCGCTACCATGCC 

TGGCTAATTTTTGTATTTTTAGTAGAGACAGGCTTTCACCATGTTGGCCAGGCTGATCTC 

AAACTCCTGACCTCAGGTGATCCACCCACCTCGGCCTCCGAAAGTGCTGGGATTACAGGA 

GGTGAGCCACCCGCGCCTGGCCCCCGTATTCAI I I I I TATACTGG AAAACA I mTTTTN 

GAATTTT 

Sequence 2181 

TATAGGGCGAATNGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTAACTTATCCC 

CAGAGGAAACAGGGTTTATGAGCACTGACAGATGTCTTCCCTGGGCAAAAAAAAAAAAAA 

TAGTATATGTATACACACACACATACACATTTATATTTATATTTCTTAAAGCTTTTAATC 

CCTTTCATTCCCTGATATCTCAGAGATTTCAAATCATTGAACACTGAAGTATATTTTTCA 

GGCCAGATGAAAAATTGTATTAAAACCCTATTCCTGGTCAGGCGCAGTGGCTCACGCCTA 

TAATCCCAGCACTTTGGGAGGCCGAAGTAAGCNGATCACCTGAGGTCGGGAGTTCAAGAC 

AAACCTGGCCAACATGGTGAAACCCTGTCTCTACTAAAACTACAAAAAAATTANCCTGAT 

GTGGTGTTGTGTGCCTGTAGTCCCAGCTACTTANGAGGCTGAGGTAGGAGAATTGCTTGA 

ACCTG 

Sequence 2182 

ACTACTATAGGGCGAATTGGAGCTCACCGCGGTGGCGGCCGAGGTACGCGGGGCTTGCCT 

GGAGTCTGATCTGTCCCGGCCCAGTGTCCTCCAGGACCCTGGCCCCTCATGCCTCCGTGC 

TTGCGCGTGTGCCATTTCCTCTCTCCAGAGGACCTTTCCTGCTAGGACTCATGATTGTCC 

CCTCCCtGGCATTTTTTACACCTGGAGCAGCCAGAGGACGCATGCATGGCTCTTCGGAAG 

CCTTCTCCTGCCACGGCATGCACCCACACATGCGAGCCTCCCGGGTACCTGCCCG 

Sequence 2183 

CCGCGGTGGCGGCCCGAGGTACTGATCTAACCAAGATATTTTGTTTTTCTCATCCACCAG 

TCACTTTCTCAGTCCTTTCTGTATCCCTTGCAATTTGAACAAAGCTTGGTGAATAGTGTG 

CACACAAAAAGCACACTAGGTGAAAGACAGGTACATAAAAAGGGTAAAGTCATGATATTT 

TAACAAACCTATCAAGCTCTAAATATAAGCCTCCTTGGTAGTTTTCTCTTTAACCCTCTC 

TCCACTGTTGGATGAAATTTGCTGCATTCAATTCCAGTTCCCACCCCMCTTCCTTCTTA 

ACACAAGGTCAGGGTTAAGCCTTCGGTGCTTTMTCCAGAGGAAAATTACTTATTTTAAA 

AAGCAGTGAAAAACACCGCATTCCTTTGGCACAGGATGAAGACCAGCTCAGCTCTTCAGT 

TGTTGATCATTGTCTATTGTTCTCCAAACAGTAAACCAGTATTTCACACTGAGATTGTCG 

GCTGCGGGTATATTCCAATTCCCCGTCTCCTCATGAATATGAAGTGAAGGGCTCTGACCC 

TGGAAGTGGTAGGTAATTGCTTGGCG 

Sequence 2184 

CCGCGGTGGCGGCCCGCCCGGGCAGGTACGCGGGGGAGATGCTGCCACCTAGGTTACTTG 

TAGGACCCTATACGGCAACCTCCTTTGCCAGGAACTATTTATAAACATCCTGCAGGAAAA 

TGAGTCTATATGTCAGAATACACATTTCCACCTTGCCCAACAGTAGAAAAACATAAGAAG 

AGAAAAACATTAAAAAATGACAAGGAAGTTAATGGAAGCAATGTGATGGTGTTTGGAGGT 

GGAGCCTTCAGAAGGTAATTAATGCCCTTGTAAGAAGAGGCCAGAGAGCTTGCGCACCTT 

CTTCCTGCCATGTGAGGAGCCAAGAAGCCGGCTGTCTGCAACCTGCAAGAGGACCCTCAC 

TAGMGCTAGCCATACTGGCATCCTCATCTTGGCTTTCCAACTTCAGAACCGTGAGAAGT 

ATATGTTTGTGGTTTAAGTCAATGGTCTATGGTAA I fTTD I ATAGCAAGTCCCAGCCCA 

AGACAGTGCCTCATTTACTACATACCATTTATATTATTATATAGGCTNCTTTCAGAAACC 

CATGTTCAAAATAAGAGATAAGATCCTGGAAA 

Sequence 2185 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCTTGGAAAGCCAAGATGAGGATGCCAGTATGG 
CTAGCTTCTAGTGAGGGTCCTCTTGCAGGTTGCAGACAGCCGGCTTCTTGGCTCCTCACA 
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TGGCAGGAAGAAGGTGCGCAAGCTCTCTGGCCTCTTCTTACAAGGGCATTAATTACCTTC 

TGAAGGCTCCACCTCCAAACACCATCACATTGCTGACATTTTCCTGCAGGATGTTTATAA 

ATAGTTCCTGGCAAAGGAGGTTGCCGTATAGGGTCCTACAAGTAACCTAGGTGGCAGCAT 

CCCCTCCCAGCGTCAGGTCTAAGCACACAAGTTCTTCCTTCCATCTACAAACCCCCGCGT 

ACCTCGGCCGCTCTAGAACTAGTTGGATCCCCCGGGCCTG 

Sequence 2186 

CGAGGACGCGGGAGACCTGACGCTGGGAGGAGATGCTGCCACCTAGGTTACTTGTAGGAC 

CCTATACGGCMCCTCCTTTGCCAGGAACTATTTATAAACATCCTGCGGGAAAATGCAGT 

GAAGTAGAAGAGACAGGGATATCCANAAGGTTATGCAAAACATCAAGAGAAGATGAGAGG 

AGTCTATATGTCAGAATACACATTTCCCACCTTGCCCA(3CAGTAGAAAAACATAAGAAGA 

GAAAAACATTAAAAAATGACAAGGAAGTTAATGGAAGCAATGTGATGGTGTTTGGAGGTG 

GAGCCTTCAGAAGGTAATTAATGCCCTTGTAAGAAGAGGCCAGAGAGCTTGCGCACCTTG 

TTCCTGCCATGTGAGGAGCCAAGAAGCCGGCTGTCTGCAACCTGCAAGAGGACCCTCACT 

AGAAGCTAGCCATACTGGCATCCTCATCTTGGCTTTCCAACTTCCAGAACTGTGAGAAGT 

ATATGTTTGTGGTTTAAGTCAATGGGTCTATGGTAAI I II II I ] ATAGCAGTCCCAGCCA 

MGACAGTGCCTCATTTACTACATACCATTTATATTTATTATATAGGCTCCTTTCAGAAA 

CCCATGTTCAMTAAGAGATAGATACTGAAACACATTACACCCTTCACTAGTTTTTTAGT 

ATACAAATATTGAGAAAAT 

Sequence 2187 

CGAGGTACATTATCAAGTGTGAGATGTTTGGGATATACGCAAGCTGTGCCTGGAAATCAT 

GACAGTGCCTCTATGCTGTTGATGTACCAGCCATGCCGTTTCCTGCCCTATATCCTTTCT 

TAAGGGTAAGTAATCTTAGTGCCTGGTGMGCTAATTTTGTCTCACGAGCCCAACTGTTA 

ACCTGAATCCAATACCTATGCTGCTGGTGAGACAGAGTGGGTATGAAAAAACATATAACC 

TGAAAAATCTACTTATATATGCTAAACTAATAAACCTGAGGGATATGCAGGGGGAAAAAA 

TCAAATTTTAAGGAAATCTAACACAGAAAATAAATTCACAATGAAATAAAAACATAAAAT 

TGTGCAAGCAAAATAAAAATACTACACTTAAACATAAAATATACTTGTTAAAATATGGTA 

ACTCTTTACCCCACAATTAAAAAAGCTTT^ 

CTGAGATTCTTTTCTATGAGAAATACATCACAGGTTGATTTACTTTAAAAGCAAGCCCAT 

TTAGTGGTTGAAANTAAAAAAAAAAAAAAAAA 

Sequence 2188 

CCGCGGTGGCGGCCGAGGTACGCGGGGTTCATGCTCTCTCTCTCCATGCACAAGAAAGAG 

CACACAGTGAGAAGGAAAGAGGCCCTCACCAGAACCTGACCATGCTGGCACCGTGATTGC 

AGACTTCCAGCCTCCAGGATTGTGAGAAGATAAATATCTGTGGTTTAAGTCACCAAGTCT 

GTGGTGTGTTTTATGGCAGCACAAGGTAACCCAGATGATGCCTTTCACCACTGGGTGGTC 

CAGCCTTTCATGGMGCTGTCAGCTTCTCAGCCTTGCAAAAAATGCTTGCTGGTCACTCT 

CTTTGGTCCTTTC AG ATGCCTTTCTTAGTG GTGCCTCTACCAGGC CCTTG ACTTCATCCT 

TTTCTGTCAAGAGGAAGGGTGAGTAAAGTACCTGCCCG 

Sequence 2189 

CCGCGGTGGCGGCCCGCCCGGGCAGGNACTTTN I I I I 1 1 I H i M I II 1 1 H H ) I I I I I 

AACNGCTGTTCACTGCTACCCATTTAGTCAACTTGGACCACCCTNTAGAGGGGCTTCAAC 

TGTTATTTTCAATACATTGTCACAGGGACAGAGGAGGGTGTTAGGGAACTNTGTGACTAA 

ANAGTTTTCCATAAATGGGTGGACCTCAGTTTACAAGCATTAAGTTGTTGGGCCAGGAGG 

GGGAACAGATCATGATTAGGAGAGTGGAAAANAAAGATGACAAATCACTTTAAGTCANAC 

AAACCTCTTTCTGTACCT 

Sequence 2190 

CGCCCGGGCAGGTACGCCTACTCAACCCGGCTGTTCACCATTGGTGGCATCAGCATCCCA 
TACACATGGAACCACACCGTTTTCTATGATCAGGCACAGGGAAGAATGCCTTTCTTGGTT 
GAAACACTTCATGCATCCTCTGTGAATCTGACTATAACCAGATAGAAGAGACACTGGGTT 
TTAAAATTCATGCTTCAATATCCAAAGGAGATCGCAGTAATCAGTGCCCCTCCGGGTTTA 
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CCTTAGACTCAGTTGGACCTTTTTGTGCTGATGAGGATGAATGTGCAGCAGGGAATCCCT 

GCTCCCATAGCTGCCACAATGCCATGGGGACTTACTACTGCTCCTGCCCTAAAGGCCTCA 

CCATAGCTGCAGATGGMGAACTTGTCAAGATATTGATGAGTGTGCTTTGGGTAGGCATA 

CCTGCCACGCTGGTCAGGACTGTGACAATACGATTGGATCTTATCGCTGTGTGGTCCGTT 
GTGGAAAGJG 

Sequence 2191 

CCGCGGTGGCGGCCGCCCGGGCAGGTACACACAAAGACAAACCTGAACTTAATTTCAAGG 

AAMCTTAAACCCATGCACAAATAATTGGTGAGCCTTCATTTCCCTGACTTCAAGTTTCC 

ATGTGAGGACTCATGCTCTCTCCACTTTCTTCTTGGGAGGAGGGAAGATTTACCTAATGG 

GTAMTTTGGGCAAAGCACATTGAGTGTGCTTGTTTGGCTCTGAGTCTCTTTGCAAACAT 

GTGTCTGCCCACAGTGACATGAGTTTGCGTTGACTGTCATGTCTGCAGGAAGCTGCCTGC 

TCCTGTGGCCATGTCAAGCAATTCTTTCTTTCAACTGCAACTGTGTGTAAGAGCTTAGTC 
TGAGAAGA 

Sequence 2192 

CCGCGGTGGCGGCCGCCCGGGCAGGTAC M I H I I I I M M M M M M M I I CAGGATN 
ATTAGG AATCATTTCC AACACCAGG AC AAATATGC AAAGTAAAG N GGGTATTGTTATTTN 
TTTGTCCCTAGTTC ACATGATTTAN ATTGATN AAAATG N N AATTTTTAC ATATGTG AATA 
AAACGTAAAGGAGCAAACTGGANATTGCTGCCATGTGCTANCAATCATTGNGGGNGGGTA 
AACTGA CAACATGTM CTGC ATATTAANACAMTGTAGAAmANACAAAGTNTTCCATC 
MUM NAGATGTATACTTTTAMCCTCAGGMCTGTTTTCAGTAGATGTCATGCCTTTT 
AACTTGAA 
Sequence 2193 

GGNGGCGGCCGAGGTACGCGGGGGTAAGATGGANGGAAGAACTTGTGTGCTTAGACCTGA 
CGCTGGGAGGAGATGCTGCCACCTAGGTTACTTGTAGGACCCTATACGGCAACCTCCTTT 
GCCAGGAACTATTTATAAACATCCTGCANGAAAATGAGTCAAGGAAGCTTTTCTTTTGAG 
CTATTTACAGCTTTTAGCMTTGNGTAAAGTNTACTCC^ 

TTTGGTTCTCTCTAACTGATTNICTCCAGAATTTGGAACTAGTTGTAGCTGAGACCAATGA 
TGCTGACCTNCCTCAAAAGCTGCATTTCTGAATTTCTGAAGGCAAACTGTCTGCCTATAT 
TGTACCTGCCCG 
Sequence 2194 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACCATCCCCAAGGACA 

CGGAAGTATTTCTCATCCTGAGCACTGCTCTCCGTGACCCACACTACTTTGAAAAACCAG 

ACGCCTTCAATCCTGACCACTTTCTGGATGCCAATGGGGCACTGAAAAAGAATGAAGCTT 

TTATCCCCTTCTCCTTAGGTAAGCTGGACCCACAGTTTCTTTCCCAGACACCAGAGGGCA 

GGTCCTATCCTCAACTTGAGAAAAAAAATGACAGGTCCTTATTAATTGAGCACTTAATAT 

ATTCCAATTGCTTCACCTGCCTTATCCCCTTCCATCTTCACTACAACCCTGTAAGGAGGC 

TTGA GAAAGAAGATGACATTCCCAAAGGCACATCTGGGCAAGCAGGAACTTGGGCAAGTA 

TTTTAACATCTCTAAACCTCAGTGMTTCATTTTCTTAAAAAGAAAAAAA 

Sequence 2195 

TCCCCGCGGTGGCGGCCGCTTTCTCTTACTGATAGTAGGATATTTCTGCTTTAGTTATTG 

TCACCTT^TATATTTTCAATGTTGAAATCCTCACAGCATGmGATGAMTCTAG^ 

TCAAATTTTCTTAGGTATATTTCTGTCACGTTGGCATGATAACAAATGCAATAACCCAAA 

AGACCCCAAAAGCTAGTGTAATCCCTTTTGCAATCCAAGCATGAGGATTCATCTTCATGT 

TGA CAGTGCGTG AATGTTCGGTAGGCTTTGTCAAGCTTGCATACAATAAATTATATATGT 

CCCTTTTCTTTTAGGGTCTCCTGTTGAAGATGGTCTTCTGAAGGCTMCTGCGGAATGAA 

AGTTTCTATTCCAACTAAAGCCTTAGAATTGATGGACATG 

Sequence 2196 

TTAG GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACACAACTCACTCC 
TTTTAAAGAAAAAAATTGAAATGTAAGGCTGTGTAAGATTT 
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ATTGGANCCACTAGGAATCCTGAGAAAGAGGAGTGGACATACTCAGAGGAGTATAGGCCA 
TTTGACTCGGCATTGGGAAACCTGGAGCCACACCTGGTCATTTTCTGTGAGATCGATGAT 
GGCACTCCCTGAAGCCTGATCCAGGTAGCCTTTGGTGTATTCATCATANGTGTACCT 
Sequence 2197 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCCGGCCGCCCGGGCAGGTACAATATAGGCAC 

GACAGTTTGCCTTCAGAAATTCAGAAATGCAGCTTTTGAGGGAGGTCAGCATCATTGGTC 

TC AGCT ACAACTAGTTCCAAATTCTGGAGAMTCAGTTAGAGAGAAACAAATATGTTCCA 

AATTTTGTTCACAGGAGTATACTTTTACTCAAATTTGCTAAMGCCTGTAA^ 

AAGAAAAGCTTCCTTGACTCATTTTCCTGCAGGATGTTTATAAATAGTTCCTGGCAAAGG 

AGGTTGCCCGTATAGGGTCCTACAAGTAACCTAGGTGGCAGCATCTCCTCCCAGCGTCAG 

GTCTCCCCGCGTACCT 

Sequence 21 98 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCCGGCCGCCCGGGCAGGTACAATATAG 

GCAGACAGTTTGCCTTCAGAAATTCAGAAATGCAGCTTTTGAGGGAGGTCAGCATCATTG 

GTCTC AGCT ACMCTAGTTCCAMTTCTGGAGAAATCAGTTAGAGAGAAACAAATATGTT 

CCAM7TTTGTTCACAGGAGTATACTTTACTCAATTGCTAAMGCTGTAAATAGCTCAAA 

AGAAAAGCTTCCTTGACTCATTTTCCTGCAGGATGTTTATAAATAGTTCCTGGCAAAGGA 

GGTTGCCGTATAGGGTCCTACAAGTAACCTAGGTGGCAGCATCTCCTCCCAGCGTCAGGT 

CTCCCCGCGTACCT 

Sequence 2199 

CCGCGGTGGCGGCCGAGGTACAGAGGTGATAGATCCCTTCTTGGTAGTGGTAATTCTTTC 
CTGCATAGTACGCGGGGGCTGTAGTGGCTTCGTCTTCGGTTTTTCTCTTCCTTCGCTAAC 
GCCTCCCGGCTCTCGTCAGCCTCCCGCCGGCCGCTCTAGAACTAAGTGGATC 
Sequence 2200 * 

ACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACAGAGGTGATAGATCCCT 
TCTTGGTAGTGGTAATTCTTTCCTGCATAGTACGCGGGGGCTGTAGTGGCTTCGTCTTCG 
GTTTTTCTCTTCCTTCGCTAACGCCTCCCGGCTCTCGTCAGCCTCCCGC 
Sequence 2201 

CTACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTTTTTT 

rTTTTTTTTTTTTI IGNTTTCAAAACCCGCTTATTTCACGCGAGGACTCCTCTGCGAAGA 

CAGGAAGGATGGGGTCCCACACCTGAGCCAGGCGCGGTCTCCACTGGGTGCTCCGCAGCC 

CGGAGTGGNTCCGGACGACAGCCCCGCTTAACGGAACCTTTCNCCCAGGCTNGGGTGGAC 

CTTNTCCGCTTGTTTTCCTNAGGGCGCAAGTCAGGTTTNTTTGACACACACCAACCGAAG 

TTNAAGCGCGTGTCCCCGCGTACCTNGGCCGCTCTAGAACTAGNGGA 

Sequence 2202 

ACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGNCAGGTACNATGTN 
TAGTGATGAGTTTGCTAATACAATGCCAGTCAGGCCACCTACGGTGAAAAGAAAGATGAA 
TCCTAGGGCTCAG AGC ACTG N ANC AN ATC ATTTCACCC N CGTACAGTTTAGGGG ATCCTT 
TNTAATGACAGGAAGGCACTGCTTTCCTCAACACTGTNATCTGACCTGTGACAAGTCTGN 
ACCT 

Sequence 2203 

TCGCACTCATTTACCCGGGGACAGGGAGAGGCTCTTCTGCGTGTAGTGGTTGTGCAGAGC 

CTCATGCATCACGGAGCATGAGAAGACGTTCCCCTGCTGCCACCTGCTCTTGTCCACGGT 

GAGCTTGCTATAGAGGAAGAAGGANCCGTCGGGAAGTCCCAGCCACNGGGAAGGGCCGTT 

NNTTCTTTGTAAGNTTGGTTTCNTCCCCGGCCTGGCCCTACCCCCGCGTAACNCTCCGGG 

CCNGCTCTTAAGAAACCTANGTTGGGGATCCCCCCCCGGGGACNTGCTAAGGGNAAATTT 

CNNAATATCANAGCCTTTATNCGNATTNCNCGTTCTGTACTCTTCGAAGGGGNGGGGGGC 

CCCCCGGNTTAACCCCCAAGCNTTTNTTTGGTTTNCCCCCTTTTTTAAGGTTGGAAGAG 

GGTTTTAAAANTTTGGGCCGGCCCGNCCTTATNGGGGCCGGTTANNANTTCCAATTGGGG 
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NTTCCANTTAAAGGGCNTTGGTCNTTNTNCCCCTTGGNTGGGTTNGNANANAAATTTTGG 

GTTTTAATTCCCCGGCCTTTCAACCAAAA 

Sequence 2204 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGTTTGTAGA 

TGGAAGGAAGAACTTGTGTGCTTAGACCTGACGCTGGGAGGAGATGCTGCCACCTAGGTT 

ACTTGTAGGACCCTATACGGCAACCTCCTTTGCCAGGAACTATTTATAAACATCCTGCAG 

GAAAATGCAGTGAAAGTAGAAAGAAGACAGGGGATATCCCAGAAGGTTATGCAAAACATC 

AAGAGAAGATGAGAGGAGTCTATATGTCAGAATACACATTTCCCACCTTGCCCAACAGTA 

GAAAAACATAAGAAGAGAAAMCATTAAAAAATGACAAGGAAGTTAATGGAAGTCAGCAA 

TGTGATGGTGTTTGGAGGTGGAGCCTTCAGAAAGGTAATTAATGCCCTTGTAAGAAGAGG 

CCAGAGAGCTTGCGCACCTTCTTCCTGCCATGTGAGGAGC 

Sequence 2205 

CTCCCCGCGGTGGCGGCCGCCCGGGCAGGACGCGGGGTTTGTAGATGGAAGGAAGAACTT 

GTGTGCTTAGACCTGACGCTGGGAGGAGATGCTGCCACCTAGGTTACTTGTAGGACCCTA 

TACGGCAACCTCCTTTGCCAGGAACTATTTATAAACATCCTGCAGGAAAATGAAGTCTAT 

ATGTCAGAATACACATTTCCCACCTTGCCCAACAGTTNGAAAAACATAAGAAGAGAAAAA 

CATTAAAAAATGACAAGGAAGTTAATGGAAGTCACCAATGTGATGGTGTTTGGAGGTGGA 

GCCTTCANAAGGTAATTAATGCCCTTGTAAGAAGAGGCCAGAGAGCTTGCGCACCTTCTT 

CCTGCCATGTGAGGAGCCAAGAAGCCGGCTTGTCTGCAACCTGCAAGAGGACCCTC 

Sequence 2206 

ATACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGGGGGCCATTGAGACTGCCATG 

GAAGACTTGAAAGGTCACGTAGCTGAGACTTCTGGAGAGACCATTCAAGGCTTCTGGCTC 

TTGACAAAGATAGACCACTGGAACAATGAGAAGGAGAGAATTCTACTGGTCACAGACAAG 

ACTCTCTTGATCTGGAAAATACGACTTCATCATGCTGAGTTGtGTGCAGCTGCAGCGGAT 

TCCTCTGAGCGCTGTCTATCGCATCTGCCTGGGCAAGTTCACCTTCCCTGGGATGTCCCT 

GGACAAGAGACAAGGAGAAGGCCTTAGGATCTACTGGGGGGAGTCCGGAGGGAGCAGTCT 

CTTCTGTCCCGCTGGAACCCATGGTCCACTTGAAGTTCCTTATGCTACTTTCACTGAGCA 

TCCTATG AAATAC ACC AGTG AG AAATTCCTTG AAATTTG C AA 

Sequence 2207 

CCGCGGTGGCGGCCGAGGTACTTTTATAAATACGCCAGAGATTGAGTCGAGGCTCTGAAG 

AAAGTCCACCATGTGAAATTCTTTCCGGTGACATTATCCAGTTTGGAGTAGACGTGACAG 

AGAATACACGGAAAGTTACCCATGGGTGTATTGTTTCCACAATAAAACTTTTTCTACCCA 

GATNGGTATNGGAAGCCCGGCTCCGCTCAGATGTCATCCATGCACCATTACCAAGTCCTG 

TTGACAAAGTTGCTGCTAACACTCCAAGTATGTACCTG 

Sequence 2208 

CCGCGGTGGCCGGCCGCCCGGGCAGGTACACACAAAGACAAACCTGAACTTAATTTCAAG 

GAAAAC7TAAACCCATGCACAAATAATTGGTGAGCCTTCATTTCCCTGACTTCAAGTTTC 

CATGTGAGGACTCATGCTCTCTCCACTTTCTTCTTGGGAGGAGGGAAGAATTACCCTAAT 

GGGGTAAAATTTTGGGCAAAGCACATTGAGTGTGCTTGTTTGGCTCTGAAGTCTCTTTGC 

AAACATGTGTCTGCCCACAGTGACATGAGTTTGCGTTGACTGTCATGTCTGCAGGAAGCT 

GCCTGCTCCTGTGGCCATGTCAAGCMTTCTTTTCTTTCAACTGCAACTGTGTGTAAGAG 

CTTANTCTGAGAAGAAATGTTCAGAAGCTCACTTGTGGGCTGCACATCTGAGCCATGTCT 

TTCCCA 

Sequence 2209 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGGGGGCCATTGAGACTGCCATGGAA 

GACTTGAAAGGTCACGTAGCTGAGACTTCTGGAGAGACCATTCAAGGCTTCTGGCTCTTG 

ACAAAGATAGACCACTGGAACAATGAGAAGGAGAGAATTCTACTGGTCACAGACAAGACT 

CTCTTGATCTGCAAATACGACTTCATCATGCTGAAGTTGGTGTGCAAGCTTGCAGCGGAT 

TCCTCTGAGCCGCTGTCTATCGCATCTGCCTGGGGCAAGTTCACCTTCCCTGGGATGTCC 
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CTGGACAAGAGACAAGGAGAAGGCCTTAGGATCTACTGGGGGAGTCCGGAGGAGCAGTCT 
CTTCTGTCCCGCTGGAACCCATGGTCCACTGAAGTTCCTTATGCTACTTTCACTGAGCAT 
CCTATGAAATACACCAGTGAGAAATTCCTTGAAATTTGCAA 
Sequence 2210 

AGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGAGAGGTCT 

TTTGAACTCCTTCGGCCACCG TCGCC GCGTTCTCGCTGTGCACTCTTATTCTGCGCCTGC 

GCGCGGCTACAGCACGGTTCGTTTTTCCTTTAGTCAGGAAGGACGTTGGTGTTGAGCATA 

CCGTATCAAGGACAAGTAACTACCATGGCTCCCGAAGTTTTGCCAAAACCTCGGATGCGT 

GGCCTTCTGGCCAGGCTGTCTGCNAAATCATATGGCTGNANCATTCGNNCTATCCCT 
Sequence 2211 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAAACAAGCAAAGTCT 
CCCATAACTGAAACAGAGGATTCTGTTTTAGAAAAAGGCCCAGCTGAGCTTANGAGCAGA 
GAAGGAAAAGAAGAAAATAGAGAGCTTTGTGCATCTTCTACGATGCCTGCAATTTCAGAG 
CTTGCATCATTGCTTATGGGAAGGAATCTTCAGATTGAAGAAATTAAACCTTTCTCTCCC 

AAGATCATCAGCCTAGAGTC GAAAG AACCACCTGCCTCTGTAGCTGAAGGAGGCAACCCA 
GCMGAATTTCAGCCATTTACTTTTTC 

AGCCCGACTTTAAAMGGGAGGAAATCAAGAAATAGGCCCATTACCACCAACTGGAAATT 

TGAAGGCACAAGTCATGGGGAGATATTTTTANAATAAGCTAAGTG 

Sequence 2212 

CCGCGGTGGCGGCCGAGGTACGCGGGGACAGCGATGTGAGCTGAGGTGCAGGCACCAGAC 

CTAGGAATTCCTAGAAAAATAGTCAGGAAGCATTTAGACACATCAAATGTTAAACGAGTC 

CTGATTATGATGATAATGATGATGATTTTGGTGGTTGCAATAGCAAAGCCTTAAGTATGA 

AGGAGACGTGCCAGCTGGAAATACAGGTAGACAATGAACAACTGAATTTAGAGGACGAAG 

ACATTGAAAGCATTGATGCCACCAAATTGAGCCGTTTCATTGAGATCAACAGCCTCCACA 

TGGTGACAGAGTACCTGCCCG 

Sequence 2213 

CACTNCTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGGACCCGGGGGCTTGN 

CCCCTNATGCCTCCGTGCTTGCGCGTGTGCCATTTCCTCTCTCCAGAGGACCTTTCCTGC 

CTAGGACTCATCATTGTCCCCTCCCTGGCATTTTTTACACCTGGAGCAGTCAGAGGACGC 

ATGCATGGCTCTTCGGAAGCCTTCTCCTGCCACGGCATGCACCCACACATGCGAGCCTCC 
CGGGTACCTGCCCG 

Sequence 2214 

AGGTACAGGCATGAGCCACTGCGCCTGGCCCCATGTTTGGTTATTATTAGTGCTTAGGAA 

GAGGCACTTGCTTACATAGTAGGAGTTGAGAAGCTTGGTTTGTTCTTTCCTACCCCTAGA 

TCTATTCTCACCTCCTGACCATGCTCTTTCTGCCACATCTATTATCATTACAAGTTGCCT 

TATCTGAAATTAGTGMTCAGAAAATAAAGCAGGGGATACTTTGTGTAGTTTCAACGTTA 

GGGAAAGTTCAGAATACTGTCTGTCTAAACTATCTCTCTAGAAGGCCTGATGGGCCACAA 

CCTGGGCCAGAAGCATTCAGTTCAGATATGAGAATGGTGGGGTGTAAGGGGCAATGGCCA 

ATGGGCCATGGCCGGAAGGAAATTGTTACAGAGTAGTGGGAAGCCTGCAAAGACTGGCTT 
CTGTCCG l I I 1 1GCCTT 

Sequence 2215 

CCGCGGTGGCGGCCGGGGGCCATTGAGACTGCCATGGAAGACTTGAAAGGTCACGTAGCT 

GAGACTTCTGGAGAGACCATTCAAGGCTTCTGGCTCTTGACAAAGATAGACCACTGGAAC 

AATGAGAAGGAGAGAATTCTACTGGTCACAGACAAGACTCTCTTGATCTGCAAATACGAC 

TTCATCATGCTGAGTTGTGTGCAGCTGCAGCGGATTCCTCTGAGCGCTGTCTATCGCATC 

TGCCTGGGCAAGTTCACCTTCCCTGGGATGTCCCTGGACAAGAGACAAGGAGAGGGCCTT 

AGGATCTACTGGGGGGAGTCCCGGAGGGAGCAGTCTCTTCTGTCCCGCTGGAACCCATGG 

TCCCTGGAAGTTCCTTATGCTACTTTCACTGAGCATCCTATGAAATACACCAGTGAGAAA 

TTCCTTGAAATTTGCAAGTTGTCTGGGTTCATGTC 
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Sequence 2216 

GGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGATTGACATCCCACACTACC 

TGATTTGATACACTAAGAAGGGCTTATCATTTCTGTGGTACTCTTGGCCAACAATGCACG 

TGTAACATTTATTCATTGAMCACATTAGGCAGMTTGCAGGACATTCTTCAAAATAGCT 

GTCCCAATACTCTTCAAAAGTGTCAGGTCCTGGAAGACAAAGAGATACTGAGGAACCATC 

ACAGAGTGGGAGAGGACATAGAGTGATAAAAACTAACTGTGATGTGGAATCCTACATTGG 

ATCATGGACCAGAAAGACAGCACTGATGGGAAGACTGATGAAATCTGAATAAGTCTGTAG 

N TTTGGTTTAAAG AAGAAGAATAATAAC AATAATGG TTTAGCTGCTGGC TCCTTAATAAA 

ATTCCCCTAGTTACTGTAATGTCTGAAAATGAACCCC . 

Sequence 2217 

GACTNCTAT AGGGCNNATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC I I u M I I ' M I I 

" iilim t'GAGCCTCANTTGMCTATTCATTCTTGTCTTCCANAACTCACTTCAAATG. 

GGTTTTCCATCTATCATGAGTCTAGTGTTCTCACCTTTGTCTTACMTTAATTCTTCTGC 

ANATCCCCTCACACTGATCTCCTCCCCCTGAGCTTTGGACAATTATATTCTCTCATCTCT 

T TCACA TAAATCGCACTTGGAAGACAAACTCCTCAGGTTAAGCTGCCATCTGGTGTGTGT 

An I I ICATCAAGAACTAGCAACACCCACATCAATAGCATCCACCTTTACCATCACCTAT 

ATMGCAACTTACTTTGGGAACCTATTTCCTTATTTGAGCATTCTCATTTGAATCAATCT 

AGCCTAGGTTAGGGACACTGGAACCAAACTTTGGGCTGTCCCTGGACTGGCCTGGG 
Sequence 2218 

TACGACTACTATAGGGTGA ATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC i T I I I I 1 1 I 

nrTTTTTTTTTTTTTTTTTI 7 CGNCAATTC AAAGTAAACC AAATC ATCACATCAAG AAAA 

TACATTGNGCAACTNTNATGTCCCCTTNTGACAAAAAAGGCACATAGCCAGGNGTATGTT 

CCCTGGAAGCTTAAAATCTANTTGGGGACAATCAGGCAAGAGCACAAACGGCCTNTTGNT 

TGGGCTGNGGCAAAAAGCTGGATTTTATCCTAATTGCNACAGGATGACATCAGGAGAGCA 

ACATGATCTCATAGANATTCTTTTAAATATATGMTTTTTAACTT 

Sequence 2219 

ACGACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCG 

GGGGCAGTGGGAAGCTCGCAGCAGCTGGGGAGGAGCCAAAGCCTCGGCGCTCACCTAAGC 

CGCAGGGAGATACACCCAACTGGGAGATGAGGAAACAGCAACCCAGAGAGGAGAACTAAC 

CCACACAGGATCATTTCGCGAAGGAGCAAGGCTGAAGAACCAGACCTGGACTTTCTTAGG 

ACAAACTTACTGCAGCTTGAAGGAGCCAACCATGGATTTGAGGCGTGTGAAGGAATATTT 

CTCCTGGCTCTACTATCAATACCAAATCATTAGCTGCTGTGCTGTTTTAGAGCCCTGGGA 

GGCGATCTATGTTTAACACCATCTTACTAACCATTATTGCTATGGGTGGTATACACTGCC 

TATGTCTTTATTC CAATCCAC 

Sequence 2220 

GGGGAGGAGCCAAAGCCTCGGCGCTCACCTAAGCCGCAGGGAGATACACCCAACTGGGAG 

ATGAGGAAACAGCAACCCAGAGAGGAGAACTAACCCACACAGGATCATTTCGCGAAGGAG 

CAAGGCTGAAGAACCAGACCTGGACTTTCTTAGGACAAACTTACTGCAGCTTGAAGGAGC 

CAACCATGGATTTGAGGCGTGTGAAGGAATATTTCTCCTGGCTCTACTATCAATACCAAA 

TCATTAGCTGCTGTGC TGTTTTAG AGCCCTGGG AG CG ATCTATGTTTAACACC ATCTTAC 

TAACCATTATTGCTATG 

Sequence 2221 

NGGCGAATTGGAGCTCCCCGCGGTGGCGGCNCGCCCGGGCAGGTACTTATTTTTATATTA 
AGTCAGACTTCAGGATTTATTTTAGCCTTCCT 

TTAAGCAGCA l l l l iCCTCATCGCACACGGATCTGAGCAGGTGACCGCGGCTGGGATAGG 

TCTGTTGTGTTGTGTGCAGGTGGGCTGGCCAGCCGTTGGGTGTGTGTTTTTCCTAGAGTA 

AGAAGCACTCCAGGCTGAGATGAGAGCTGTTGAAATGAGTAACATTTCCGTCTCCTGTGA 

TACGCGCTGCTGTTGCTTCTTTCAAATGATCAGATTTACATTCTTTTAATGGGTCCTT^ 

AAAATGTAATCGAGNGAGAAGGACTTCTAAAGATTTTCTTTTGGCGTGGAATCACACNAA 
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GGGGAA7TGGAAMTTACTAATTCATGAAAATGAAAATGTGGGCTTCTTTTAAGGAAAM 

TTCCTTTGAACATTGACAGAAGTGGGCCGTAGGGAAGGGGAAGAATTTGCCTGGATTCTT 

CACTTAACTCTTGTCACCACTGGAGCATNCTGACTCCCTAGACACAAAGTGATGGAAGTT 

ATTTTGTTTCCACTCTTAAACACTGTCTCAAGGGGGACATTGATGGGATGGGNGG 
Sequence 2222 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGCAACAGAATGAGAAGC 

TACAAACMGAMTGGGAAAACTCGAMCAGATTTTAAAAGGTCACGGATCGCCTACAGT 

GACGAAGTACGGAATGAGCTCCTGGGGGATGATGGGAATTCCTCAGAGAACCAGAGGGCA 

CATCTGCTCGATAACACAGAGAGGCTGGAAAGGTCATCTCGGAGACTAGAGGCTGGATAC 

CAAATAGCAGTGGAAACCGGTAAGAATTCTGAGAGTGAGCAAATTGTCTTGCTTATGCAC 

AGCAGTCTTCACAACACATGACATTTCAGGGAAACTTCAAAGGAGTAGCANAGACAGCAN 

CCCGAGATGTGGTTTACATATTGGGGAGACAATTGGGAGCTTATTTGCGCTTATCTTTTT 
TCAAGGT 

Sequence 2223 

AGGTACACACAAAGACAAACCTGAACTTAATTTCMGGAAMCTTAAACCCATGCACAAA 

TAATTGGTGAGCCTTCATTTCCCTGACTTCAAGTTTCCATGTGAGGACTCATGCTCTCTC 

CACTTTCTTCTTGGGAGGAGGGMGATTTACCTAATGGGTAMTTTGGGCAAAGCACATT 

GAGTGTGCTTGTTTGGCTCTGAGTCTCTTTGCAAACATGTGTCTGCCCACAGTAACATGA 

GTTTGCGTTGACTGTCATGTCTGCAGGAAGCTGCCTGCTCCTGTGGCCATGTCAAGCAAT 

TCTTTCTTTCAACTGCAACTGTGTGGTAAGAGCTTAGTCTGAGAAGAAATGTTCAGAAGC 

TCACTGTGGCTGTACCTGCCCGGGCGGC 

Sequence 2224 

CTACTATAGGGCGNATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACACAAAGACAAAC 

CTGMCTTAATTTCAAGGAAAACTTAAACCCATGCACAAATAATTGGTGAGCCTTCATTT 

CCCTGACTTCAAGTTTCCATGTGAGGACTCATGCTCTCTCCACTTTCTTCTTGGGAGGAG 

GGAAGATTTACCTAATGGGTAAATTTGGGCAAAGCACATTGAGTGTGCTTGTTTGGCTCT 

GAGTCTCTTTGCAAACATGTGTCTGCCCACAGTAACATGAGTTTGCGTTGACTGTCATGT 

CTGCAGGAAGCTGCCTGCTCCTGTGGCCATGTCAAGCMTTCTTTCTTTCAACTGCAACT 

GTGTGTAAGAGCTTAGTCTGAGAAGAAATGTTCAGAAGCTCACTGTGGCTGTACCTGCCC 
G 

Sequence 2225 

CTATAGGGCGA ATTGGA GCTCCCCGCGGTGGCGGCCGAGGTACCATGATTAGTTAAATAT 

AAGACTCCGTAATTTTTACAATTTTAACAATA^ <2% 

GGATTGTATTAAAACTACAGTGTGTGACTTAGAAAATGATAATGCTGCTTTATGGAAAAT 

GGATTATAGGTGGGTAAGACTTCATTGCAAAAATTGTGTAATACCATCAGTGTTAGGAAC 

CCAGTTGAAGTCTAGAAGACAGATGATAGTATCTTAGACTAGGTTGGTATTTGAATAGAT 

ATT GGTAA TATCAGTAGAATTTAATMTACATTAGAAAGAAAGAAATCAGAGAAGATTCT 

TTTATTTTCACTTGATACTTGTGTTGTTACTTTCAATGAGATAAGAAAGACAGGCAAAG^ 

AGAACGTTCAGGGGCAGGGGATGAGAAGAAAACAAGAATTTTATGTTGGA 

Sequence 2226 

AGGTACCCAGCCCCTCCCCTTCTGAGATCTTTCCTGTCACACACTCTTTCCCCAGCTCTG 

CACATTCTGACACTCACCACACCAGCACAGCAGAATCTCAAAAAAAAGCCACAGCAGAGG 

GCTCCGCTGGTAGAGTTGAAAATTTTGGCAAGAGGAAGCCACTCCTCCAGGCCTGGGTCT 

CACCCTCGGAGACACATCCAGTCTCAGCTCAGCCAGGCGCTGGAACTGGGTCAGCCAAGC 

ACAGACTTCATCCTGTGAAGCCAATGAATGCCACGGCCACCAAGGTTGCTAACTGCAGCT 

TGG G AACTGCC ACC ATCATC AGTG AGAACTTGAACAATGAG GTCATG ATG AAGAAATACA 

GCCCCTCGGACCCTGCATTTGCA 

Sequence 2227 

TCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGCAG 
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TGGGAAGCTCGCAGCAGCTGGGGAGGAGTCAAAGCCTCGGCGCTCACCTAAGCCGCAGGG 

AGATAC ACCC AACTGGGAGATGAGGAAACAGCAACCCAGAGAGGAGAACTAACCCACACA 

GGATCATTTTNNCGNAAGGANCAAGGGCTNNANAACCCAGACCTGGACTTTNTTAGGACA 

AACTTTACTGCAGCTTGAAGGAGCCAACCATGGATTTGAGGCGTGTGAAGGAATATTTCT 

CCTGGCTCTACTATCAATACCAAATCATTAGCTGCTGTGCTTGTTTTAGAAGCCCTGGGA 

GCGATCTATGTTTAACACCATCTTTACTMCCANTTATTGCTATTGGGTGGGTATTACCA 
CTGCCTATGT 

Sequence 2228 

GACCTCCGCNTGGCNGCTTGGGTTCCNNNTTCCGGCTTGCCGGGCCGAAGCCGGGTTATT 

TCAGCCTTCMCCTTCAAAMGGGGCCGGGTTAAATTACCGGGTTTTANTTNCCACCAAA 

GGANNATTCCAAGGGGGGGNAATTAAACCNNNCCAAGGGGAAAANAGGNAANACCCAATT 

GGTTTGGGAAGGCCAAAAAAAAAGGGGGCCCCAANNCCAAAAAAAAGGGGGGCCCCCATG 

GGGTAAAA ACCNCCGGTTTN ANAAANANANGGGGCCCCCCGGCCGGTTNTTGGGCCTTGG 

GGGCCCGG I 1 I I I I I I I I 1 INCCCCAATTAAGGGGGCCTTTCCCCGGGCCCCCCCCCNCC 

TTTGGAACCCGGAAGGGCCMNTTTCAMCCAAAAAAAAAATTCCGGGAAACCGGCCNTT 

CAAAAAGGNTTCCANANNAAGGGGGTTGGGGGCCGNAAAAAAANCCCCCCGGAACCAGGG 

GGGAACCCTNATTTAAAAAGGGATTTAACCCCAANGGG 

Sequence 2229 

TCGACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAAGTGATCC 

TCCA tCTT AAAGAAGATCAGACAGAGTACGCGGGGTGTTTTTAAAAAAGAATTTAATTAG 

GAAATTTTGATACTCTTCATTACTAGGATTTATCCA7TGAAATTATTTA7TGCTATTACT 

TTTAACAGAGTCTCTGTGAGACTGTTTCACAGAAGGATGTGTGTTTACCCAAGGCTGCGC 

ATCAAAAAGAAATAGATAAAATAAATGGAAAATTAGAAGGGTCTCCTGTTAAAGATGGTC 

CTCTGAAGGCTAACTGCGGAATGAAAGTTTCTATTCCAACTAAAGCCTTAGAATTGATGG 

ACATGCAAACTTTCAAAGCAGNAGCCTNCCGAGAAGCCATCTGCCTTCGAGCCTGCCATT 

GGAAATGCAAAAGTCTGTTCC 

Sequence 2230 

TACTATAGGGCGAATTGGAGCTCACCGCGGTGGCGGCCGAGGTACGCGGGGTCCCGGCCC 

AGTGTCCTCCAGGACCCTGGCCCCTCATGCCTCCGAGCTTGCGCGTGTGTCATTTCCTCT 

CTCCAGAGGACCTTTCCTGCGTAGGACTCATGATTGTCCCCTCCCTGGCATTTTTTACAC 

CTGGAGCAGCCAGAGGACGCATGCATGGCTCTTCGGAAGCCTTCTCCTGCCACGGCATGC 

ACCCACACATGCGAGCCTCCCGGGTACCTGCCCGnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnGTTCCCTTTAGTGGAGGGGTTAATTGCGCGCTTTGGCGTAAATC 

ATGGT 

Sequence 2231 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGGGGGCCATTGAGACTGCCATGG 

AAGACTTGAAAGGTCACGTAGCTGAGACTTCTGGAGAGACCATTCAAGGCTTCTGGCTCT 

TGACAAAGATAGACCACTGGAACAATGAGAAGGAGAGAATTCTACTGGTCACAGACAAGA 

CTCTCTTGATCTGCAAAATCCACTTTCATCATACTGAGTTGTGTGCAGCTGCAGCGGATT 

CCTCTGAGCGCTGTCTATCGCATCTGCCTGGGCAAGTTCACCTTCCCTGGGATGTCCCTG 

GACAAGAGACAAGGAGAAGGCCTTAGGATCTACTGGGGGAGTCCGGAGGAGCAGTCTCTT 

CTGTCCCGCTGGAACCCATGGTCCACTGAAGTTCCTTATGCTACTTTCACTGAGCATCCT 

ATGAAATACACCAGTGAGAAATTCCTTGAAA 

Sequence 2232 

TCGACTCCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTGTGTCAGGA 
GTCCCCAAGCCCACTCCCAAAATAGAAAATTCACTAAAAGGAGTCAAAGGATTCAGCATA 
CAGTAATATTAACAACTAAGATTTATTCCAGCAGGATACAAAGCAAAATAAGAAAGGAAA 
AGTCGCATGGGGCAAAATCAGGTGAAAGCTTTCCAAGAGTCCTCTCCCAATGGAGTTACA 
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CAGGATGTGCTTAATACCTCCAGCAACCAGTTGTGACAATACATGCAAAGAGTGCAAAGT 
CTTGTCCACGACGGATGTTC I rTTTTTTTTTTTl I GAGAAAGCCTTGCTCTGTCACCC AG 
GCTGGAGTGTGGCACGATNCTGGGTGAAGAGCAAGACTCCGTNTCAAAAAAACAAAAGAA 
AAGGAAAAGTCCAGTCCTTGTCTTCAAAATTACTTCCAG 
Sequence 2233 

NCCGGCCGCCCGGGCAGGTACNTTATGACCCAACATTTACCTCAAAAGCTNTNAATGACC 

TTTGCGGGGAACTGTCCCCAAGACAACCTNAAAGAGACAGCACAACATTTGCAGCTGTTT 

CTCTTTCAGCCCAAGGTCGCAAAGGATATAAGAAGCCAATGCAGCTTGGANGGAGAAAGG 

GATTCCTGTAAATCACTCACTATGATTTCCCACAGCAGGCAAGTGGATCCCCTAAAGAAA 

GMGGCTTATAGAGTTCCCTCAAAAGAGCTCAGAACTTCAACAATGATTCTATTCACATA 

GGCATTTCTGTCTTCCAMTTTTAGCTTTAAGAGTCTGAGGAAGTACCTCGAGCGGCCGC 

CCGGGCAGGTACTGGAAAACCTCCATCTTGGCTCCCAGAGCTCTAGGAACTCTTCATCAC 

AACTAGGATT 

Sequence 2234 

CTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGACCTCTTAGGGA 

TCTAAGGAAAA GATTCT GGACCAGAATGGGCCTAGGCTTGAAGTGGATCTCTTCCAAAAG 

AATAGTGMTAGTTTTTATCTTTTCTAAAAGATGGATTG 

AGTAAGTTCTTGTAAGTTTAATTAATTTTGTMCATCTTTTGAGAATTTACCA 

CTCCTTGAGCTGGGTGCTTACTATGTGTTTATTATTTTAACCCTCACAACAACCTCAACA 

CCCTCATGAAGTAGGTCCTATTAGGATCCCTGTTTTGAAGATGMCTGAGGCTCATGTTT 

AGCCCAAGTTCACTGAGCTAGTAACTGGCAAAGTTAGAATCTGAAATGAAATCTGCCTGA 

TGTCAGAATGATGTTGTTCATTCATAGGCTGATATTCTT 

Sequence 2235 

TACTTAG GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC iTTTTTTTTTTTTm 

TTTTTTTGAGACAGGATCTTACTTTGTTGCCCAGGCTGGAGTGCAGTGTCACGATCACAG 

CTTGCTGCA GTCTCA AGCTTCCANACTCAAGCGATCCTCCCACTTCAGCCTCCAGNGAGT 

TGTCTTCTCATTTTTCTGNTCCTTNAACCCTTTCCCTCTCCACTAGCTGGTTTCCCTCAC 

AGGACCAACACNCATGTGCAGTATTTGCTTCCACCTCTTANAGATTTNTCAACTACAAGG 

ACTTAGGCAAGGTTTCCCTTTTGACTGGAAGGCTTGTAGAAACGTGACAACCCGCCTTCC 

CTAACACTCATGTCACAGCGGTAGCGGACCGTATTCTGGGACAAATGGGAACTCACGACA 

TTCCCCCCGCGTTACCTGCCCGGGNCGGC 

Sequence 2236 

CTATCA CTACT TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGTGATATC 

C AC ATA I I I I IGAGAAAAATTCCCAAGCCAGGCGAATGTGGATTGGAATAAAGACATAGG 

CAGTGTATACCACCATAGCAATAATGGTTAGTAAGATGGTGTTAAACATAGATCGCTCCC 

AGGGGCTCTAAAACAGCACAGCAGCTAATGATTTGGTATTGATAGTAGAGCCAGGAGAAA 

TATTCCTTCACACGCCTCAAATCCATGGTTGGCTCCTTCAAGCTGCAGTAAGTTTGTCCT 

AAGAAAGTCCAGGTCTGGTTCTTCAGCCTTGCTCCTTCGCGAAATGATCCTGTGTGGGTT 

AGTTCTCCTCTCTGGGTTGCTGTTTCCTCATCTCCCAGTTGGGTGTATCTCCCTGCGGCT 

TAGGTGAGCGCTGAGGCTTTGGCTCCTCC 

Sequence 2237 

AGGTACGCGGGGAGTCTTGACACCCTAGATCCCAAGATCTCCAAGGATTTTGGTGGGCAT 

ACCCACTCCAGCACACAGGAMGCCATGGAGGGTTTCAATGGMCTTCTTCCCTTCTTTC 

CTTGAACAGCTTCTTGGGCAGGGAAGCCCCCTGGGTTCTTGTNNCCTTATTGAATCCCAA 

GAAGGGCCCCGCCCTTNTGGCCCCCAAGGGAATCGGGGGGGAAACCCCCTTTGCCCATTG 

AAAG C C ATTC AAGCC ANGCTTTTAAAAAAGGG AAAAAAATTG NCCAN GGTTG GAAAAGAA 

CCCCCANGGGGGTTTTANGCCCCAGAACAGGGGCCCACCCAAAAAGCCCAAANGGGAAAG 

GCCAGGAANGAATTCCCCAAGCCCCNTTTNTTGGGGAAAAAAAAAGGGCCCTTAAGGAAC 

CGGGGAAGNCAAAAAAAAAAAAAGGCNTTGTTNGGGGGGGGGGGAACCTTCGGGGAAAAA 
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AAACCTTAGGGGAAAAAAMGAATTGGCCAAGGTTTCNGGAAAAAGAATTTCTTANGGAA 
AAAAGGCCGNTGGG GGTT AAAMNGGGAAGGCCCCGTTCCCATTGGAACCGGTTTTTAAA 
AAGMCCGGTTTCCCTTTTGACCTTTCAAGTTANCCCTTGGCCCCNGGG 
Sequence 2238 

ACGACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCG 

GGGGCAGTGGGAAGCTCGCAGCAGCTGGGGAGGAGCCAAAGCCTCGGCGCTCACCTAAGC 

CGCAGGGAGATACACCCAACCGGGAGATGAGGAAACAGCAACCCAGAGAGGAGAACTAAC 

CCACACAGGATCATTTCGCGAAGGAGCAAGGCTGAAGAACCAGACCTGGACTTTCTTAGG 

AC AAAC7TACTG CAGC TTG AAGG AGCC AACCATG G ATTTG AGG CG TG TG AAGG AATATTT 

CTCCTGGCTCTACTATCAATACCAAATCATTAGCTGCTGTGCTGTTTTAGAGCCCTGGGA 

GCGATCTATGTTTAACACCATCTTACTAACCATTATTGCTATGGGTGGTATACACTGCCT 

ATGTCTTTA7TCCAATCCACATTCGCCTGGCTTGGGA 

Sequence 2239 

CCGCGGTGGCCGCCTACCTCGGCCTCCCAAAGTGCCGGGACTACAGGCATGAGCCACTGC 

GCGCAGCCCCACTTGCCCMTTCTTGACCATCATTTATTGAGCTAAATCCAAAAAGGAAA 

CTCTACGACTGACTCTTTTCTTTCTCATGGCCTCCAAAATGMTCCTAGGTTTTTCCAGC 

AGGGTAAATTCAAMGGACTCAGGTTGGAAATTGACTTCTTGGTTAAAAAAAAAAM 

TAAACAGAAGGTCCTTTTTAGAMTGAGGTTTATTTTCAAGGCTACTCAGTCCTAAAAG 

CTGCATATTTACTTTTTTTCCTTCCTCCAGCCCGCGTACCT 

Sequence 2240 

NTGGCGGCCGCCCGGGCAGGTACGCGGGGGTTTGTAGATGGAAGGAAGAACTTNTGTGCT 

TAGACCTGACGCTGGGAGGAGATGCTGCCACCTAGGTTACTTGTAGGACCCTATACGGCA 

ACCTCCTTTGCCAGGAACTATTTATAAACATCCTGCAGGAAAATGAGTCTATATGTCAGA 

ATACACATTTTCCCACCTTGCCCAACAGTAGAAAAACATAAGAAGAGAAAAACATTAAAA 

AATGACAAGGAAGTTAATGGAAGTCAGCAATGTGATGGTGTTTGGAGGTGGAGCCTTCAG 

AAGGTAATTAATGCCC7TGTAAGAAGAGGCCANAGAGCTTGCGCACCTTCTTCCTACCAT 

GTGAGGAGCCAAGAAGCCGGCTGTNTGCAACCTGNAAGAGGACCCTNACTAGAAGCTAGC 

CATACTGGCATCCTCATCTTGGCTTTCC 

Sequence 2241 

GGCCGCCCGGGCAGGTCCTCGGGAGATCAGGACTCTTACAGCCAGTCATTGAAGTCTCCT 
TTAAATGAGATGTCTGATGATGATGACNATGATGATAGCAGTGACTAAGGACACATTTGG 
GAGTATTTAATCAGGTGTGGCTATC CGAGAAATCAACTTTG GGGGAAATGTAANATTCTG 
AGCTCTCTGTTTNGTTCTAGCCATGAATTTGCCTGACAAACTTGTAACCTATGTGCCTCA 
ATATATTCCATAGAAAGTAGGTCCCCCTGCCTTCTCCCACTCCTCACACTCTTCTACAGG 
GATAGGCTTTTGCAAATATATCAGATAAA I 111! i GTTTCTTGTTTATTTTTAGGTTATT 
TTCTTGGAAGGTTGGGAAMGATGTTTGTTTTAACAGATCATGTACCTC 
Sequence 2242 

CCGGGCAGGTAC I I I I I I I I I I II I II ! I I I I I It l I I I CTTAAGTATCAATGAGTTAAT 

TTTAAAGGTAATAAACTGTAACTTGTTTTNTTCAAAACTGCTTTTAM 

CTATTCCTGCACACTAACTGAACAAGGATGCTGGCAATGATAACTGTAAGACAGAGAAAT 

GACATGATTCTGAGCTGCATTTTTANAAAGATTACTTTGGCAGCCAGGCNTTATACTGAT 

TAGATGACAGAAAGACCAGAGAANCNCAAAAGATAACGGATGGGTCTTCGGNTTAATACC 

GGGGTGACNAAATAATNTGTACAACAAACCCCCGTGACACAAGTTTACTTATGTAATGAA 

CCTTCACACNCNCCCCGGAACCTAAAATAAAAGTTAAAAAAAAAAAAGAAGAAGATGGAA 

AGACCAGAGGAGGGAGGC 

Sequence 2243 

GGCGAATTGTTTTCCNCCGNGGTNGCGGCCGCCCGGGCAGGTACNNTGATTAGTTAAATN 
TAAGACTCCGTAN I II I i ACAATTTTAACAATAATTTTATTTCTTCAAGCTTGTTAGTTT 
GGGATTGTATTAAAACTACAGTGTGTGACTTAGAAAATGATAATGCTGCTTTATGGAAAA 
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TGGATTATAGGTGGGTAAGACTTCATTGCAAAAATTGTGTAATACCATCAGTGTTAGGAA 

CCCAGTTGAAGTCTAGAAGACAGATGATAGTATCTTAGACTAGGTTGGTATTTGAATAGA 

J^GGT^TATCAGTAGAAmMTAATACATTAGTMGAAAGAM 

TTTTATTTTCACTTGATACTTGTGTTGTTACTTTCMTGAGATMGAMGACAGGCAAAG 

GAGAACGTTCAGGGGCAGGGGATGAGAAGAAAACAAGAATTTTATGT 

Sequence 2244 

CTACTTAGG GCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTAC iTTTTTTl 

TTTTTTTTTTGAGGCAGGAGGATCACTTGAACTTGGTAGGCCAAGGCTGTAGGGACCTGA 

GATTATACCACTGCACTCCAGCCTGAGCAACAGAGGGAGACTCTGTCTCAAAAAAAGAAA 

AGAAAACAAAACAAMCAGAAAACCTAGAGTTTAAATCACACTTAATAAAGACTAAATAC 

ATTTCTCTCAAMTCAGGAAAAATATATGGTTGTTACTTCTTTATTCMCmGTA^ 

TGGTCATGGCCAATGCAATATGACATGAAAGGAAATAATATATATATAAGAAAGTAGTAA 

ACAATATTTATTTATAGACTACATGAGCATGTACCT 

Sequence 2245 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGATATTTGTGA 
AGTATCTCCTGGACCCCACTTGAATATGATCGTTGGAGCCAATGGAACAGGGAAGTCGAG 
CATTGTGTGT GCCATT TGCCTTGGTTTAGCTGGAAAACCTGCTTTCATGGGACGAGCAGA 
TMGGTTGGGTTTTTTGTGAAGAGAGGATGTTCTAGAA 

TCAGGGCTTCTGGAAATCTTGTAATCACCCGTGAGATTGATGTGGCAAAAAATCAGTCCT 

TTTGGTTCATCAACAAAAAATCTACAACCCAGAAAATAGTGGAAGAGAAAGTTGCAGGCT 

TAMTATTCAAGTGGGGAATCTTTGCCAGTTTCTCCCTCAGGACAMGTTGGAGAATTTG 

CTAAACTCAGCAAAATTGAACTCCTCGAAGCCACTGAAAAGTCAATTGGTCCCCCAGAAA 
T 

Sequence 2246 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGAACAAAACCCACACTATCTCTGAGG 

TGTGCTTGTAGGTTTTTGTTTGTAACATTTGCCTTACTGCACTCCTGCCTTGCGACAGAG 

CMGACTCCATCAAAAAAAAAAATGTAATTTAAAAGATTAA 

CAAATGTATATATCTACCATTATAGTATCATACAGMTGGTTTTACTGCCGTGAAGTTTC 

TCTGTGCTCTGTGTATCCATCTCTTCCTTCCCCTTGGTCCATGGCAACCACTGATCTTTC 

TACT GATC CCATAGTTTCACCGTTTCCAGAACGTCATATAATTGGAATCGAACAGTATAT 

AGAC TmCGGATTGGCTTCTTCTTTTAGCAATATGCATTTMGTrrCCT 

TTTTGTGGCTTGATAGTTCATCTCCTGTTAGCAGTGAA 

Sequence 2247 

NGCCGAGGTACTTCCGAAGATGGGCTTGTATCTGGTTTCGGACGGACTGTTAATGACAAT 
TTGATCGACGGGAATTGCACACCCCAGAATCCACCACAAAAGAAAAAGGTTACAAATTTA 
ACAATTTATAGTCCTTTTAATAGATTTCNTTTNTTTNTTCATAAATACTA 
Sequence 2248 

CGACTNCTTAGGGCGAATTGGAGCTCACCGCGGTGGCGGCTAACCCTGGGTCTTCACCTG 

CATTTTCCTTTTGAGCTGCTGATGCTTCATGGCAAGGGTTCCCCGATCCTGGGGCAGAGG 

CGGCCTCTGGATCATAGGCACAGACCAGGGCTCCTGCCAGAGCTGTCAGGAAGAGGAGAG 

TCATGAACCTCATGCTTCTGTGTGCTGGAGTGGGTATGCCACCAAATCCTTGGAGATCTT 

GGGATCTAGGGTGTCCCGCGTACGCGGGGACCTCATTCATTTCTACCGGTCTCTAGTAGT 

GCAGCTTCGGCTGGTGTCATCGGTGTCCTTCCTCCGCTGCCGCCCCCGCAAGGCTTCGCC 

GTCATCGAGGCCATTTCCAGCGACTTGTCGCACGCTTTTCTATATACTTCGTTCCCCGCC 

AACCGCAACCATTGACGCCATGTCGGGTTATTCGAGTGACCGAGACCGCGGCCGCTCTAG 

AAC 

Sequence 2249 

GTCCCGGCCGCCCGGGCAGGTACNTATTGTGTCCACTGTAAAGGTAMTGATTTNTTTTT 
TATATTGCATCAAACTTGGAACATCAAGGCATCCAAAACACTAAGAATTCTATCATCACA 
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AAAATMTTCG TCTTT CTAGGTTATGAAGAGATAATTATTTGTCTGGTAAGCATTTT^ 

AAACCCACTCATTTTATATTTAGAAAAATCCTAAATGTGTGGTGACTGCTTTGTAGTGAA 

CTTTCATATACTATAAACTAGTTGTGAGATAACATTCTGGTAGCTCAGTTAATAAAACAA 

TTTC AGAA TTAAAGAAATTTTCTATGCMGGTTTACTTCTCAGATGMCAGTAGGAC^ 

GTAGTTTTATTTCCACTAAGTGAAAAMGMCTGTGTTTTTAMCTGTAGGA 

TAAATCAGCAAGGGTATTTTAGCTAATAGAATAAA 

Sequence 2250 

CTACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACGCGGGGACCTCAGAG 

CTGAGCTGGGCATGAGTAGATGCTCAGTAAGTGGTGCACAGGGTTGGTCCCTATGGTGGA 

GGCCCCCTAACACCGCCCAACGCCCCTCCATGTTCTCACAGCTCCACGCACTGAGCACGG 

GCATGAAGGCCATGATGTCAGAATTCTGGCACCCAGGGAGCTGAGATGTGCCGCAGGGCC 

TGTGGCGGACATGGCTACTCAAAGCTGAGTGGCCTGCCATCACTGGTCACCAAATTGTCG 

GCCTCCTGTACCTGCCCG 

Sequence 2251 

CCGGGCAGGTAC I I I I I I 1 1 I I I I I I III I ! t I II I I GCCTTCTNGGGAAGTGCTGGGAA 
CGCAACTTAAGGTGTTTGTAGACACTGCCCCTCTTAGAGCCTGAAATGCCACAATTTAAG 

GTTTTCAGGGATTCTNTGCCAGTAGCCCTGGGGGTGAAAGGAGGTCAGCAAAGGGACTTG 
CCGGATGTCCGAGTAGGAANAANATTGCTNTAAGGAGTCCAATCCAAACCTCACTCATTT 
CACAATTTTACCGGAGGCTCCTNTTGCTNTCCCCAGG 
Sequence 2252 

TNGGGCGAATTGTTNCTCCACCGCGGTGGCCCGCCCGGGCAGGTACACACAAAGACAAAC 

CTGAACTTAATTTCANGGAAAACTTAAACCCATGCACAAATMTTGGTGAGCCTTCATTT ' 

CCCTGACTTCMGTTTCCATGTGAGGACTCATGCTCTCTCCACTTTCTTCTTGGGAGGAG 

GGAAGATTTACCTAATGGGTAAATTTGGGCAAAGCACATTGAGTGTGCTTGTTTGGCTCT 

GAGTCTCTTTGCAAACATGTGTCTGCCCACAGTGACATGAGTTTGCGTTGACTGTCATGT 

CTGCAGGAAGCTGCCTGCTCCTGTGGCCATCCCGCGTACCTN 

Sequence 2253 

GGAGGCTATGCAGATATATTCTTTTCTCTTTA 

TCAGTAAATCTTGTCTACAGTAATTATTACTATTTGCTATTCTAATGGGAATTTTGTATA 

TCTCCATCCCTTCTAATATATTTAACAACTGAATTCTATAMCAAGATTTGTTTCTTCTC 

CACATTATTTATTTTATCAATACAGACACATACTTTGGGGGTTATAATCCAATACTATTG 

TTATTTATTATGTNGTTCATATTGTTCTAGCTTTGGCTACTGGGATCTTTTATAAGTGTN 

TTCTGTG N CCTTTTG AC ATGCCCC ATTTTTTTCTTTTTAAGC AC ATCTTTACT^ 

ACTACMGATTGTCCAGGTTCTCATCTTGTGTTTTCCCTTCCCTAGCCTTGGAATGAGCT 

GNTTTTCCMTGATCTCTAGTTCCTTTATTAGAAAATGGTGGCCAACATGGTGAAACCCC 

ATCTACTAAAAATACCAAAA 

Sequence 2254 

CTATAGGGCGAATNGGAGCTCCCCGCGGTGGCGGCCGAGGTACATTTCTTCTAATTGATC 

ATATCTGCTTATTTTCCTCTGGATTAAGGATCAAGGAGATAGTATATTAGATGATTTGAT 

AAATTTCCAATGCTTTGCAAAGTAGTTGAACAACTTT^ 

TACTAGTCMGACTTGGTATTTCATAGCTGGCTTTCTTATTCTGAAGGTTGTAAAAAGAC 

ATATGAAGTMTATTTAACATTGAAAACATGATAATCAATTTGATTATCTATGAATTGTC 

CTAAACGTTCAGAGTAAAGTTTCCTTTAAAGTTAAAATCTTTCAAGTGMGGAAAT^ 

AAGTCACATGTAAACACCAAAATAAGAGGAATAGAGCAATAGGATAT1TTTGGCTTTATA 

ATTCAATTTAAAAATTAAGGTGCATTTATTTTTTTGGCAGCTGGCC 

TCCAGTAAGCAGTTGCTATCTATGATTCATGAGATCATGGGGTT 

Sequence 2255 

CCGCGGTGGCGGCCGAGGTACGCGGGGGACANGGCCATCTCGCTATAGGAAAGGAAAGTG 
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GAACAGCATTCATCCTCAACATTTTTACGAAGACAAAATGAAGACTGGAGTAGAAGACTG 

ATCAGTGCAGGTGTAGCATAAAAGTGTAATCCTGGAAGATGTGGTGTGAGAAAGCATTAT 

CANCAATNATGCNTNTTNCCAATCCCAAACCATGGGGGGTTTCTCACAGCTTTACACCAA 

AGGGCATCACTATCCCTCAAAGAGAGAAACCTGGACACATGTACCTTGCCCG 
Sequence 2256 

CCGCGGTGGCGGCCGAGGTACACACTTGTGTGCATTTCTGTTGGGTATATAACAAGTATG 
AAATTGCTGGGTCTTAAAATATTTATATATTAACTTCATTAAATGCTGGCTMGAAT^ 
TCMGCGGTTGTGCT ATTTTAC CTTCACCAGTTCTAGTTGTTCCATATCCTTGCTGTCAC 
TTGGCATTAATCTGAA ) Mil I ATTTTGACCAG ACTGGTCTGTG TGTCATAG TATCCCAT 
TGTCATTTTMGTTGCATGCCTTTGATGAAGACAT^ 

AATATATCCTGTMCTGATTTGTAGGGAGNTTTTATATATATTCTGGATATMNTNTTTT 
TATN TG G AC ATAAATATTTCGCAG ATTATCTN C TCCC ACTTTTAACTTCCCTTTTTTC 
Sequence 2257 

GATCCTATATGGCGNNTAGGGGCTCCACCCGCGGTGGNGGCCGCCCGGGCAGGTACGCGG 

GGGGAGCGGCCAACATGGCGGAACGCAGGAGACACAAGAAGCGGATCCAGGAAGTTGGTT 

GAACCACCTAAAGAAGAGAAGGCTGTGGCCAAGTATNTTCGATTCAACTGTCCAACAAAG 

TCCACCAAT ATGAT GGGNCACCNGGTNTGATTATTTTATTTGCTTTCAAAAAAGCAGNTG 

NGACTTGTC TTTTTGGAAT TCAAAAAGATGGGGNCAAAAGGCTCAAAATATAATGGTAGT 

AGNGAAGCCTTTTATTnTACCAAACCA^ 

GGCAAACCAGGCCTTTTTAAAAGGAAAGGCAG t rTTTTTTTI I CACCNGGAGGCNCCTTA 

AAAAAGTTAAAANTGNAAAAAATTGGAANATTAATTGGTTATNTATGGACCATATATNGA 

GNACATATANTNCATNTAATTNAANAAANAAAGGGTACCCCTTTCGGGTCNCCGCNTTNC 

TAANGAAAACCTANNTTNGGGGGATTNCCCCCCN 

Sequence 2258 

CCGGGCAGGTACCCGGGAGGCTCGCATGTGTGGGTGCATGCCGTGGCAGGAGAAGGCTTC 

CGAAGAGCCATGCATGCGTCCTCTGGCTGCTCCAGGTGTAAAAAATGCCAGGGAGGGGAC 

AATGATGAGTCCTAGGCAGGAAAGGTCCTCTGGAGAGAGGAAATGGCACACGCGCAAGCA 

CGGAGGCATGAGGGGCCAGGGTCCTGGAGGACACTGGGCCGGGACAGATCAGACTCCCCC 
GCGTACCT 

Sequence 2259 

CCGCGGTGGCGGCCGCCCGGGCAGGTACATCTTTAGAGAGTAGAGATACATCTGCATATA 

AAAAAAAAATCTGCCGTAAGTGATAGAGTTGAAAATAATGTCCTACATACCAGAATGTTC 

TTCTG AATTAG GCATCATGTAGGTATTCAGCATTCTTTCCATTTCCAAATTGCTCCCACT 

C CCTG u I 1 > iCATCATCCCCCTTTTTTCCTCATTAGCTCATTTGGAAATAAMGATTTT 

TATTTTATTTTCA I rTTTTTTTI ICTGCAGAAGAAGCCACATATGTATGGAGATTTAGGC 

AATGATTTCTCAAGGAAAAAATGTTTTAAAAGTACAGCATGGCTGGTGGTTTCAAAAAAC 

GTAGTCATTCTTCTCACTGCAACCAATGTAAGATAAGCAGGGTAGATCTGTTATTTCCAA 

ATTAAAGGTGGATTAAGATATATGGAGAGAGAACATGGCATGTGAGGTTTATAGGGCTAG 
AAACTG 

Sequence 2260 

GCGAATTGGAGCTCCCCGCGGTGGCTNAAACATCTTTrTCTTCTTAGTCCCAGGTTGTTC 

TCATTCGTGGCTGGATCACACCAGCAAGAAGGATTTTAAAGGACAACTCCTTGGGGCCGT 

CCAACACTGAAACCGCCTGACCGCTGCTGTCTTTGTGCTTGCAGAAAATCACGTGGCCCC 

GGAGAGGGTCAGCGANGGGCCTCCACTGTCTGAGTCCGCTCCTGGCAACGCCTGCTCTCT 

GCACTCAGAGGCCCGGTCCCGGCAAGCTGCCACGTTGCTCAGAAGTTCTGGCAGGGGGAG 

CCTGAAACCCCTGTCTGCTGGCGACACTCCCGGAGCAGCAGGAAATCTCTACATTTTATT 

HUM CTTAACAGACA AAAGTA TTTGACTCCAGAATGATTTCCAGGAATTACAGTATGA 

GAACTTAGTCTTTGAGA I I I I TATTTTCTTCTTAAAAAAATAAAAATGAAAT 

Sequence 2261 
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CCGCGGTGGCGGCCGAGGTACGCTTTGACGACACTCTCAGACATGCTGTGCAACTTAACG 

TCACTGCCACCCGGCAGCTCTTGCTTATGGCTAGTCAGATGCCAAAGCTGGAAGCCTTTA 

TACATATCTCTACTGCCTATCCAAATTGTAACCTGAAGCACATCGATGAAGTTATCTATT 

CCNGCCCTGGNGGACCCAAAAAAAATCATNGATTCCCTTGAGTGGTTAGACGATGCTATT 

ArTGACGAGATTACACCCAAGCTGATCAGAGATTGGCCCAATATTTATACCTACACCAAG 

GCCTTGGGAGAAATGGTGGTGCAGCAAGAGAGCAGGAACCTGAACATTGCCATCATAAGG 

CCCTCCATTGTGGGAGCAACTTGGCAGGAGCCTTTCCCAGGTTGGGTTGATAATATAAAT 

GGACCTAATGGAATCATTATTGCGACTGGGAAAGGGTTTCTTCGGGCCATAAAAAGCTAC 

TCCAATG 

Sequence 2262 

CGAGGTACGCGGGGGAGATGCTGCCACCTAGGTTACTTGTAGGACCCTATACGGCAACCT 

CCTTTGCCAGGAACTATTTATAAACATCCTGCAGGAAAATGAGTCTATATGTCAGAATAC 

ACATTTCCCACCTTGCCCAACAGTAGAAAAACATAAGAAGAGAAAAACATTAAAAAATGA 

CAAGGAAGTTAATGGAAGTCAGCMTGTGATGGTGTTTGGAGGTGGAGCCTTCAGAAGGT 

MTTAATGCCCTTGTAAGAAGAGGCCAGAGAGCTTGCGCACCTTCTTCCTGCCATGtGAG 

GAGCCAAGAAGCCGGCTGTCTGCAACCTGCAAGAGGACCCTCACTAGAAGCTAGCCATAC 

TGGCATCCTCATCTTGGCTTTCCAACTTCCAGAACTGTGAGAAGTATATGTTTGTGGTTT 

AGTCMTGGTCTATGGTAATTTTTTTATAGCAGTCCCAGCCAAGACAGTGCCTCATTTAC 

TACATACCATTTATATTATTATATAGGCTCCTTTCAGAAACCCATGTTCAAATAAGAGAT 

MGATACTGAAACACATAACACCTTCACTAGTTTTTAGTATACAAATATTGAGAAATAGT 

TTGGTATTAACTATCTCATNCAAGAAATGCAGATTCATGTTGGTTCTAATTTTTATTATA 

TAAATGACAAAATGNAGAAACTTAACACCATCCTAGATTTTAGCTGCCCCNG 

Sequence 2263 

CCGCGGTGGCGGCCGAGGTACCTTAGATTTCTATGGGACATCGTTTAAAMCTATTGTTT 

ACGCGAGAGCCTTGCTAATTTCCTAAAAATTGTGGATACAI Nil I CTCCCATGTATAAT 

TTTCTCACCTTCTATTTAAAAAGAAAAAAAAAAGTCAGTGTAGTATTTACATATTTT 

CTATAAGGAGCTAACATAACTTTTGATTTAGTGTTATTCATAAAATTAGGTTAGCAGTTT 

ATTMCCTTTTGTATTTGCTCTGGCAATGTTTAATATCTCATAAGCTATACACACCTCGA 

AGCCATCAATGACMCCTTTTCTTGCTGAATAGAACAGTGATTGATGTCATGAAGACAAT 

TTTATCTCCTTTTGCCTTCCATAATTTGTACCTGCCCG 

Sequence 2264 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTNNNTTTTTTT 
rTTTTTTTTTTTTTI 1GGGAATCTAAACCAAACAAAACATGATAATTTCAGAAGAGATTA 
ACTNGATTTTTTTCCTTCATCGGCAGCAGACTCTGACTTACCCACCTTTCCCACTATTTC 
TTTGAGTAAGGATTTGAATAATTTAAGCCTTCA 

TTGNAAATATTATGAGTAAAAATCAGGAAATGCCTCACTGGTNCCCTTGGTTTCTCAGAC 

AACTCACTGATTTATGGTCTTGAGACCATAAACTCA 

Sequence 2265 

CCGCGGTGGCGGCCCGAGGTACAGGAGGGAACCAGGCGCTACAGACTGTGTAATGAGTGT 

CTTGCAGMTTTGGCATAGACAGCCTCCCCATTGACTTGGAAGCTGAACAACATCTTATG 

TCCCCATCAGATGGAGATAAGGATTCCAGATGGCACTTGAGTGAAGATGGGAATAGATCC 

TATGTGGAGATTGTAGAAGATGGGTCTGCTGATCTGGTCATCCAACAGGTTGATGATAGT 

GAAGAAAAAAAAAAAAAAAAAAAAAGGTACCTGCCCG 

Sequence 2266 

CGCCCGGGCAGGTACGCGGGATTGACATTGCAAGAAGCAAATGTTACCTCTAATAGCGTG 
GTCCAAAATAATGTCTATTTACATTCTCAGAGCACTTTGTCAAGGTTATTCAGTTGAAGG 
CTGTAACACTCTCAGGAGTTAATCAAGAGCAATGGCATCTATGTTTATTAGTGAAGGGAA 
G ACTT ACTTG CACCTACTGCATATCTG AGTTTACTG AG CTCAAACTGG AATTTAGG AAGC 
CAAAAAAAAAAAAAAAGAAAAAGAAAGACAGTGTAGACAGAGGCAAGAAAGTAGAGTGAT 
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TGCATTTGGGCATCTGAATGACCCATCTTGGAATCACAGCTCAGTCACTGGCTGTATGCC 

TTGAGAATGTTTCTTCTCTGTTCTGAGCCGCTGGTTCCTTATTTATGAGATAAATGAGGA 

ACTGTGCTGCACGCAGTTGGTTGGAGCCTGAAATGAGATAATGGATGTTGTCCTAGTTAG 

AAGATGAGTAATTTTAGGGCCGGATGTGTTTGGCTCACGCCTGTAATCCCACCAGTTTGG 

GAGGCCAAGGGTGGGCTGATCACAAGGTCAGGAGATCAAGACCCGTCCTGTTTAACACGG 

GGAAACCCATTCTTTACCAAAAAATACAAAAATTAGCCANGNCTTGGTGGCGGGTTGCCT 

GTAGTCCCAACTACTTAAGGAGCTGAGGCAGGAAAATGGGTGTGAACCCTGGGANGCCGG 

ACCTTGCCAGTGGNGCTGANATTGGCACCAANTGGCCTTCAACCCTGGGCGANAGANACG 

AGGACTNTGTCTTCCAAAAAAAAAAAAAAA 

Sequence 2267 

ACTACTTAGGGGGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGGAGGGAACCAGG 

CGCTACAGACTGTGTAATGAGTGTCTTGCAGAATTTGGCATAGACAGCCTCCCCATTGAC 

TTGGAAGCTGAACAACATCTTATGTCCCCATCAGATGGAGATAAGGATTCCAGATGGCAC 

TTGAGTGAAGATGGGAATAGATCCTATGTGGGAGATTGTAGAAGATGGGTCTGCTGATCT 

GGTCATCCAACAGGTTGATGATAGTGAAGAAAAAAAAAAAAAAAAAAAAANGTNCCTGCC 

CG 

Sequence 2268 

CGAGGTACGCGGGGGAGCGGTGAGTGATCCTTTCAGCAAAGTCAGTCCTGGGCAGGAGAC 

GGCTTCAGGAATACTGTCAGCTTTACTGGATTCCACCATCGCTTTCCAGGACTGTTTAGG 

CCCTGGGCCCTTGAAGGGTTTGCGTGCCTCTTGTCTCCATTCATACCTCAAGAACTTTGT 

TCATGTTAAI rTTTTT I CACTCTATCATATGGAATTGAGCAAAAAAAAAAAAAAAAAAAA 

AAGGTACCTGCCCG 

Sequence 2269 

CCGCGGTGGCGGCCCGAGGTACCCTGCTGAAAGATTATTTCTAACAGGCTTGTAGAGAAA 

CGTCGGTTCATGTAAATTAGAMTTATGGGGCCACTTTGCCATTCTTCACACCTGCAATG 

MCAGGTGTTTATCTGCAGTTCTGACTTATCTCTTGAACTCCATTTGCATGTTATAGTGG 

GATGCAGCTGATGCCCTGTCCAGATCTTCTTCAGGCCACTACATCTATATATGCATTCAT 

ATTCCAGTGGCTGTGAGTGTTGGCTGTTGGTTGACAGAGGAGCTGCATCCTCCTGGAGGA 

AACTGAACTCAGCTGATGAAAGCCACCTGGTCCTGGAGGTGAAGCATCTTCCAAATGACA 

GCCTGCAGTCAATGACTGATGAATATGACTTCATTGCCTCATGACAGGACCTACTCTGGG 

GTATAGATCATGCTTCTAAGCTCCTCCTGGGGTCCTGCTGAGGCTCAATGCCAGCTGAAA 

CCATACCCTTGCTCACATACTTTCCCTTCTCTTTCCTTCTTCCCTTGCTTCCTTAGAGAT 

TCCTCCTGCAATGGCTTCCTCAAAAAAGCATTTTCATAAGCAACCTCATGTCAGCCTCTG 

CTTTCACAGAAGCCAATCTGANGTTAACAAGCTTGTTTAACTTAGCATGTCCAACATAAA 

ATTCTTGGTTTCTTCTCATCCCCTGCCC 

Sequence 2270 

ACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGTAGATG 

GAAGGAAGAACTTGTGTGCTTAGACCTGACGCTGGGAGGAGATGCTGCCACCTAGGTTAC 

TTGTAGGACCCTATACGGCAACCTCCTTTGCCAGGAACTATTTATAAACATCCTGCAGGA 

AAATGCAGTGAAGTAGAAGAGACAGGGATATCCCAGAAGGTTATGCAAAACATCAAGAGA 

AGATGAGAGGTCAGAGATGGGAAGAAACAAGAACTTTGACATGCTTGGTGTTCTTGCCCA 

AGCTTTGAAGAAGTTTACAAAGTCTATATGTCAGAATAC ACATTTC CCACCTTGCC CAAC 

AGTAGAAAAACATATGAAGAGAAAAACATTAAAAAATGACAAGGAAGTTAATGGAAGTCA 

GCAATGTGATGGTGTTTGGAGGTGGAGCCTTCAGAAGGTAATTAATGCCCTTGTAAGAAG 

AGGCCAGAGAGCTTGCGCACCTTCTTCCTGCCATGTGAGGAGCCAAGAAG 

Sequence 2271 

ACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGATGATTTT 
TTATTTTAAAATMTCTGGMGTAATG 

CCAAATTGGTTAGATGAGGCCGGGCGCGGTGGCTCACGCCTGTAATCCCGGCACTTTGGG 
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AGGCCGAGGTGGGTGGATCACCTGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGGT 

GAAACCCCATCTCTACTAAAAATACAAAAATTAGCCAGGTGTGGTGGCGCCTGGTTGAGG 

CATGAGAATCGCTTGAACCCAGGAGGTGGAGGTTGCAGTGAGCCAAGATCATGCCTACTG 

CACTCCAGCCTGGGCAACAAAGTGGGACTNTGTCTTAAAAAAAAAAAANAAATAATCGGT 

TAG ATGAN GAAAGC ATGTATATTTTCTATATACC AAAAAC 

Sequence 2272 

CTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACGCGGGTGCCCAGGTCTG 

GCATCCTGCACTTGCTGCCCTCTGACACCTGGGAAGATGGCCGGCCCGTGGACCTTCACC 

CTTCTCTGTGGTTTGCTGGCAGCCACCTTGATCCAAGCCACCCTCAGTCCCACTGCAGTT 

CTCATCCTCGGCCCAAAAGTCATCAAAGAAAAGCTGACACAGGAGCTGAAGGACCACAAC 

GCCACCAGCATCCTGCAGCAACTGCCGCTGCTCAGTGCCATGCGGGAAAAGCCAGCCGGA 

GGCATCCCTGTGCTGGGCAGCCTGGTGAACACCGTCCTGAAGCACGTCATCTGGCTGAAG 

GTCATCACAGCTAACATCCTCCAGCTGCAGGTTGAAGCCCTCGGCCAATGACCAGGAGCT 

GCTAGTCAANATCCCCCTGGACATGGGTGGCTGGATTCAAAC 

Sequence 2273 

CCGCGGTGGCGGCCGAGGTACGCGGGGGCATTGTTGGMGTTTACTCTGGTTAAAGGATA 

GAGAAAAAAATGGAAAGAGGAGGGTAATCACAACAGTCTAGGCAAGAGATGATGGTAGTT 

TGGAGACAGAAGGTGTGGCAGTGGACATGAAGATAAGTGGATGGATTTGAGAGAAACTTG 

GAGAGTGAAACTGCATGGATAGTGATGGATCAATGACAGTAGACTGGGATGAATGGAAGT 
ACCTGCCCG 

Sequence 2274 

CTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC I H H I It 111 M 1 

llllllllllllll i GAAANGTTAAAAATT7TAAAATTTAATAACTTGAATATACATCTG 

CTTTGAAATAGGACATAAAGTCATACAGTATTTTTACATAATTTTCTTGC 

TAGTATGCAACTCATAAAGAAATAAAGATCTAAGGAGCNCAAAGATNCATGTATGTGATT 

TAAAAGMGCATTTCAAATAAACAGGAAAAAAATTACTCTCTTATTTCATACCGTATCTA 

TTGNCTTCATCATAACAGAAATCTATAATAATAAGAACTAGAGCTNTCACAGCACCTNTC 

ATNTGCTGACACTGNGTTAAACACTNTACATCCATCACCTCCTNTAATC 

Sequence 2275 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGCAGTGG 

GAAGCTCGCAGCAGCTGGGGAGGAGCCAAAGCCTCGGCGCTCACCTAAGCCGCAGGGAGA 

TACACCCAACTGGGAGATGAGGAAACAGCAACCCAGAGAGGAGAACTAACCCACACAGGA 

TCATTTCGCGAAGGAGCAAGGCTGAAGAACCAGACCTGGACTTTCTTAGGACAAACTTAC 

TGCAGCTTGAAGGAGCCAACCATGGATTTGAGGCGTGTGAAGGAATATTTCTCCTGGCTC 

TACTATCMTACCAAATCATTAGCTGCTGTGCTGTTTTAGAGCCCTGGGAGCGATCTATG 

TTTAACACCATCTTACTAACCATTATTGCTATGGTGGTATACACTGCCTATGTCTTTATT 

CCAATCCACATTCGCCTGGCTTGGGAATTTTTCTCAAA 

Sequence 2276 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTAGACGGGGCTGTAAT 

CTGGTAACTGTATGTATTTTAGTTCTTCAGTCCCTGGGAAGGAGACAGGAGAAGGTGGGA 

GGGAGGAAGGGGCCAGCTGAAATGGAAACAGATCCCTGATCCGGGGCGGTCAGTGGAACC 

CTTCTTGGTGTGCGAGAGCCTGTGCATTTCAGAGGCAGCAAAAAAGTTAAAAAAAAAAAA 

AAAAATTGATCTTTGTTTAGATTAACAGACCCCTGACTATGAANAAGGAAGGCATCCAGA 

CCAGAAACCGAAAAATGTCTAGCAAATCCAAAAAGTGCAAAAAAGTGCATGACTCACTTG 

GAGGACTTCCCCAAGAACAGCTCGTTTAACC 

Sequence 2277 

ACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGTTGC 
AGTGAGCTCAAGTGTTGGGTGTATCAGCTCAAAACACCATGTGATGCCAATCATCTCCAC 
AGGAGCAATTTGTTTACC I I I I I I I 1 CTGATGCTTTACTAACTTCATCTTTTAGATTTAA 
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ATCATTAGTAGATCCTAGAGGAGCCAGTTTCAGAAAATATAGATTCTAGTTCAGCACCAC 

CCGTAGTTGTGCATTGAAATAATTATCATTATGATTATGTATCAGAGCTTCTGGTTTTCT 

CATTCTTTATTCATTTATTCAACAACCACGTGACAAACACTGGAATTACAGGATGAAGAT 

GAGAT AATCCGCTCCTTGG C AGTG7TATACTATTATATAACCTGAAAAAAC AAAC AG GTN 

ATTTTCACACCNAAGTAATAGATATCATGACACATTTAAAATAGG 

Sequence 2278 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCATGAGAGACAAAGCTGAGAAGTGTGGTG 

GAAAAGGAAGAGAGAGCTGAGGCAGACGATGTAAGATCTGCTGTTACTTCTTGCTGTGTG 

ACCTTGGGGTGAAGCAGCAAGAANGCCCTTGCTAGATGAGGCTNCTCAACTGTAGACTNC 

CCAGCCTCCANAACTGTGAAGNGAAGTGGGTGTGCCTGGGATCNNACACAGCTGCTAAAA 

GAAGGNCTNAAGAGCCACAGATGTCTT 

Sequence 2279 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGTAGATGGAAG 

GAAGAACTTGTGTGCTTAGACCTGACGCTGGGAGGAGATGCTGCCACCTAGGTTACTTGT 

AGGACCCTATACGGCAACCTCCTTTGCCAGGAACTATTTATAAACATCCTGCAGGAAAAT 

GCAGTGAAGTANAAGAGACAGGGATATCCCAGAAGGTTATGCAAAACATCAAGAGAAGAT 

GAGAGGAGTCTATATGTCAGAATACACATTTCCCACCTTGCCCAACAGTAGAAAAAAAAA 

AAAAAAAAAAAAAAAAGTACCTGCCCG 

Sequence 2280 

CCGCGGTGGCGGCCGAGGTACGCGGGGGCATTGTTGGAAGTTTACTCTGGTTAAAGGATA 

GAGAAAAAAATGGAAAGGGGAGGGTAATCACAACAGTCTAGGCAAGAGATGATGGTAGTT 

TGGAGACAGAAGGTGTGGCAGTGGACATGAAGATAAGTGGATGGATTTGAGAGAAACTTG 

GAGAGTGAAACTGCATGGATAGTGATGGATCAATGACAGTAGACTGGGATGAATGGAAGT 

ACCTGCCCG 

Sequence 2281 

CTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCATGATT 
AGTTAAATATAAGACTCCGTAATTTTTACAATTTTAAC 

TGTTAGTTTGGGATTGTATTAAAACTACAGTGTGTGACTTAGAAAATGATAATGCTGCTT 

TATGGAAAATGGATTATAGGTGGGTAAGACTTCATTGCAAAAATTGTGTAATACCATCAG 

TGTTAGGAACCCAGTTGAAGTCTAAGAAGACAGATGTTAGTATCTTAGACTAGGTTGGTA 

TTTGAATAGA TATTGGTAA TATCAGTAGAATTTAATAATACATTAGAAAGAAAGAAATCA 

GAGAAGATTCTTTTATTTTCACTTGATACTTGTGTTGTTACTTTCM 

ACAGGCAAAGGAGAMGTTCGGGGGCAGGGGATGAGAAGAAAACMGMTTTTATGTTGG 
AC ATGCTAAN GTTAAAC ACCTGCTTAACTC AG 
Sequence 2282 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGATTGA 

CATCCCACACTACCTGATTTGATACACTAAGAAGGGCTTATCATTTCTGTGGTACTCTTG 

GCCAACAATGCACGTGTMCATTTATTCATTGAAACACATTAGGCAGAATTGCAGGACAT 

TCTTCAAAATAGCTGTCCAATACTCTTCAAAAGTGTCAGGTCCTGGAAGACAAAGAGATA 

CTGAGGAACCATCACAGAGTGGGAGAGGACATAGAGTGATAAAAACTAACTGTGATGTGG 

AATCCTACATTGGATCATGGACCAGAAAGACAGCACTGATGGGAAGACTGATGAAATCTG 

MTAAGTCTGTAGTTTGGTTTAAAGAAGAAGAATMTAACAATMTGGTTTAGCTGCTGG 

CTCCTTAATAAAATTCCCCTANTTACTGTAATGTCTGAAAATGAACCCCCTATGGTGGGC 

TACTGTGACTGTTCACTGCATTTGAGAATAAAGAATAATTAAGGAAAAAAATACTCTGTC 

AGCCTAGGCAATATAGTGAGACTCTATCTGTACCT 

Sequence 2283 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCNCGCCCGGGCAGGTACCAGGCAGGATGGAG 

CAGGATGGTGTGAG ATTTC ATCACACTACTCAGGATAGTGCACAATCTAACACTTATGAA 

TTGTTTATTTCTAAAGTTTTCCATTTAATATATTTGAAATGAAGTAGACCCTGTGTGTAG 
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CTGAAA CTTT GGAAAGGAAAACTATGGATAAGGGGGACTACTAGAATACACTTCAATAAA 

ATAGCATTTTAACAAAATCCTAGTGTCCGTAGAGTAGTTCTCCATCTAGCTGTCCATCTT 

TCAAACGTGGAGAGAGGTGAGATGAGGGAGAAGTATGGTCAGAGAGGGGAGACAGGTGTG 

TGTGGTGTGGGTGTGCAGGTAGATACATAGCTCAGTGTTCACTGTGAGAGGTGAACAGAG 

AACATTCCAAGCAATGGGAGTGGCATGATACAAGGCACAAAGAAGTCAAACTGAAAGGCA 

CCTTCAGGAAAAAGAAAGGAGTCTCTTGCAATTCATACACAGAGATCGGCACTGGCAAGA 

AACAGGACTCTGGGGGAAAAATCAGGGCTTCCAAAAAAA 

Sequence 2284 

TATAG GGCG AATTGGAGCCTCCACCGCGGTGGCGGCCGCCCGGGCAGGTACTTAAACAGT 

TATAGTCACCATCACCTGCTTCAGAATGGTCTTTTAGATTTGTGTTTGTTTTGTTAAAGT 

TGTTGGCACCAGGATGCAGAGAATCAGACTGGCCTGAGGTGAAGGAGCACACAGCCCTGA 

GGGC7TGGAACCCTGGGTCCAGTTCCTCTTCACACCCCCTTCCACTCTGAGTAGCACATC 

TCCCCAGGTGCCCATGGAACACCTGCTTTCATCCCAAATATCCGTCCACCTAGGCGGGGT 

GGTATGTTCTTACGTCTCTCTGACTTTGATGCCACTCATTCTATAGTTTAGCTGGTTTTC 

Q^ CAAGATATTC TTGGTAGTAACTGACAAGTATGTTGCACATGTATTGGGGGAGGCGCT 

TCAl I I I I ATTTTAATACACATGTATTTCCTCCTTGCACAGGATTTTGATGGTGTGGGAA 

TATCCTAAGTGGTAGCCTTCCAAAGTAGCAGTGAGTTGACATTCAGCTGCTTTTAACTAT 

TCAGGCTACCTT TTATA CTAAACCTTGAAAACTAGAATCTAATGTCTACCCCAAAAAAGT 

AAGTTCTTTGATATTTTATACTTTTTATGTACCTTGGCCCGGTCTAGAACTA 

Sequence 2285 

AGGTACCCAGACTCCAGTATGAAAACTCTCTGGGCTGTGTCCTATGATCTTCCCATGAGT 

AACTCATAGTCTTGATCCAGTGGAATCTGGCCTTCATTAGTCTCAGTGGCAAGTTGGTTA 

TGTGGAAAGTCTCTGTTCACTCACTTGGGTGAATAACAGTAAAGACCTTTCTATTGTTTT 

CACTTTACATTAGGCCATGAGTATTTGTGCCTGTAGCTGCAGTTTGTGTTAGTTTCCTAC 

CCCAGGTATCTCCTGCAGCATGCAGCTTCAGTCCTACCAGACCCTCAAAACTTAAAAGCG 

AACACTATTTCTAGGGAGGATTTTGCAGGAAAATGGAGAAAGGGTTACACACAAAAAAGG 

TTAMCTACTCTATGCATGTTTCTGCAATGTGTTATCTCAAGAATTCATCTCTGTAGCCC 

ATCAGGGCAGGAGCTGGTCTCTCACCTGTTGATAATATTCCATAAGGGAGGTTCTTCCCC 

ACAANTGTTTAGTCTTCCGACGCTGGTATAGCCTGACATGATGACATTCTACTTTCATGT 

TGGTCGTGCTGCAGGGAGAATTCTGTGAGTGTCCCAATANGCTGGGAATCACTTGCTAAG 
GGTGAACCCCA 

Sequence 2286 

CCGCGGTGGCGGC NCG AGGTACTTTTAGTAGAG ATGGAGTTTC AC CATGTAGTCC AGGCT 

GGTTTCAAACTTCTGACCTCAGGTGATCCACCTGCCTTGGCCTCCCAAAGTGCTAGGATT 

ACAGGTGTGAGCCACTGTGCCCAGCCAGAATATATCATTTCACTGGACTCTGCAGGTGCT 

TTGGATGATCAAGGAATAGGACATGGCTGTAGAAGTACGCGGGTGTTGAGGAGGCATATT 

TTTAAAAGAATCAAATATAATTTAGATAGGAAATGACAAACTCAGCAGACATATTATGAA 

GGAAATGAGACATACAGGAGMGAGAATTAATGAATTGGTTGGAGGCTAGAACTTAATAC 

CCTCATTGTAGCCAAGAAGAGAGAGAGAGAAAAAAAAGGAAAACATGAAGGGAATAAAGA 

AACATGTTAGTGGGATGAGAGTCTCACATATGTATTCTGTAAAGTAGAAACTGAAGGAAG 

GGACAATATTCAAAGTGAGATGACCAAGTACCTGCCCG 

Sequence 2287 

GACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACCATCCCCAAGG 

ACACGGAAGTATTTCTCATCCTGAGCACTGCTCTCCGTGACCCACACTACTTTGAAAAAC 

CAGACGCCTTCAATCCTGACCACTTTCTGGATGCCAATGGGGCACTGAAAAAGAATGAAG 

CTTTTATCCCCTTCTCCTTAGGTAAGCTGGACCCACAGTTTCTTTCCCAGACACCAGAGG 

GGCAGGTCCTATCCTCAACTTTGAGAAAAAAAATGACAGGTCCTTATTAATTGAGCACTT 

AATATATTCCAATTGCTTCACCTGCCTTATCCCCTTCCATCTTCACTACAACCCTGTAAG 

GAGGCTTGAGAAAGAAGATGACATTCCCAAAGGCACATCTGGGCAAGCAGGAACTTGGGC 
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AAGTATTTTAACATCTTCTAAACCTC 
GT 

Sequence 2288 

CTATAG GGCGMTTGGAG CTCCC CGCGGTGGCGGCCGAGGTAC l rTTTTTTTTTTTTTn 

I I I 1 I I CTGCTTTGATTTCA I I II lAI I I 11 1 I AAG AAG AAAATAAAAATCTCAAAG ACT 

MGTTCTCATACTGTAATTCCTGGAAATCATTCTGGAGTCAMTACTTTTGTCTGTTAAG 

AAAAAAAATAAAATGTAGANATTTCCTGCTGCTCCGGGAGTGTCGCCAGCAGACAGGGGT 

TTTCAGGGCTCCCCCNTGCCANAMCTTTNTTGAAGCAAACGTGGGCAAGCTGCCCGGGA 

CCCGGGCCTNTGAGTTGCAGAGAAGCAGGCTGTTGCCAGGAAGCGGACTCANACAGTGGA 

GGCCCCTCGCT TGAC CCTTTCCGGGGCCACGTGATTTTNTGCAAAGCACAAAGACAGCAG 

CGGTCAGGCGGTTTTCAGTTGTTTGGACGGCCCCAAGGAGTTTGTCCTTTTAAA^ 

0 

Sequence 2289 

GNGGCGGCCCGAGGCACTNTACTN rTTTTTTTTTTTTTlTTTl 1 GGGATTTTTAGCTGAT 
GAMTATGGTC CCATT TATCATCAGTTCTAACTTCGTGTTACTACTGTTGTTCTTAAGGT 
CCTTAAAGTAAATTTTAAGACTCAATCTAAGTGTGTCAGTGTTGATTACCATTCTATTTA 
NAMGGCAGAATTTGTCTTAAMGATATTTGNCTAAAGTGTGTGTTAGTATTTGTCAAAA 

TAAATGGAGGGAGGGAGTGGGTGAGCTGGGATAGAGGCAGGCAGAAAGCAACAGGACAGA 

GAGTTNGANANAGAGAAAGAAAAAAAA 

Sequence 2290 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTGCTAAAGAACTCTAGAACATACAGGGTGTAG 

ACGGCAGTCTTCTTGGGGAAAMGAGGCTTCAGGATTGCAAGATTTCAGCTACTGTCTCT 

CCTCGGAGCAGGATTCCATCTTTTCCATGGCTGGGGCCCTTGACACATGTCACTGTACAC 

TGCTCTTCTCGCTGGGATTTCCAAAAGACCTTGGCTCAAGGTGTCACTTTTGCAGAAAAG 

CAT TTACT GACTGTTGACTTGGCCAAATCCCTTGTGCTGTGTTTCCTAGCACGGNGAGTT 

GCATTTTTCACATGAACTCATGCCTACCTTCTCTTGTTTTATAAGCTTCATGAAGGCAGG 
TACCT 

Sequence 2291 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCCGGGAGGCTCGCATGTGTGGGTGCATGCCGT 

GGCAGGAGAAGGCTTCCGAAGAGCCATGCATGCGTCCTCTGGCTGCTCCAGGTGTAAAAA 

ATGCCAGGGAGGGGACAATGATGAGTCCTAGGCAGGAAAGGTCCTCTGGAGAGAGGAAAT 

GGCACACGCGCAAGCACGGAGGCATGAGGGGCCAGGGTCCTGGAGGACACTGGGGCCGGG 

ACAGATCAGACTCCCCCCGCGTACCT 

Sequence 2292 

NCGCCTGGGCAGGTACACACTTGTGTGCATTTCTGTTGGGTATATAACAAGTATGAAATT 

GCTGGGTCTT AAAAT ATTTATATATTMCTTCATTAAATGCTGCCTAAGMTTTTTCAAG 

TGGTTGTGC TATTTTA CCTTCACCAGTTCTAGTTGTTCCATATCCTTGCTGTCACTTGGC 

ATTATCTGTATTTTTTATTTTGACCAGACTGGTCTGTGTGTCATAGTATCCCATTGTCAT 

TTTAAGTTGCATGCCTTTGATGAAGACATTTTCTGATGCTTAATTGGCTCTTT 

TTCTGTAACTGATTTGTAGGAAGTTTTTTATATATT^ 

ATAAATATTGCAGATATCTTCTCCCACTTTAACTTCCCTTTTCTTTCTCTTAATCATGAC 
TTTCTTAATACAAAGTTCTTAATTTTAATGT^ 

GTTAATGCTTATTTAAGTCTTGCTCAAGAAATTTTTGTCAACTCCAGTGTCATAAAGAA^ 
CTC TCTTATGTTATAG AAGCTTGGTTTTTTTTTTTT 
Sequence 2293 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGGAGATGCTGCCACCTAGGTTACTTG 
TAGGACCCTATACGGCAACCTCCTTTGCCAGGAACTATTTATAAACATCCTGCAGGAAAA 
TGAGTCTATATGTCAGAATACACATTTCCCACCTTGCCCAACAGTAGAAAAACATAAGAA 
GAGAAAAACATTAAAAAATGACAAGGAAGTTAATGGAAGTCAGCAATGTGATGGGTGTTT 
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GGAGGTGGAGCCTTCAGAAGGTAATTAATGCCCTTGTAAGAAGAGGCCAGAGAGCTTGCG 

CACCTTCTTCCTGCCATGTGAGGAGCCAAAGAAGCCGGCTGTCTGCAACCTGCAAGAGGA 

CCCTCACTAGAAGCTAGCCATACTGGCATCCTNATCTTGGCTTTCCAACTTCCAGAACTG 

TGAGAAGTATATTGTTTGTGGTTTAGTCAATGGGTCTATG 

Sequence 2294 

CCGCGG1 GGCGGCCGAGGTACACTGATTTCCGATCAAAAGAATCATCATCTTTACCTTGA 

CTTTTCAGGGAATTACTGMCTTTCTTCTCAGAAGATAGGGCACAGCCATTGCCTTGGCC 

TCACTTGAAGGGTCTGCATTTGGGTCCTCTGGTCTCTTGCCAAGTTTCCCAGCCACTCGA 

GGGAGAAATATCGGGAGGTTTGACTTCCCCGCGTACTTTGGAGTCCCCTGGTTTCTCAAG 

MTTGCCGTTGACTCTTTCTTTGGCTTCTGCTGGCACGGTAACCAGACTCCCTACAACTG 

CACTCTTTGTCTTTGTCATGGAAGCCGCGAGCCGTAGAGGTTCCGCGTGCTCTGCCGGAC 

TGTGAGCAGGNTCACTGGGTCCTTTACACTTGTGAATTCGAAGCTTGCCAGA 

Sequence 2295 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACTGATTTCCGATCAAA 

AGMTCATCATCTTTACCTTGACTTTTCAGGGAATTACTGAACTTTCTTCTCAGAAGAT^ 

GGGCACAGCCATTGCCTTGGCCTCACTTGAAGGGTCTGCATTTGGGTCCTCTGGTCTCTT 

GCCAAGTTTCCCAGCCACTCGAGGGAGAAATATCGGGAGGTTTGACTTCCCCGCGTACTT 

TGGGAGTCCCCTGGTTTCTCAAGAATTGCCGTTGACTCTTTCTTTGGCTTCTGCTGGCAC 

GGTAACCAGACTCCCTACAACTGCACTCTTTGTCTTTGTCATGGAAGCCGCGAGCGTAGA 

GGTTCCGCGTGCTCTGCCGGACTGTGAGCAGGTCACTGGGTCCTTTACACTTGTGAATTC 

GAAGCTTGCCAAGATGTATCCCCAATGCATTGCCACTTCTGCCCCGGTTGT 

Sequence 2296 

CCCCGCGGTGGCGGCCGAGGTACNCGGGGGTGGACAGACATGGGAGTGCTGAATCCCCAG 

AGTTAGAGTCCACATCATGTCTCTCTGAAGAACAGNTAAAGTGTCTTCTGGATGAATGCA 

TACTTAAACAAAMTCCATCATTAAACTTTCTTCAGAAAGAAAAAAGGAAGACATTGAGG 

ACGTAACACCTGTGTTCCCCCAGCTTTCCAGGTCCATCATCTCTAAATTGCTAAATGAAT 

CAGAAACAAAGGTCCAGAAAAACTGAGGTAGAAGATGCCAGATATTGCTTTGAGAGTGAA 

GAAATGTTGAAGCTTTCTTAAAAGGGCTACTATCTCACTAAAAGCCCTTTGACCTGGGGA 

CCATTAACCATGGTTCAAAAAGCTTCTTTGTTCCACTTGAAAGCCAGGAAGAAAATATTG 

AAAANTGCCCTTTTCAAANTTT 

Sequence 2297 

AGCGGCCGCCCGGGCAGGTACACACACTCACGCATGCACAGGCACACTCACCGGGACACG 

CACACAGTTGTGCTGTGCACACAGGCATACACACACACACCCTCACACGCACACACACTC 

GCTGGATCACTCACACACACACATCCACGAGAGGCGCGGCGCCTCTTTGGTCGTGCAGCC 

AGCCCCTGGCTTGCTTTTGAAGGCTCTGCTAAGTCGATGGTCAGGTTGCCAAAGGGCCGG 

TCATCTCGCATCCTCCCTCCCTTCGGTGCCTCTGGTCTGATGTCCAGTGAGGATCAAAGC 

CAGGAGAGCGAGATGGAAGTTTTCCTGCTGCAACGCTTAAGCTCCGGTTGAACACAAGGC 

TTGGACTCCCCGCGTCCTCGGCCGCTCTTAGAAACTAGTGGAATCCCCCGGGGCTGNAGG 

Sequence 2298 

CCGGGCAGGTACGCGGGGGTAGATGGAAGGAAGAACTTGTGTGCTTAGACCTGACGCTGG 

GAGGAGATGCTGCCACCTAGGTTACTTGTAGGACCCTATACGGCAACCTCCTTTGCCAGG 

AACTATTTATAAACATCCTGCAGGAAAATGAGTCTATATGTCAGMTACACATTTCCCAC 

CTTGCCCAACAGTAGAAAMCATAAGAAGAGAAAAACATTAAAAMTGACAAGGAAGTTA 

ATGGAAGTCAGCAATGTGATGGTGTTTGGAGGTGGAGCCTTCAGAAGGTAATTAATGCCC 

TTGTAAGAAGAGGCCAGAGAGCTTGCGCACCTTCTTCCTGCCATGTGAGGAGCCAAGGAA 

GCCGGCNTGTCTGCAACCTGCAAGAGGGACCC 

Sequence 2299 

GACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGTAATAGTATCA 
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GGCATTGGCTTAGACACTGAGGATAAGAGCTCTATATACTTGTGAACTTGGTGAAAGTTG 

AGAAGGATGGAGAAAGAGATAGAATGAATATAAAMTATGAGTAGAGMTTGCTTTAATT 

TGAATAGCCTC ATTGT TTTAGTGCTCTGAATAGAMTAGAAAAAAATGGTTAGTAT7TAC 

TGGAATTGCTATTTTTAGGTCATTCAGAGTTMGGACAAATTTCMTTGTCACACATAAA 

AGGTCTTATTTACATAAACAATTTTACAGGGCTTGTACAAATACAAGAAACCAAAAGCCC 

TAACTGCTGTTTCTCATGGGGTTAMTCTGTCTCTTCAATCTATCAMTATTTGGCTTTA 

GAATCCTAACAGTCTTTAGCTATCATTCAAAG 

Sequence 2300 

GAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATGGAGGGCTGATTTATCACATACG 

TGGGTTCTGTAGGGCCCACTGTGGGACTTGAGTATGCATTGGGTTTTTGTATACTCAGGC 

GTTCTGGAACTATCCTCCATGTATACCAAAGGATGGTTATTTCTACAAAAGCAGGAGGTA 

AAGCCCAATGCACAGCTTGCAGATTCCCCTCAGGCGAGAGACAGAGGTTAAGGGTAGATG 

GCAGATGACCTAAGTGGCTTGTTTTAGGAGGCCACTCCCAGGGCCACAGCTTTCATGTGT 

TTGCCACCAGGGTAGAAGGTCTTGCTGACAAGGGCAATGACTACTAACAGCCCCAGCTTG 

CATGGAAACAGAAAAGGGCATATTGCTTGTGGCCACCAGGGCTCAGGCTCTATCCCTCAG 
CAAGCTTTGGGATC 

Sequence 2301 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGACGCGGGAGTTTTTCAG 

AATCAGGCCTGCTTTGTCCCTAAATGCTTTCTTATTTCCCCAGAGCCCTTATTGGATTT^ 

AGCCCCATA TCATAT CTCACATACCTGAGACAGACATACCACAGTGAACCCTGCATGTCC 

TTGTAAGTTCTTTTTCCGTGCCTAGAAGTTGCAGAAGACCAAGGAACCTTGGAAGTGACT 

TACTC I I I I GC7TC ATAAAGTTGCTTTAAMCAGCCTTTATTTTAATTATTAACAAGGTA 

TGTCACACATTTTTAAATTTTAAGGTTAATTTAGTCATTAATTCTGTCAAACAA 

TTTGGAAGCTAGTTGTCTTCAATGTCTTTTACTTGGAGTTACTTGGAAGTTTGCTTA/^ 

TTTATTCACTTGGAGTAGAACTTCCTAG 

Sequence 2302 

TACGACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGAAC 

TTGTGTGCTTAGACCTGACGCTGGGAGGAGATGCTGCCACCTAGGTTACTTGTAGGACCC 

TATACGGCAACCTCCTTTGCCAGGAACTATTTATAAACATCCTGCAGGAAAATGCAGTGA 

AGTAGAAGAGACAGGGAATATCCCAGAAGGTTATGCAAAACATCAAGAGAAGATGAGAGG 

AGTCTATATGTCAGAATACACATTTCCCACCTTGCCCAACAGTAGAAAAACATAAGAAGA 

GAAAAACATTAAAAAATGACAAGGAAGTTMTGGAAGTCAGCAATGTGATTGGTGTTTGG 

GAGGTGGAGCCTTCAGNAAGGTAATTAATGCCCTTGTAAGAAGAGGCCAGAGAGCTTGCG 

CACCTTCTTCCTGCCATGTGAGGAGCCAAGA 

Sequence 2303 

TAGGGCGAATTGGANCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCATGATTAGTTAA 

ATATAAG ACTCCGTAA l l l 1 1 ACMTTTTAACAATAATTTTATTTCTTCAAGCTTGTTAG 

TTTGGGATTGTATTAAAACTACAGTGTGTGACTTAGAAAATGATMTGCTGCTTTATGGA 

AMTGGANTAATAGGTGGGTMGACTTCATTGCAAAAATTGTGTAATACCATCAGTGTTA 

GGAACCCAGTTGMGTCTAGMGACAGATGTTAGTATCTTAGACTAGGTTGGTATTTGAA 

TAG ATATTGGTAA TATCAGTAGAATTTAATAATACATTAGAAAGAAAGAAATCAGAGAAG 

ATTCTTTTATTTTCACTTGATACTTGTGTTGTTACTTTCMTGAGATAAAGAAGGACAG 

CAAAGGANAAAGTTCAGGGGC 

Sequence 2304 

CTCCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGATTATCTTA 
GTGTAAATGA i mi l GTAAAATAATTTAGTTGTTAGATACATGAACAGATTTTTGCAAA 
GAAAACAAATTGCATGGCACTGTATCCTCTCAGGACATTTTGAGATAAGATGTGAAAATG 
ATACTCATTGGAACCAAAAAMGATTGAAMTAATTTTTGTAGAAAATATA^ 
TTCTACTATGGTGTGTATGGTGTGTGTGTGTGTATATATATATATATATATATACTGAAA 
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TAAAATTATGAGTA ATTAA CTTTATGACAGGTAACACTAATAAAGTGCAGTGGCATGTTC 

CTGAAATCCCAGCATTTTGGGGGATTGAGGTGGGGAAGGGATTACTTTGGAGGCCAGGGA 
GTTTGGGACTAG 

Sequence 2305 

ACTACT TAGGGCG AATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACTGAGATGATTAT 

TCCCTCl (lilt f AAAATTTGMGTTTCCTTTGATTCCCCTTTTCTTTTACAAAATGGAG 

TCATGACACGATCCCCGTGTCCCGTGTTGAGMCTGAAAGGTTTCTGTAGCTTTCCTTTG 

ACATGAAATTTCAAAAATATACTACTCTGTTTCACTCTGGCCTGTAATTGTTGCACTCTG 

GCCTTTMTTGTTGCAACTGATTAGACATTTTTCCTTCCAGCATTATGGTAGACTCGATG 

CCC 

Sequence 2306 

AGGTACl l i l iCACI rTTTTTTTTTTTTTTT! I GCATTTGAGCTAAGATTTTAACAATCC 

TGTGAAAATAGGTAATATAGTCACGTATTTGTTGAATACATTATTTCAAAGAACCACAAA 

CACACAGAAAAAATATTTAAAAAGTCCATGGTGGGGGGAAATGATCCAGAAACAACAGCT 

CTTCTACTTCCAGCCCTAACCGTCTGGGATGCTAGATGGTTGTGGGCAAACCGTGGAAAA 

AGATAAG CTTCT GTTTTTCAAGATTTTATTTTGTTAGAGAGTGCANAAAATC 

GTCTCTG l l l l iGCCTTCTAGCMTTMTATATGACAGCAGTCTTTGTGATTTATTTTAA 

CTTTCTGCAAGACCTTTGGCTCACAGAACTGCAGGGTATGGTG 

Sequence 2307 

AG GTACTCTTG GTTTTTTAAGACAAAGAGCAAATCCTCCCCTGCCAGGATTGACTTTTGG 

CTCl mill ICAMCCTCACTGCTTTTTGGTTTAGTTGTCATAAAATGCCAAGCACCAT 

GAACAGGGCTCCATGAAGGGGCTCAGAGGTAGGAGGGCTGTGATTAGGAGAAGGCTTGGA 

CTGATGGGCAATTTGAGTGCTCAGAATTAGAGTGAGGGGGTGGGGGTGCTGCAGGGACAG 

ATGCTGGGGAAAGACACCCTGAAGGGCAAAGGGGAGCAACAAATGGGCTGCAAGTACCTT 

GCCCCGGGCCGGCCGCTTCTTAGAAACTAGTGGGATCCCCCGGGCTTGCAAGGAAATTCG 

ATATCAAGCTTATCGAATACCCGTCCGAACCTTCNGAGGGGGGGGGCNCCGGGTACCCAA 

GCI I I I ( I GTTTCCCTTT 

Sequence 2308 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACNCGGGGATGCTGCCA 

CCTANGTTACTTGTAGG ACCCTATACG G N AACCTCCTTTGCCAGGAACTATNTATAAAC A 

TCCTGCAGGAAAATGAGTCAAGGAAGCTTTTNTTTTGAGCTATTTACAGCTT^ 

TGAGTAMGTATACTCCTGTGAACAAAATTTGGAACATATTTGNTTCTNTCTAACTGATT 

TCTNCAGAATTTGGAACTAGTTCAGTGAAGTAGAAGAGACAGGGATATCCCAGAAGGTTA 

TGCAAAACATCAAGAGAAGATGAGAGGAGTCTATATGTCAGAATACACATTTCC 

Sequence 2309 

GACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACNCGGGGGCAGTGG 

GAAGCTCGCNGCAGCTGGGGAGGAGCCAAAGCCTCGGCAGCTCACCTAANCCGCANGGAG 

ATACACCCAACTGGGAGATGAGGAAACAGCNACCCAGAGAGGAGAACTAACCCACACAGG 

ATCATTTNCCGAAGGAGCACGGCTGAAGAACCANACCTGGACTTTCTTAGGACAAACTTA 

CTGCAGCTTGAANGANCCAACCATGGATTTGAGGCGTGTGAAGGAATATTTCTCCTGGCT 

CTACTATCAATACC AAATCATTANCTGCTGTGCTG NTTTAG AGCCCTG GG AGCG ATCTAT 

GTTTAACACCATCTTACTAACCATTATTGCTATGGTGGTATACACTGCCTATGTCTTTAT 

TCCAATCCACATTCGCCTGGCTTGGGAATTTTTCTCAAAAA 

Sequence 2310 

AGGTACGCGGGAGAATTTGTTGCATTGAGTTATTAAAGTAGATTATACACATTTAGCATG 

GTTAGAAACTAGATGTTACCACTGAGTGTTTTCTCCTTAATAAGTTAGTTTAAGAGGCTG 

GGGATAATTTCACGGTAAAATGGCAGCTTAACCCCAGTTTCAAACATTGAAACTTGAAAA 

TTTTCCTTTTTGAAAAATTACATATACACCTGGCACTCTCTATCTGTGGGTTTCACATCC 

CAGGAGTCAACCAACCTCAGAAACAACCAATCGTGGATTGAAAATACTTTGGGGGGGAAA 
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AAATNNAANANAAAAANAAAAANNNNANGGTACCTGCCCGGGCGG 
Sequence 2311 

CCGGGCAGGTACGCGGGAGAACTTGTGTGCTTAGACCTGACGCTGGGAGGAGATGCTGCC 

ACCTAGGTTACTTGTAGGACCCTATACGGCAACCTCCTTTGCCAGGAACTATTTATAAAC 

ATCCTGCAGGAAAATGAGTCAAGGAAGCTTTTCTTTTGAGCTATTTACAGCTTTTAGCAA 

TTGAGTAAAGTATACTCCTGTGAACAAMTTTGGAACATATTTGTTTCTCTCTAACTGAT 

TTCTCCAGAATTTGGAACTAGTTAGTCTATATGTCAGAATACACATTTCCCACCTTGCCC 

AACAGTAGAAAAACATAAGAAGAGAAAAACATTAAAAAATGACAAGGAAGTTAA 
Sequence 2312 

AGGTACGCGGGGCAGTGGGAAGCTCGCAGCAGCTGGGGAGGAGCCAAAGCCTCGGCGCTC 

ACCTAAGCCGCAGGGAGATACACCCAACTGGGAGATGAGGAAACAGCAACCCAGAGAGGA 

GAACTAACCCACACAGGATCATTTCGCGAAGGAGCAAGGCTGAAGAACCAGACCTGGACT 

TTCTTAGTGCTCATTCCTGGGCCCTCGATCACTGCATGTTTGTTACATGGTCCAGTTTGC 

TGCTTCCTGGAAGAGCATCTCCTGACTGTGATGCCTTTGAGAATAAAGTCTTTCCGAAGC 

CATTAATATTCAGAAGGAGCCACATGGACA 

Sequence 2313 

CCGGGCAGGTAC I I M I II I I U N I II { I! I I M I M I I li I NAANGACTGACTGTGCT 

CCANANAGTGGANAAAAATGAATTGGGAGCTGAGGAGGAATGGTANATGGAGGCTAATTA 

CAGGATATTCCAAGATGCTTGAAAATCAACATCCACATGATGAGATGACCTGGGTATCTT 

GGGTTTCTTGTGTGACTGGCANAAATTCATATNTAGAATATAAGGGGGTTAATATTGGCA 

TTCTCTAGGGTGGTGGAAGTGTCTGTTGTATCGTGTGGGGAANAAATCACCTTGACAGGA 

AAACATGGAGCTCAACACATCACTTATTCAGTCACAAA 

Sequence 2314 

CCGCCCGGGCAGGTACTATTCTCTCTCCCAGATCAAATTTTCTCCGGTGGCTAAAAAGTT 
GTTTGTGGTMCTGCAGTGAGTGCTATACCTGTAATTTTTCTGGCTCATCACTTTAAAAG 
AAAACGTGGAAAGAAGAAAGGAAAAATATTACCATGGGAACCAGAGCACCTCATACTTGA 
ATACACTAAAAGAGCAGCATCAGACAAAGGTTCAAGTTGTTCCAGTAGCAGACAGAATTT 
G AC ATTATC TTTAAGTTCT AC C AAAG AC AAAGG AT CTCAAGTTTGTAACTATGCT AATG G 

AGGACTTTTCAGTAAATATTCAGGTTCTGCACAGAGTTTGGCCTCTGTCCAG 
Sequence 2315 

ACTATAGGGCGAATTGGNNCTCCCCGCGGTGGCCGGCCGCCCGGGCAGCTACGCGGGATA 

AGAGGCCAGAGGAAGAGCGCTTCGAGCTTCTTCAAGGAACTTAGAGAAGAGCGGCATTGT 

GCTCCTTCTGGGACCCCCACAGGACCAGAGATCCTTGCTGCTGCAGTTCCTCCCTCTTCC 

CTAAAGAACMTAGGGAACAAGTAGMGTGGTAGAATTTCACAGCAATAAAAAAAGAAAA 

TTGACGCCAGATCATAACAAGAACACAAAGGCTAATCCTAGTGTTTTGGAGAGAGATGTG 

GATACACAAGMTTTAACCTAGAAAMGCTCGTTTAGAAGTGCACCGGTTTGGTATCACG 

GGTTATGGAAAAGGAAAGGAGAGAATCCTGGAACAGGGAACGTGCCATTATGCTGGGCGC 

TAAGCCTCCTAAAAAGAGTTATGTGAATTAC 

Sequence 2316 

NCTATA GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTGTTCTTTTm 

inn INTTNTTGGAGCAGCTAAAATCTAGGATGGNGTTAAGTTTCTTCATTTTGTCAAT 

TATATATAAAAAATTAGAAACAACATGMTCTGCATTTCTTGGATGAGATAGTTAATAAC 

AAACTATTTCTCAATATTTGTATACTAAAAACTAGTGAAGGTGTTATGNGTTCCAGTATC 

TTATCTCTTATTTGAACATGGGTTTNTGAAAGGAGCCTATATAATAATATAAATGGTATG 

TAGTAAATGAGGCNCTGTCTTGGCTGGGACTGCTATAAAAAAATTACCATAGACCATTGA 

CTAAACCNCCAACATNTNCTTTTCACAGTTCTGGAAGTTGGAAAGCCAAGATGAGGATGC 

CAGTATGGCTAGCTTCTAGNGAGGGTCCTNTTGC 

Sequence 2317 

CCAGACGGAAGCTGACATTCGCTATTGATGTTCCAGCATTGATTGAGCTCCTGCCTCGTC 
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TGCATCAAGCATG GTCCT CGCAGCTTCACGCAGTTAGAGTCCACATCACGTCTCTCTGAA 

GMCAGTTAAAGTGTTTTCTGGATGAATGCATACTTAAACAAAAATCCATCATTAAACTT 

TCTTCACAAAGAAAAAAGGAAGACATTGAGGACGTAACACCTGTGTTCCCCCAGCTTTCC 

AGG N CC ATCATCTCTAAATTGCTN AATGAATCCCAAAACAAAATGGTCC AN AAAAACTGG 

AGGTANGAAGATGCANATAGTGCTTGAGAAGTGAAGAATGTGAAGCTTTCTAAAGGCTAC 

TTTCTCACTAAAGCCTTGACTGG 

Sequence 2318 

GCG GCC CG AG GTAC AATTGTGTC ATATTC ATGC AC AACTTC ATTTTAGTCTCTATTTGAC 

TTGATTTCTTTAGTGGATTTGGGCTTTTTGGCCAATMGATAAATACCTGCMGGGACA^ 

CACATAAAATTCGATCATAGAAGAGGATCTCTTTCCTGCCGTCCACATCTATCTGGAGCC 

TGGGTATGCTGGAGGCATCATGGTTGACCACCCCCGCGTACAATGGAACAAGGAGATAAG 

CAGTGAAAGGCCAAGGGAATGTCTGGAGTTAGGACTTCAGGTGATTCACAACTTGGCTGC 

CACTCACCCGAGACTGCCCAAGCGCAGATTTCTTNCTTCTATAAGAATATTGATTCTTGC 

AAATAAGATGAACCTAAATGTGGTCCAGGAGCCAGCATCTTCTACATGGTACCTGCCCG 

Sequence 2319 

ATACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGGCCNGGGTCCGCGGGNAGGGAGAA 

TGCTGCCACCTAGGCTACTTGTAGGACCCTATACGGCAACCTCCTTTGCCAGGAACTATT 

TATAAACATCCTGCAGGAAAATGAGTCTATATGTCAGAATACACATTTCCCACCTTGCCC 

AACAGTAGAAAAACATAAGMGAGAAAAACATTAAAAAATGACAAGGAAGTTAATGGAAG 

TCAGCAATGTGATGGTGTTTGGAGGTGGAGCCTTCAGAAGGTAATTAATGCCCTTGTAAG 

AAGAGGCCAGAGAGCTTGCGCACCTTCTTCCTGCCATGTGAGGAGCCAAGAAGC 

Sequence 2320 

CTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGGTACGCGGGGGGACAA 

GGGGCTTGAGCGTTCTGTGGAGAGAGTGCGAGGTCAGGCCATGAACTTGGGAGATGGTTT 

AAAGCTTGAAACTGAATTACTGGATGGAAAAACCAAGCTAATATTGTCTCCATATGAACA 

TAAATCAAAAATTTCTGTGMGATGGGAAATAAGGCCAAGATTGCAAAATGTCCTTTAAG 

AACAAAAACTGGGCACATTCTAAAATCAACACAAGATACTTGTATTGGGAGTGAAAAACT 

TTTGCAAMGMGCCAGTTGGTTCAGAMCATCACAGGCAAMGAAAGAGATTTCACCTG 

CAACTCCTAATATGCAGAAGACTAGAAACACCGTAAATACATCTCTAGTAGGTAAACAGA 

AGCCTCACAAAAAACACATNACAGCTGAAAACATGAAGAGCAGTTTG 

Sequence 2321 

ATCATACTTAGGGCGAATTGGAGCTCCCCGCGGNGGCGGCCGCCCGGNCGGGTNCCCGGG 

GGANATGCTGCCNCCTAGGTTACTTGTAGGACCCTATACGGCAACCTCCTTTGCCAGGAA 

CTATTTATAAACATCCTGCAGGAAAATGTCAGAGATGGGAAGAAACAAGAACTTTGACAT 

GCTTGGTGTTCTTGCCCAAGCTTTGAAGMGTTTACAAAGTCTATATGTCAGAATACACA 

TTTCCCACCTTGCCCAACAGTAGAAAAACATAAGAAGAGAAAAACATTAAAAAATGACAA 

GGAAGTTAATGGAAGTCAGCAATGTGGTGGTGTTTGGAGGTGGAGCCTTCAGAAGGTAAT 

TAATGCCCTTGTAAGAAGAGGCCAGAGAGCTTGCGCACCTTCTTCCTGCCATGTGAGGAG 

CCAAGAAGCCGGCTGTCTGCAACCTGCAAGAG 

Sequence 2322 

GGGNATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACCAGAGTCCAAAGACGTAAACGT 

GCCTTCAGCAGGGAAGTTCGGAGGTCTTGGATTCTGATGGTAATATTCTTTTCATAAAAA 

GGAMTMGTAATATTTTTGTCGGCAAGATCACAGTCCCTAGCTATGCCTTAAATATCAT 

CTCAGCTGAATTCTGTGTTCCACTCCACAGCCTTCTTTGCAACACTGGAAGGTTAACGGA 

TTTGAAGTAAATGAATTGTAACTGTAATTTC^ 

GTTTGGACCGTGAACAGGCTAGCCAAGAACAAGTATTGGGAGGTCCATGTAATATNGAAG 
AGAGTTGATATTTGGCCCAACTTCCTGATACTTTAAAAACAGAGAAGTNAAACAAAATAT 
GAGGATNTCTTTTGGCCAGCAAGTCC 
Sequence 2323 
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CNCTTAGGGCGAATT GGAGCTC CCCCGNGGGGNCGNCCNGGGACNACCCTTGGNTTTCCA 

GNGGGCNTCAACANA I rTTTTTTTTTl I AANAAAATGAATTCACTGAGGTTTAG AAG ATG 

TTAAAATACTTGCCCAAGTTCCTGCTTGCCCAGATGTGCCTTTGGGAATGTCATCTTCTT 

TCTCAAGCCTCCTTACAGGGTTGTAGTGAAGATGGAAGGGGATAAGGCAGGTGAAGCAAT 

TGGAATATATTAAGTGCTCAATTAATAAGGACCTGTCA 1 1 1 I 1 I I 1 CTCAAGTTGAGGAT 

AGGACCTGCCCTCTGGTGTCTGGGAAAGAAACTGTGGGTCCAGCTTACCTAAGGAGAAGG 

GGATAAAAGC TTCATT CTTTTTCAGTGCCCCATTGGCATCCAGAAAGTGGTCAGGATTGA 

AGGCGTCTGGTTTTTCAAAGTAGTGTGGGTCACGGA 

Sequence 2324 

GGNGGCGGCCGCCCGGGCAGGTACACACAAAGACAAACCTGAACTTAATTTCAAGGAAAA 

CTTAAACCCATGCACAAATMTTGGTGAGCCTTCATTTCCCTGACTTCAAGTTTCCATGT 

GAGGACTCATGCTCTCTCCACTTTCTTCTTGGGAGGAGGGAAGATTTACCTAATGGGTAA 

ATTTGGGCAMGCACATTGAGTGTGCTTGTTTGGCTCTGAGTGTCTTTGCAAACATGTGT 

CTGCCCACAGTGACATGAGTTTGCGTTGACTGTCATGTCTGCAGGAAGCTGCCTGCTCCT 

GTGGCCATGTCAAGCAATTCTTTCTTTCAACTGCAACTGTGT 

Sequence 2325 

GGCGGCCGCCCGGNCAGGTACNCGGGGTAATTGACATTGTGGACTTCCTGGGGTAGTCCT 

GAGTAAAAAGCACTTAAGCACCTGTGGTACATCAAAGAGTCTCTTTGAGAGTTCTGCAAA 

GATTCAAGTGTGTATACCTGAGTCTATATATCAAAAAGTAATGGAGATNAAATNGAAGAA 

NGTANGAANGAGCCTNCTAAGAAGCCATCTGCCTTCAAGCCTGCCATTGAAATGCAAAAC 

TCTTGTTCCAA.ATAAAGCCTTTGAATTGAAGAATGAACAAACATTGAGAGCANATCCGAT 

GTTCCCNCCAGNMTCCAAACAAAAGGACTATTGAAGAAAATTCTTGGGATTCTGAGAGT 

CTCTGTCGAN GACTGTTTCACAGGAAGGGATGGTGTGTTTACCCAAG GCTACACATC AAA 

AAGAAAAAAAAAAA 

Sequence 2326 

CCGCGGTGGCGGCCG AGGTAC I rTTTTTTTTTTTTTTI I GTTTTTCACTGTGGTAGGGTG 

GAGCTTAATTGGGCANACCTGTCCTCAGGGACCCTGGTGGTGCATGTCAAGCAAGGCTGG 

CTGTGGTAGTCAATGGGCCAGGTGATAACCAGGTCTCTGGTGGTATGTCAAANGGGGGGC 

CANANGNGCTTGGACTTGNGAACCATGTTACTGGAGGAGGTGAMNTACCTTTTCAGTGG 

AAAGCCTCCATAGGCAGGAAGCTAGGGACAeAAACTTTGCTAATGCTTTAGCCCCAAAGG 

CTACAACCCAGCACAGCAGCAGTTATGAGCAGNGAAATTTACCCTTGAGGTACCTTGCCC 

GGGCGGCCCGGNTNTAAAACTAGTGGGATCCCCCGGGCTTGC 

Sequence 2327 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGAGAATTGGAAACT 

ATAAAAAAGTCACTGCAGATTTGTTAAAAAAAAAAAGCCAMTATAAATTTGAAAACTTG 

AAAAACCAAACAGTGAAAATTAAGAACTCCAAATGTGTTAGTTATCTATTGCTGTGTAAC 

ATACTGCTCAAACTTGACTTAACATNACCTTTNATTATCTCAAGTTTTTTGGTT 

GGAAATCCAAAGCCTTGGGCCCTMGNTGGGGTTCTCTGGCTTTTAGGCTTGCAAATCAA 

AAGGCATTNGGTCGGGGGGTANGGGTCTCCTNTGGGAGGCTGAAGCTGGG 

Sequence 2328 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCACACACACACAGGC 

ACACGC AGGC ACACACGCAGACACACACACACATAAACACACAGAGTTCACTAGTCCGAG 

TTACTGATTTTCTTAGGATTCTCAMGTGACAACACCGGAAACAAGGTMTTCATGTTAA 

AACA CAAGGGGTTATATTCAGTTAAGAGAATGGGGATCCCCGAAAGTAAACCCGNGGGAA 

TTTTGGAATCCAAGTTTTCGAAAGAAGCTTAAAAAAAAGGAAAATTNGGGN 

CATTTCMCCCTTTCTTGGMTTCCCTTAAAGAAAAATACCAGGANGGTTTCAA 

NGGAA AAACCATTTTNCCGGGTTTTTTNNGGGGATTNCCCAAAAAGNTTTGGAAANNCAN 

TTTTTGNGGNAATNAANGGCCCTTTANN I 1 1(1111 IAAAA 

Sequence 2329 
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TCACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCGAAAATAACGTCAATTTTCT 

CAGGCTACTGMTTAGGTGACAGAATTTAMTTTAAATACAAAGTCACGTGGCTTTAGCA 

CCCAAATTCTTAACTAAAGTTCTGTGCAGGATMGACTTATGACACTGGGATTATTTGTG 

CTCCCTTTATGCATCTGTGTTAACCCACACATATTACCAGCCCTGTAGTTAATAGGTTAA 

TAAAGAATTAACTGCTGCATAAAATATTCAAGAGAGGAGGGGCTTACTCCACAACTGGAT 

CCTTGAGTCTCACCTTCTCTACATCTCCCAGACCCTCGGTGTCCAGCAGAACTAGGATGT 

GGCCTGGCTTCTTGGGGTGGGGCACACACCACATCCAGATTCCTTTAAGTGTGAAGACTG 

CACCGTGGAGCCCAGAGAGAAGCCTGCAAG 

Sequence 2330 

AGGTACGCGGGGAAACACTTCTTACCTCAGAGCTGAGCTGGGCATGAGTAGATGCTCAGT 

AAGTGGTGCACAGGGTTGGTCCCTATGGTGGAGGCCCCCTAACACCGCCCAACCCCCCTC 

CATGTTCTCACAGCTCCACGCACTGAGCACGGGCATGAAGGCCATGATGTCANAAATTCT 

GCACCCAGGGAGCTGAGATGTGCCGCAGGGCCTGTGGCGGACATGGCTACTCAAAGCTGA 

GTGGCCTGCCATCACTGGTCACCAAATTGTCGGCCTCCTGTACCTGCCCGGGCGGCCGCT 

CTAGAACTAGNTGGATCCCCCGGGCTGCAGGGAATTCGATATCAAAGCTTATCGATACCC 

GTCGACCTCGAGGGGGGGGCCCGGTACC 

Sequence 2331 

NCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTTTTTTT 

I II M I I I I I I N I I I I I M I GACATGAACTAATTTAATCCTNTTAACAAATTTACAAGG 

AACATATTCCCCTCACTTCATTATGTAAATAATAAAACAAAAGCCTAGAAGGGTTAGTAA 

CTTGCANATCATATAGTCACTTTGCAGCAGGTCTGANATTTGGGTCCCANANAGGCAGNT 

CCAGCCACCATGCAACCGAATCCTGCGCAGCACAGTTCGGNGTCATTCAACAAGACGGNA 

TCC ATGTTG ATTAAGG AATCTTAGCTNTTAAAAGG NTCTAC NAATTACATTTTTTCC ACC 

CAAGC 

Sequence 2332 

CTNCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTAAAAAAAAAAGAAG 

TGTTTATTTGAAATTCAAATATMTTTGACATCATCCCTTATTATCATTTGCTTAATC 

GTAATCCTAACTAGGAAGAAAGCAAAAAAAAAGGCTTCTGTATATTCATATTAGATGCTA 

AAATGTATTCAAATTAGATGCTAAAATTCATGCACTTACCTTGCAGAAGGCTCACTCTCT 

CAGGACCTACAAACTGGCAACTCTCACGCCAAATTTACCTCACTGATATATTTTATTTGC 

TMGCAAAATATTTTAATTTCMTTAGTTACCATCATTAAAATGAATAGAGTAGACATAA 

AAAAATGATTTCATATTCTGGCTTAAAAAAATTTGCCAGCCTGGGTAACATAGCAAGGAT 

CCTGTCTCTACAAAGAAAAAAAT 

Sequence 2333 

CCGCGGTGGCGGCCGAGGTACAGTAGGAGTGCCCAGACTCGGGGAGAGGCAAGCTGGCGC 

GTCTCCAAGGTGCTTGTCACTCACCACTAGAACGTGGTCCTTTACCACTAGACAAGATCT 

CCTGGTTTGACAATGCAGGTGACACAGCTGAAACTTTATCTCAACACGGCCTGAAGAATA 

CCACTGTCTAAATATGAGGTGCTAAAATACTATGACCTACTCTAATATTCTCTCCCAACT 

CTGTCCATCCTCGTGACTGGCACCCATGCTGGCCCAAATGATGACAACCTCCTGTTCTAA 

GTGCACAAGCCGCACATTTAATAAACCTTTCACCCAGTTCCTTCACTGGTGTTCATCTCT 

CGGCTTCACTTACAACCACTCCCTGTGGCTTTTGCAGTGAAGCCTCTTAGCCCAAAACTC 

TTCCACTCCCCA 

Sequence 2334 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATGAAGGAGG 

GCACTAGGGGGAGTTAAGAATTTGTGAGTTAGGTAGCCAAGATTGTTGAGGTTGGAAGAA 

TGAGGAAAGAAAGGGGATCTGTGGCAAAAATATGCTATTGGATCCACAGTTCTAGTTTAA 

GGACATGTCTAGTCCCTTTTCCAGCCCCCAGTTAATTACAATCCCCTGCTGCCCAAATGG 

AGAGGGAGGTTGTTCACCAGTAGAGGGAGCCAGGGGCTGGGTGTCTCAATAGAAACTGAT 

CTGGTCCCAGAAGGGGCCTTTAGATTTAAG I I I I I CGGATCGAAGGCAGAGCAAAGTGTC 
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CCTTACAACCAATGGCTAAAATGTATGCCATTGTAGTAGATCACATTGTTAGTTTTCTTT 
Sequence 2335 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGACAGGCCA 
TCTCGCTATAG GAAAGG AAAGTGGAACAG CATTC ATCCTC AAC ATTTTTACGAAGACAAA 

ATGAAGACTGGAGTAGAAGACTGATCAGTGCAGGTGTAGCATAAAAGTGTAATCCTGGAA 

GATGTGGTGTGAGAAGCATCATCAGCATCATGCTATTACAATCCCAAACCATGGGGGTTT 

CTCACAGCTTTACACCAAAGGGCATCACTATCCCTCAAAGAGAGAAACCTGGACACATGT 
ACCTGCCCG 

Sequence 2336 

AGCTCCCCCG CGGT GGCGGCCCGCCCGGGCAGGTAC I (TTTTTTTTTTTTTTTT1 1GGAC 
AAAG AAACG TG TTTTAAAAAATAAC ACAAAGTCC ATAATTTCAATG AAAGTAAGTGTCAA 
GCTCTTACCAGGCTGGTCCTCTGGTCCAGTTATAGCTAGTATATCTGCTTCAATACTATC 
A TCTTCTGGT AMGGTCTAGMGACACAGGCA^ 

ATTTTTTT1 fGAAGTTCAATCTATTAAACATACAGCTTATAGAAATMTTATTTGGAAAA 

TGGAATAMGCAGTGTTGTTGTTTTTGAGACAGGGTCTTGCTGTCTGTCACCTAGGCTGG 

AATACAATGGCACAATCATGGCTCACTGCAGCCTTAACTCCTGGGCTCAAGAGATCCTCC 

CGAGTATCTAAGACTACCAGGTGCCGTGCCCGACGCTTGGCTAATTTTAGTACCTCGGCC 
GCTCTAAAACTA 

Sequence 2337 

CCGGGCAGGTAC I rTTTTTTTTTTTTTTTI I I GGACAAAGAAACGTGTTTTAAAAAATAA 

CACAAAGTCCATAATTTCAATGAAAGTAAGTGTCAAGCTCTTACCAGGCTGGTCCTCTGG 

TCCAGTTATAG CTAGT ATATCTGCTTCAATACTATCATCTTCTGGTAAAGGTCTAGAAGA 

CACAGGCATACTTTTTAATCCAGAAATTAAGAAAGAA I I I I 1 t 1 I 1 GAAGTTCAATCTAT 

TAAACATACAGCTTATAGAAATAATTATTTGGAAAATGGAATAAAGCAGTGTTGTTGTTT 

TTGAGACAGGGTCTTGCTGTCTGTCACCTAGGCTGGAATACAATGGCACAAtCATGGCTC 

ACTGCAGCCTTAACTCCTGGGCTCAAGAGATCCTCCCGAGTATCTAAGACTACAGGTGCG 

TGCC AC GACGCTTGGCTAATTTTTAGTACCT 

Sequence 2338 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGCCACCTAG 

GTTACTTGTAGGACCCTATACGGCAACCTCCTTTGCCAGGAACTATTTATAAACATCCTG 

CAGGAAMTGAGTCTATATGTCAGAATACACATTTCCCACCTTGCCCAACAGTAGAAAAA 

CATAAGAAGAGAAAAACATTAAAAAATGACAAGGAAGTTAATGGAAGTCAGCAATGTGAT 

GGTGTTTGGAGGTGGAGCCTTCAGAAGGTAATTAATGCCCTTGTAAGAAGAGGCCAGAGA 

GCTTGCGCACCTTCTTCCTGCCATGTGAGGAGCCAAGAAGCCGGCTGTTCTGCAACCTGC 

AAGAGGACCCTCACTAGAAGCTAGCCATACTGGCATCCTCATCTTGG 

Sequence 2339 

CCGCGGTGGCGGCCGAGGTACCCTGCTGAAAGATTATTTCTAACAGGCTTGTAGAGAAAC 

GTCGGTTCATGTAAATTAGAAATTATGGGGCCACTTTGCCATTCTTCACACCTGCAATGA 

ACAGGTGTTTATCTGCAGTTCTGACTTATCTCTTGAACTCCATTTGCATGTTATAGTGGG 

ATGCAGCTGATGCCCTGTCCAGATCTTCTTCAGGCCACTACATCTATATATGCATTCATA 

TTCCAGTGGCTGTGAGTGTTGGCTGTTGGTTGACAGAGGAGCTGCATCCTTCTGGAGGAA 

ACTGAACTCAGCTGATGAAAGCCACCCTGGTCCTGGGAGGTGAAGCATCTTCCAAATGAC 

AGCCTGCAGTCAATGACTGATGAATATGACTTCATTGCCTCATGACAGGGA 

Sequence 2340 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGGCGGCNCGCCCGGGCAGGTACTTACAGTTT 
GACTTTGAGTCAGCTCAGCATTCTAAATCAAACGCAAACAGCAGAATCATATGGCATAGA 
CACTTAGCAAATCCAATGCCTTCCAGGCATCTGTTTCCTGTTGATGAAATCCTCCCTATG 
GAGAGCAAACTGGTTCATATCTTCAGATAGTGTCATTCAACCCCTTGGTAGCTTTCTGGG 
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CTACTAAATATCACACCGTCTGGTGGGACATTTCACCC^AGTCGTGAAGGTCCGCAGCTT 
CACTCCTGAGCCAGCGAGACCACGAACCCACCAGAAGGAAGAAACTCCGAACACATCCGA 

ACATCAGAAGGAAGAAACTCCGGACACACCGCCGCCTTTAAGAACTGTAACACTCACCGC 

GAGGGTCCGCGG 

Sequence 2341 

TAGGGCGAATT GGAGC CTCCACCGCGGTGGCGGCCCGAGGTACCATGATTAGTTAAATAT 

AAGACTCCGTAA I I \ I f ACAATTTTMCAATAATTTTATTTCTTCAAGCTTGTTAGTTTG 

GGATTGTATTAAAACTACAGTGTGTGACTTAGAAAATGATAATGCTGCTTTATGGAAAAT 

GGATTATAGGTGGGTAAGACTTCATTGCAAAAATTGTGTAATACCATCAGTGTTAGGAAC 

CCAGTTGAAGTCTAGAAGACAGATGTTAGTATCTTAGACTAGGTTGGTATTTGAATAGAT 

ATT GGTAA TATCAGTAGAATTTAATAATACATTAGAAAGAAAGAMTCAGAGAAGATTCT 

TTTATTTTCACTTGATACTTGTGTTGTTCTTNCAATGAGATAAGAAGGCAGGCAAAGGAG 

AAAAGTTCAGGGGCNGGGGATGAGAAGAAAACAANAATTNTATGTTGG 

Sequence 2342 

CCGCGGTGGCGGCCCGCCCGGGCAGGTACCCTCTCTGTTCCCCTCCCTCCACCCTTCCCC 
TCCTTTTCTGT TTTTA AGATAAAGTGTTGATTTGGM 

AMGAGGTCAGTTTTGTTAGAGGCCTGGATAGAGCAATATGTTGTAACCAAATCACACGT 

GMCATrCTGTGGCTGAACATAAGAMTTGTGCATTATTAACGTTGTGCTTCAAGGTGGA 

ATGCTGAAAGCACTTGGAAAGTAGTTGCCAGAGAAGAGCTCACTGAAAACATTATTAGCT 

TTATATATCCACTTGTAGAATCTCTGTACAGTGATACATCCCTCCCTTTCATCTTGATCA 

TAAATATAAGACTTAAAGGAGGCAGAGTGAACGTGCTTAGGAAGCCCACTGAGAGCTCTC 

AGAAAGGCGCTTAATACCmCTTAGTAAGTTGAAAGTAGGTTGATCTTCAAGT 

Sequence 2343 

CGGCCCGCCCGGGCAGGTACTATAGCTGTAAGGAGAAGCTGAGAAATGATACCCAGGAGC 

AGTAGGCTTTACGTCTTCAGCCTAAAACCTAAAAAAAAAAAAGAAAAAAAAATTTAAAAC 

AGCTATTAAACTGAAAGCATCTGTAAAAAAAAAAAAATAAAAAAAAAAANGGTCCTGAGC 

ACCCAAGTCAGCTGTGCACTGAGGATTATGAGGACTGCAGGCTGTAGTCATTCTTCCCCC 

TGAGTCGTCAGCCTTGTCCCACACAGCAAGGTACCT 

Sequence 2344 

AGGTACGCGGGGGACGCTGGGAGGAGATGCTGCCACCTAGGCTACTTGTAGGACCCTATA 

CG GCAA CCTCCTTTGCCAGGAACTATTTATAAACATCCTGCAGGAAAATGAGTCAAGGAA 

GCTTTTCTTTTGAGCTATTTACAGCTTTTAGCAATTGAGTAAAGTATACTCCTGTGAACA 

AAATTTGGMCATATTTGTTTCTCTCTAACTGATTTCTCCAGAATTTGGAACTAGT^ 

GCTGAGACCAATGATGCTGACCTCCCTCAAAAGCTGCATTTCTGAATTTCTGAAGGCAAA 

CTGTCTGCCTATATTGNACCTGCCCG 

Sequence 2345 

AGGTACGCGGGGGACAGGCCATCTCGCTATAGGAAAGGAAAGTGGAACAGCATTCATCCT 

CAACATTTTTACGAAGACAAAATGAAGACTGGAGTAGAAGACTGATCAGTGCAGGTGTAG 

CATAAAAGTGTAATCCTGGAAGATGTGGTGTGAGAAGCATCATCAGCATCATGCTATTAC 

AATCCCAAACCATGGGGGTTTCTCACAGCTTTACACCAAAGGGCATCACTATCCCTCAAA 

GAGAGAAACCTGGACACATGTACCTGCCCG 

Sequence 2346 

AGGTACGCGGGGAGTGTCCTCCAGGACCCTGGCCCCTCATGCCTCCGTGCTTGCGCGTGT 

GCCATTTCCTCTCTCCAGAGGACCTTTCCTGCCTAGGACTCATGATTGTCCCCTCCCTGT 

GTTGCCTAGTTTCCTGGTATTAAGGAGAATCAACTCTCTGGATAAACGTGCCTTCTCCTG 

CCACGGCATGCACCCACACATACGAGCCTCCCGGGTACCTGCCCG 

Sequence 2347 

CGCC CGGG CAGGTACCATGATTAGTTAAATATAAGACTCCGTAATTTTTACAATTTTA^ 
AATAATTTTATTTCTTCAAGCTTGTTAGTTTGGGATTGTATTAAAACTACAGTGTGTGAC 
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TTAGAAAATGATAATGCTGCTTTATGGAAAATGGATTATAGGTGGGTAAGACTTCATTGC 

AAAAATTGTGTAATACCATCAGTGTTAGGAACCCAGTTGAAGTCTAGAAGACAGATGTTA 

GTATCTTAGACTAGGTTGGTATTTGAATAGATATTGGTAATATCAGTAGAATTTAATAAT 

ACATTAGAAAGAMGAAATCAGAGAAGATTCTTnATTTTCACTTGATACTTG 

ACTTTCAATGAGATAAGAAGGACAGGCAAAGGAGAAAGTTCAGGGGCAGGGGATGAGAAG 

AAMCAAGAATTTTATGTTGGACATGCTAAAGTTAAACACCTGCTTAACTCAAATTGGCT 

TCTGTGGAAGCAGAGGCTGCATGAGGTTGCTTATGAAAATGCTTTTTTGANGAAGCCATT 

GCAGGAGGAATCTCTTANGAANCCAGGGAAGAAGGAAG 

Sequence 2348 

CGAGGTACGCGGGGGAGAAGAAACGGCGGAGACCTGAGACCGGGAGGCTGAGGCTGTAGG 

CTGTTATCCTTGACATCTGCAGCAGCCCTTCCAAGCTGTGGAGACCAGGTCATCTGGAAT 

GCCCATTTATGTCAATGGAAGAAAGAAAAAGGGGTCTCCTCCCATCCTCACCACTGCATT 

CTCCCACCAACCCTGCTCCTGTCCCACTTCCCACAACTCAGTTGTTGACAATTTAACAGT 

GGGTTTCTGGAAACAGTGCTGCCTCCTTGATGGTCTAACTAACTACCAGAGTTATTCTAA 

CAGCGGGAATTTCAAGCCAGCAAACCAGACATACATAGGAGGCACAGCAAAAGACCCAAG 

GTGAGGGGCTTCCCTCACCTTCACCCTTGTTCTCACGTGGAGGTGGCGCATGTCTATGAG 

ACACAGTACCTGCCCGGGCGGC 

Sequence 2349 

CGAGGTACCATGATTAGTTAAATATMGACTCCGTAATTmACAATTTTM 
TTATTTCTTCAAGCTTGTTAGTTO 

ATGATAATGCTGCTTTATGGAAAATGGATTATAGGTGGGTAAGACTTCATTGTAAAAATT 
GTGTAATACCATCAGTGTTAGGAACCCAGTTGAAGTCTAGAAGACAGATGATAGTATCTT 
AGACTAGGTTGGTATTTGAATAGATATTGGTAATATCAGTAGAATTTAATAATACATTAG 
AAAGAAAGAAATCAGAGAAGATTCTTTTATTTTCA 

ATG AGAT AAGAAAGACAGGCAAAGGAGAACGTTCAGGGGCAGGGGATGAGAAGAAAACAA 

GMTTTTATGTTGGACATGCTAAGTTAMCAGCTGCTTAACTCAGATTGGCTTCTGTGGA 

AGCAGAGGCTGACATGAGGTTGCTTATGAAAATGCTTTTTTGAGGAAGCCATTGCAGGAG 

GAATCTCTAAAGGAAGCCAGGGAAGAANGAAAGAGAAGGGAA 

Sequence 2350 

AGGTACGTGGGGAGGCCATCTGGCTGCTGCTGCTGCTGCCAGCTTCCAGACGGAAGCTGA 

GATTTGCTATTGATGTTCCAGCATTGATTGAGCTCCTGCCTCGTCTGCATCAGGCATGGT 

CCTCGCAGCTTCACGCAGTTAGAGTCCACATCACGTCTCTCTGAAGAACAGTTAAAGTGT 

TTTCTGGATGAATGCATACTTAAACAAAMTCCATCATTAAACTTTCTTCAGAAAGAAAA 

AAGGAAGACATTGAGGACGTAACACCTGTGTTCCCCCAGCTTTCCAGGTCCATCATCTCT 

AAATTGCTAAATGAATCAGAAACAAAGGTCCAGAAAACTGAGGTAGAAGATGCAGATATG 

CTTGAGAGTGAAGAATGTGAAGCTTCTAAAGGCTACTATCTCACTAAAGCCTTGACTGGA 

CATAACATGTCAGAAGCTCTTGTCACTGAAGCAGANAATATGAAATGCCTTCAA 

Sequence 2351 

AGGTAC TT1 imilll llllMlllllM GGTTTCTTCCTTCCTTCTGTCTTTTCTTCC 
TTCCTTCTTTTTCTTTTCTTCCTTCCTTCTTTTTCTT^ 

(MM l CTGG AGTAACC AGTGTGACATAAGATCTGTTTGTGTTC CCCC ATTTTAC AAAAG 

AAAATTTAAGTCTGCAACTTCCCCTAGGCATCTGCAGCAGAGAGTGGATTTGTCAACATG 

CAAAGCTGCTGACCTGCCCGGGCGGCCGCTCGAAGCGTGGTCGCGGCCGAGGTACGCGGG 

AGTCTCGGTCTCCAAAAAGGGATACTACTAGGGAAAGCAGAAGATCTGAATCACTGTCCC 

CAANAAGAGAAACTTCTAGAGAGAACAAAAGATCTCAGCCAAGAGTGAAAGATTCTTCCC 

CAGGGAGAAAAATCCCAGGTCCCAGAGCAGAGAACGAGAAAGTGATAGAGATGGGCANAG 
GAGAGAGAGAGAAAG 

Sequence 2352 

CCGGGCAGGTACCGCGGGGGCAGTGGGAAGCTCGCAGCAGCTGGGGAGGAGCCAAAGCCT 
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CGGGGCTCACCTAAGCCGCAGGGAGATACACCCAACTGGGAGATGAGGAAACAGCAACCC 

AGAGAGGAGAACTAACCCACACAGGATCATTTCGCGAAGGAGCAAGGCTGAAGAACCAGA 

CCTGGACTTTCTTAGGACAAACTTACTGCAGCTTGAAGGAGCCAACCATGGATTTGAGGC 

GTGTGAAGGAATATTTCTCCTGGCTCTACTATCAATACCAAATCATTAGCTGCTGTGCTG 

T7TTAGAGCCCTGGGAGCGATCTATGTTTAACACCATCTTACTAACCATTATTGCTATGG 

TGGTATACACTGCCTATGTCTTTATTCCAATCCACATTCGCCTGGCTTGGGMTTTTm 

CAAAAATATNTTGGATATNACAGTACCTTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnAGGAATTCNATATCAAGCTTATTCGATACCCGTCNACCTTCGANGGGGGGGCCCN 

GTACCGAGC I I I I I GTTCCCTTTAG 

Sequence 2353 

CCGGGCAGGTACGCGGGGAACTTGTGTGCTTAGACCTGACGCTGGGAGGAGATGCTGCCA 

CCTAGGTTACTTGTAGGACCCTATACGGCAACCTCCTTTGCCAGGAACTATTTATAAACA 

TCCTGC AG G AAAATG CAGTGAAGTAG AAG AG ACAGGGATATC CC AG AAGGTTATGCAAAA 

CATCAAGAGAAGATGAGAGGAGTCTATATGTCAGAATACACATTTCCCACCTTGCCCAAC 

AGTAGAAAAACATAAGAAGAGAAAAACATTAAAAAATGACAAGGAAGTTAATGGAAGTCA 

GCAATGTGATGGTGTTTGGAGGTGGAGCCTTCAGAAGGTAATTAATGCCCTTGTAAGAAG 

AGGCCAGAGAGCTTGCGCACCTTCTTCCTGCCATGTGAGGAGCCAAGAAGCCGGCTGTCT 

GCAACCTGCAAGAGGACCCTCACTAGAAGCTAGCCATACTGGCATCCTCATCTTGGCCTT 

CCACTTCCAGAACTGTGANMGTATTTTGTTTGTGGTTTAGTCAATGGTCTATGGTAATT 

TTTTTA 

Sequence 2354 

AGGTACGCGGGCAAAGCCCATAAAAATAAAAAATTATAACAAACCCTGAGAACCAAAATG 

AACGAAAATCTGTTCGCTTCATTCATTGCCCCCACAATCCTAGGCCTACCCGCCGCAGTA 

CCTGCCCGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTAGNGAATTCGAATAT 

CAAGCCTTATCGATACCCGTTCGACCTTCGAGGGGGGGGGGCNCCGGTACNCCAGCTTTT 

TGTTCCCTTTTAGTTGAGNGGTTAAATTGCGCGCCTTGGGNCGNTAATTCATTGGNTCAA 

TANGCTTGGTTTACCTGGTGGTGNAAAATTTGGTTATTCCCGCTTCACANATTTNCCAAC 

AACCAACCAATAACTGAGGCCCGGGGGAAGNCATTAAAAAGTTGGTTAAAAAGANCCTNG 

GGGNGTTGGCCCTTAAATTGNAAGGTTGGAAGCCCTAAACCTTCACCAATTTTAMTTTG 

GCGGTTTTGCCGGCCTTCAACCTTGGCGCCCGGNCTTTmCCCMGTTTCCGGGGGGM 

AAACCCCTTGTTCCGNATGGCCCCAAGGCCTTGGCCAATTTNAAATTGGAAAATTCCCGG 

GCCCC 

Sequence 2355 

AGGTACGCGGGGCCCGCCGATATCTCTGCCGGGTGACTAGCTGCTTCCTTTCTCTCTCGC 

GCGCGGNGTGGTGGCAGCAGGCGCANCCNANCCTNTAAATGCANAACGACGCCGGCGAGT 

TCGTGGACCTGTACGTGCCGCGGAAATTGCTCCGNTTAGCAAATTTCNCAATCATCGGGT 

GCCAAAG GAACNC ACCGCTATTCTCATTN NCAN NATTGN AANCGTTGGN CACGAAGTGTT 

TGTACAAAGGGTTCAACCAAGGGCCAGGGTTTNTAAATNGGGNCACAAGGTTTNTAAAAA 

AACCNTTTAATTGGCCTAATNCCTAACNGNGGGGGCTCAATTATTCNGGTCAAGNGNAAT 

TGNGGGGGTTGGAAANTTCNMTNGAATTGGAATTTTCCCCCAATTTTNCTTN 

AATTTGGGGNNCCCANAGNGGACCCCTNAATTGGGCCCANTTCNNGTACCTTCAAAAAAG 

TAAANCC I rTTTTTTI IGTACCTTNGGGGAAGTAAGNAAAATTCCAACCCAAGGTAATTG 

GGTTGGGGAA . 

Sequence 2356 

CCGCGGTGGCGGCCCGAGGTACTGGAGATTCATATATGCAAATATTCTCATGCAAGAAGT 

TCCACAGTAACAACAGCAAAAAGAAAAAATTAGTTGTCCAGCCAGTGCTGGAGGAAAATG 

TTTCTGGGGAAGATGACTCAGTCATTTTGTGGCGAGACACCCTTTGGTAACTCCCACTGA 

CCAGTCTTGGGAGCCTTCCTGGAATGATCGTGGGCTGAGCGGAGATGTTTTTTGCAAAAT 

GAAACTGAAGCTGAAAGAAAGGAGAATTCGAGTGAACCAAGAAGAAATCCAAAGACCTGG 
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GGAAGGAGGACTTAAGATGAAAGTGAAGCAAGAGAAAAATAATTAAANAANANANNTNNN 

NNGTACCTGCCCGGGC 

Sequence 2357 

GACTACTATAGGGCTTTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACAACCACCCTCA 

TGCCTCCTACCCCCGAGGTTCCTAGAGCTAGGCTCTCCTGAGGCAATGCTTTCCTTCTCA 

ATTCATATTCTTCCAGGAGGGGCACCMCGTTTTTTAAAATGATGTTGGCGACGAGGACG 

GTAAATTTTCTAGATGACTGAAGGCTGACT7TCCCCTTTCTGTGACTCTCTAGGCAACAC 

CGTGACAGGAGAGACGACTCCCTTTTCTCCTCTTGTGACCACTTCTGAATCTGTGACCCG 

AAATCACAGCCAGTAGCTTTGTGGTCTCCTGGGTCTCAGCTTCCGACACCGTGTCGGGAT 

TCCGGGTGGAATATGAGCTGAGTGAGGAGGGAGATGAACCACAGTACCTGCCCG 

Sequence 2358 

CCGGGCAGGTACCATATAGGTCCAAACCCAAAACTGATTTTTGTTTTATGAAGTAGAAAT 

GGTTGCATGGGCTCTTGGCTAGAACCATCTTTGGCCCATTGAAAATCAAGTTTCTGAGAC 

TGAAAAGGAMCACAC AGTG ACACCATTTTGATGAGGAAGAGAAGAATTCTGAAGGAGCC 

AGAATATATTTGCTGMTTTTGACACTGAAGAAGAMTAAACATTGAAAAAACAAAGCAA 

MCAACAGCAACAA CAAAAA ACTAAACCCATTTAGGTTTCCACTACAGATGCAAGAAAAA 

AGTGCCCTTGCACGTTTTTTCTGTCTGAGTGTGTCAGCCAAGGTTGCATGGGGGAATTCA 

GGATGTATAAAAACATAGACAGAGAGAAGMGATNATTTTGCACCTCAGAAATAACCTTG 

TG AAC AGCTG AGTAGC CTGT 

Sequence 2359 

TCATAT AGGGCGAATTGGACTCCACCGCGGTGGCGGCCGATGTAC 1 rTTTTTTTTTTTTI 

TTTTTTGTACGGGATGGGTTTTTGCCATTGTTGTGCCAGCNTGGGTCTTGAACTCCGGGG 

GCNTGNAGGCCNGCCTCCCACCAACANAATTAACTTNNTNAGAGCAGCAAGTNCCCATAT 

GGNAACACAGTCACCACTCCCAGGAATTAAGAATGTGGGACAGTNGGACATCTTTGGGGG 

GCCATT ATTCA GCCTGACCCTGACTCAAGGTCTGGACCAACAAAGAAAAAATGGGCAGCC 

CAGCACTTTTTNCCTCCTTTCCTCTCCTTTCATCCCATTTTTTTACATTCAGT 

TTGTCAGAACACCAATTAAAAAAAGAAAGCNAAGATTGGGGGCT 

Sequence 2360 

AGGGCGAATT GGAGCT CCCCGCGGNGGCGGCCGAGGTACAGATTGTGTCCACTGGAAAGG 

TAAATGATTGCTTTTTTATATTGCATCAAACTTGGAACATCAAGGCATCCAAAACACTAA 

GAATTCTATCATCACAAAAATAATTCGTCTTTCTAGGTTATGAAGAGATMTTATTTGTC 

TGGTAAGCA \ I I I I ATAAACCCACTCATTTTATATTTAGAAAAATCCTAAATGTGTGGTG 

ACTGCTTTGTAAGTGMCTTTCATATACTATAAACTAGTTGTGAGATAACATTCTGGTAG 

CTCAGTTAATAAAACAATTTCAGAATTAAAGAAAT 

GATGMCAGTANGACTTTGTAGTTTTATTTCCACTAAGTGAAAAAAGMCTGTGTTTTTA 

AACTGTAGGAGAATTTAATAAATCAGCAAGGGTATTTTAGCTAATAGAATAAAAGTGCAA 

CAGAAGAATTTGATTAGTCTATGAAAGGNTCTCTTAAAATTCTATCGGAAATAATCTTCA 

TGCN AG AGATTTCA N GGT TTGGATTAGCCAGTGGGAATAAAGAAATGGGC ATTGGTTC CC 

TATAATTGGGCTGGTTTTATAACTTTTGTAATATTACCTTTTTCTGGCTGNGGTT^ 

CTTATCCCATATGCNTGGATGGGGAAAAAATTT 

Sequence 2361 

ACTNCTTAGGGCGAATT GGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTrTTTTT 

ITTTTTTTTTTTTTTTn I GGAGTAACATCCTCAGACAAGTAGATAGACAAGCCTCCAGC 

AAACAATGGGAGGAAGAATGCCATGATTCCTCCTTAAATTGTANAAGCTCTCTGGCCTCT 

GCTGTGCATAATCTGCCTACCCATCAAATCACAAAATTTGTTTGTTTATATAATATCATC 

CTTTCTCACAGCTTTTATCAGCAGAGTTCTTAAGTTCTCAAGGGAACTGTCTATTTGAGG 

AACTATAAGGATTGGTCAAATATGCTGGAATACTGCAAAGGCTAAAAAATGAGATTCTTT 

ATAAGACTATGAATCCAGTGAACTAAATATTAGGCAAGACAACATTGT7TTATCCANATG 

GCTAATTTGCATGACTCCGTTTAACACTTGATTTTTTT 
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Sequence 2362 

CCGCGGTGGCGGCCGAGGTACGCGGGGGTAGATGGAAGGAAGAACTTGTGTGCTTAGACC 

TGACGCTGGGAGGAGATGCTGCCACCTAGGTTACTTGTAGGACCCTATACGGCAACCTCC 

TTTGCCAGGAACTATTTATAAACATCCTGCAGGAAAATGAGTGTATATGTCAGAATACAC 

ATTTCCCACCTTGCCCAACAGTNGAAAAACATANGAAGAGAAAAACATTAAAAAATGACA 

AGGMGTTAATGGAAGTCAGCAATGTGATGGTGTTTGGAGGTGGAGCCTTCAGAAGGTAA 

TTAATGCCCTTGTAAGAAGAGGCCANAGAGCTTGCGCACCTTCTTCCTGCCATGTGAGGA 

GCCAAG 

Sequence 2363 

AGGTACCACACACACACAGGCACACGCAGGCACACACGCAGACACACACACATATAAACA 

CACAGAGTTCACTAGTCCGAGTTACTGATTTTCTTAGGATTCTCAAAGTGACAACACCGG 

AAACAAGGTAATTCATGTTAAAACACAAGGGTTATATCAGTAAAGAAGATGGGATCCCCG 

AAGTAAACCGTGGMTTTGMTCAAGCTTCGAAG AGC TAAAAAAAGAAA7TGG AG TTTC A 

ACATTCACCTTCTTGMTCCTTMGAAATACAGAAGTTCAAAATAGAAAACATTACAGTT 

TCAGGATACAAAAGTAGAAACATCTGAGATTAAGCCTACATTTTTAAAAAAAGATATTGA 

AAATTACTGTGCTTGTMTACCAGCTTTTCAAACATCAATACCAGTATTGGCATTACCTA 

GATC 

Sequence 2364 

GGGGCGAATTGGAGCTCCACCCGCGGTGGCGGCCCGAGGTACCGCGGGGGCCGCCGTGTG 
CCATTTCCTCTCTCCAGAGGACCTTTCCTGCCTAGGACTCATGATTGTCCCCTCCCTGCT 
GTTGCCTAGTTTCCTGGTATTAAGGAGAATCAACTCTCTGGATAAACGTGCCTTCTCCTG 
CCACGGCATGCACCCACACATGCGAGCCTCCCGGGTACCTGCCCGGGCGGCCGCTCGA 
Sequence 2365 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGTCCCT 

GMTTCCATCTGACTCTAATTTTATGAGAATTGCAGAACTCTGATGGCAATAAATATATG 

TATTATGAAAAAATAMGTTGTAATTTCTGATGACTCTAAGTCCCTTTCTTTGGTTAATA 

ATAAAATGCCTTTGTATATATTGGATGTTGGAAGAGTTCAAATTATTTGATGTCCGCCAA 

CAAAATTCTCAGAGGGCAAAAATCTGGAAGACTTTTGGAAACACACTCTGATCAACTCTT 

CTCTGCCGACAGTCATTTTGCTTGMTTrCAGCCAAAAATATTATGGCATmGGA 

TTATTCAAGGGCTATTACCTCAAACTTTTTTCTTCTCAGAAATCCCAGGGATTTTCACM 

GGGATACCTTGTTATATATGGGGAAAACAAGCAAGGTTTAT 

Sequence 2366 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCCGGCCGCCCGGGCAGGTACTTTACTCA 

CCCTTCCTCTGACAGAAAAGGATGAAGTCAAGGGCCTGGTAGAGGCACCACTAAGAAAGG 

CATCTGAAAGGACCAAAGAGAGTGACCAGCAAGCA7TTTTTGCAAGGCTGAGGAGCTGAC 

AGCTTCCATGAAAGGCTGGACCACCCAGTGGTTGAAAAGCATCATCTGGGTTACCTTGTG 

CTGCCATAAAACACACCACAGACTTGGTGACTTAAACCACAGATATTTATCTTCTCACAA 

TCCTGGAGGCTGGAAGTCTGCAATCACGGTGCCAGCATGGTCAGGTTCTGGTGAGGGCCT 

CTTTCCTTCTCACTGTGTGCTCTTTCTTGTGCATGGGAGAGAGAAGAGAGCCATTGAACC 

AAGNCCCTCTTACTGTCCCTCTTAAGAAGGGGCACTAAAT 

Sequence 2367 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGGGGGGAGGAGTTA 

GGGCTTAGGAAGAAGGAAATGCACAACTTGGGACCTGAGTTAGGCCTTCATCGTCGCGGA 

CTCAGTAGACAGAGGGCATAGGTGCAGCTGTGATGGTAGAGGTGGAAGACTGATTTGCAT 

TTTCACCAGGTGAGACCTGGTTACTCAATTGGTTGGATGAGGGAAAGAGATGATAGCCAG 

GGGCGGCATAAAACGGCAATGAGCTCAGTAAACAACCCCCTTCCCGTGATCTTCGTGGCT 

TCCCGCCCAGACNTGTGACCAATATAGCAAGGAGCAGTCTTTAAGGCGACACCGGCAGAG 

AGGCAGAAAAAGATGGGCCCTTACNCGGTTAGTTAAGAAAAAAAAGAGGACCTGTTGCCT 

GTCCCTCTAGCTTTGAACCTACAAAGTGGGAGGGGG 
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Sequence 2368 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCCGGCCGCCCGGGCAGGTACACGTCCCTAGAA 

Jff™ GCTCCAGCAACAA AGCTAATCTGAGAGTAATACCCAAGATGCCTCCTACCTTG 

CAAATTTCAAGGAATrTCTCACTGGTGTATTTCATAGGATGCTCAGTGAAAGTAGCATAA 

GGAACTTCAGTGGACCATGGGGTTCCAGCGGGGACAGAAGAGACTGCTCCTCCGGACTCC 

?^ G J AGATCCTAAGGCCTTCTCCTTGTC TCTTGTCCAGGGACATCCCAGGGAAGGTGA 

ACTTGCCCAGGCAGATGCGATAGACAGCGCTCAGAGGAATCCGCTGCAGCTGCACACAAC 

TCAGCATGATGGAAGTCGTATTTGCAGGATCAAGGAGAGTCTTGTCTGTGGACCAGTAAG 
NAAA I 1 CTCTCC I I CTCA l I G 

Sequence 2369 

GACTNCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTAGACGGGGCTGT 

AATCTGGTAACTGTATGTATTTTAGTTCTTCAGTCCCTGGGAAGGAGACAGGAGAAGGTG 

GGAGGGAGGAAGGGGCCAGCTGAAATGGAAACAGATCCCTGATCCGGGGCGGTCAGTGGA 

ACCCTTCTTGGTGTGCGAGAGCCTGTGCATTTCAAGAGGCAGCAAAAAAGTAAAAAAAAA 

AAAAAAAATTGATCTTTGTTTAGATTAACAGGACCCCTGACTATGAANAAGGAAGGCATC 

CAGACCAGAAACCGAAAAAATGTCTAGCCAAATCCAAAAAGTGCAAAAAAAGTTGCATTG 

ACTCACTGGAGGGACTTTCCCCAAGAAACAGCTNGTTTNAACCCGGCCCGCTCTTANAAC 
TAGTTGGGATCCCCCGGGCTGG 

Sequence 2370 

CCGCGGTGGCGGCCGAGGTACCATTTAATCCTCACCCCTGACTGTCAGTAGACTGCCATG 

GAAACCCCAAGGATGCCTGTCCTGTTGGACATGTGGTCAGAGACTGGCTCCAAAGATATC 

TCCCAAAGAGCAAAAGCCACCATCTTCCTGTTGGTGCCCAGACACAGGTGCATGAGCTCC 

AAGGACTCCTTCCTAGGCCACCTCTNTGCTGCCTGATCTAAACTCACCTCATTTCTGGGA 

GCTGGTCTGAGAGTCTGCCTTGGAGATGTGTTGGATGGGAACTAGGGGTTTGTAGGCTCC 

ATTGTGAGCAATACTGATGGGAAAATGACCCCAACTGGTATGGATATTTTTTGCANTAAA 

AGCCCCCTGNAAATNCTGGACATGCTCAGGGGTCAAGAGCCATTGAA 
Sequence 2371 

TTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAATCAACCAACAAGT 

CATTATTACCCTCACTGTCAACCCAACACAGGCATGCTCATAAGGAAAGGTTAAAAAAAA 

GTAAAAGGAACTCGGCAAATCTTACCCCGCCTGTTTACCAAAAACATCACCTCTAGCATC 

ACCAGTATTAGAGGCACCGCCTGCCCAGGTGGACAACATGTTTAACGGCCCGCTCTAGAn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCGCTTGG 

CGTAATCATGGGTCATAAGCTGTTTCCTGTGTGAAATTGGTTATCCCGCTCACCAATTCC 

CACAACAACATTACGGAGCNCGGGNAGGCATAAAAGTGTAAAAGCCTGGGGGT 
Sequence 2372 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTCAAATTTGGAGC 

ATCATTTGAAATCATTTTCTGATTCTAAGCTATAACTGAAATACAAATATAACATTAAGA 

CACCCCAAAGA AGAC AT TAAGG ATACACTTGTCGTTCTTAAGTGTGTTTCACTAAAAATA 

ATAACAAAAAACTTTTAATTTTTAAAAATGGTTCTAGCTTTTAAAAACCTGCAGTAAATA 

CTTrACMGTGGCTAGACACCTTCCCTGGTGGTTTAGTTTATACAAGCAGGAAACTTTCT 

CTCCTACTCTTTCTCTCTCTCTTTCTTGTTTTCTTTAAACCAACTGACGTAATGCCAAAC 

TTTGCTTTAAAAGMCAGACAGGAAGTAATTGGTAGCTTTTAACTTTTAATAATGTTCTG 

GGATTGGGTTTTAGTGGCCAAAATGCCATTTAAAAAAAAAAAAAGTT 

Sequence 2373 

AGGTACCATGATTAGTTAAATATAAGACTCCGTAATTTTTACAATTTTAACAATAATTTT 
ATTTCTTCAAGCTTGTTAGTTTGGGATTGTATTAAMCTACAGTGTGTGACTTAGAAAAT 
GATAATGCTGCTTTATGGAAAATGGATTATAGGTGGGTAAGACTTCATTGCCAAAAAATT 
GTGTAATACCATCAGTGTTAGGAACCCAGTTGAAGTCTAGAAGACAGATGATAGTATCTT 
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AGACTAGGTTGGTATTTGAATAGATATTGGTAATATCAGTAGAATCTTAATAATACATTA 
GAAAGAAAGAAATCAGAGMGATTCTTTTATTTTCACT^ 

AATGAGATAAGAAAGACAGGCCAAAGGAGGAACGTTCAGGGGCAGGGGATGAGGAAGAAA 
ACAAGGA 

Sequence 2374 

AGGTACGCGGGCATCCTTAGGAGACCTGAGTCCTCAAGAAAACCCTCTTCTGGAAGTAGT 

TGCTCCTTCAGAACGTTTTACAGAAAACACTAATGTAAAAGACACAACTAATGTNAAAGA 

CACAAAAGAGATGTGTTCAMGACACATTTTCTGAAAACACAAAANCTNACAAATTCAAT 

CCTCCCTGGAGNGGCAAGTTTTTCNGGCCTGGGGNAACCTTGGCAATTTCNAAAACTTNT 

NANGAAACCCCATAACCTTGGTTTNAAAMCCAAAAAACCTTGGAAGGAACCAAATAATA 

TTTGNGNGGAANATTMCCAAANCCAANMTTGGTTGGGGGGGCCCAATTTTGGAACCNT 

TTTGNGTTCNC CCCC CTTGGGAAGNCCCCCCAAAAAANAGGGCCTTITTCAANANTTT^ 

ANCCCCCC CAAT TTTGGGCCTTCNTTCCNGGTTTCCCCCCCCAAGGGGGTTTNGNAATTC 

CAAGGNCCTTTTGGTAAMNMTTTTTCNAAGGNCCTTAAAACCCCGGAA 

TTAACCNGGGGTTCTCCCCCTTNNATTTCCCCCCCCNAAACCGGAAGGGGGATTGGTTGG 

AAGGAAAAAAGGGTTTNCAATTGGGTCCTTNCAATTGGGTTTAATTCCNTGGGGGAACCC 
CTTTGGGAA 

Sequence 2375 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTCATTTA 

ATTAGGGTTGCAGAGGAGATGGAAAMCAAGCTGTTTGATTATGAAGGCAACACTTCTGT 

CAGTAGAATTGCTTTGGATTGTAAATATGTTTACTTCAGAGGAAATCAGTCAGAGTGGAT 

GGCT CAGCA AAACCCATCACAACTGCGGAAAAAAGGAAGTGACTGGCTAGAGGTAAGGAA 

ATAGATTTTCTCTAAGTTAACTGGCCTACGTGTGAATCGAACCCTGCACCCTGGCCTCAT 

TAGCACCAACACTGTGAAGAACAGCTGTGGCAGAGAAAGTGCCACGGCCCCACCCCATCA 

CCAGATTCAGTAGAAAGGCCCTTGGAATTTCCTACAAGGGTTCAGTTCTGAACAGCTCCA 

CTTC AAATC AGT ATAGAATC CTCAC AGTG ACTCTGTG AGGG G AGG 

Sequence 2376 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATGTGTCCAGGTTTC 

TCTCTTTGAGGGATACGTGATGCCCTTTGGTGTAAAGCTGTGAGAAACCCCCATGGTTTG 

GGATTGTAATAGCATGATGCTGATGATGCTGGAAGAGGCATCAAAACTCAGTTACTTCCC 

MGGTTTGACTTACAGACATAAACTTCCCAGGGACCATACTTTGTAGGAACTTAAGAAGG 

TATTTGCCATTATTCCMGGCTGGTTGTCCTTCAhlAAACAAAACATGGGGATCAGTGGAG 

CATTGGAAAGTGGGCATGCTGGGCACAGTTCATCCATTGTGGTCAAGAAGCATGTCACTG 

GGAGAGTACCT 

Sequence 2376 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATGTGTCCAGGTTTC 

TCTCTTTGAGGGATACGTGATGCCCTTTGGTGTAAAGCTGTGAGAAACCCCCATGGTTTG 

GGATTGTAATAGCATGATGCTGATGATGCTGGAAGAGGCATCAAAACTCAGTTACTTCCC 

MGGTTTGACTTACAGACATAAACTTCCCAGGGACCATACTTTGTAGGAACTTAAGAAGG 

TATTTGCCATTATTCCAAGGCTGGTTGTCCTTCANAAACAAAACATGGGGATCAGTGGAG 

CATTGGAAAGTGGGCATGCTGGGCACAGTTCATCCATTGTGGTCAAGAAGCATGTCACTG 
GGAGAGTACCT 

Sequence 2378 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTTCATCTCTCTGACCTCATTCGCATGGCATTC 

ATGGCTGCAACTGATCATAGCAACCAGCTGCGAATGGCTGGGCTCCAGGCGCTTGAAGAC 

ATTATCAAGAAGTTTGCGTCTGTGCCTGAGCCAGAATTTCCAGGTCATGCGATACTGGAG 

CAGTATCAGGCTAATGTGGGAGCTGCTCTAAGACCAGCCTTTTCACAAGATACACCATCA 

GATATAATAGCGAAAGCTTGCCAGGTATGTAGTACCT 

Sequence 2378 
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CCGCGGTGGCGGCCGCCCGGGCAGGTACTTCATCTCTCTGACCTCATTCGCATGGCATTC 

ATGGCTGCAACTGATCATAGCAACCAGCTGCGAATGGCTGGGCTCCAGGCGCTTGAAGAC 

ATTATCAAGMGTTTGCGTCTGTGCCTGAGCCAGMTTTCCAGGTCATGCGATACTGGAG 

CAGTATCAGGCTAATGTGGGAGCTGCTCTAAGACCAGCCTTTTCACAAGATACACCATCA 

GATATAATAGCGAAAGCTTGCCAGGTATGTAGTACCT 

Sequence 2379 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACGGCCTGGCTACCTGTGGGTTG 

TTTGGATCTATCGGMCCTCATTGGAAGTGTGCACTTCTTTTTCATCCTCACCCTCATTG 

TGCTAATCATCACCTATCTTTACTGGCAGATCACAGAGGGAAGGAAGATTATGATAAGGC 

TGCTCCATGAGCAGATCATTAATGAGGGCAAAGATAAAATGTTCCTGATAGAAAAATTGA 

TCAAGCTGCAGGATATGGAGAAGAAAGCAAACCCCAGCTCACTTGTTCTGGAAAGGAGAG 

AGGTGGAGCMCAAGGCTTTTTGCATTTGGGGGAACATGATGGCAGTCTTGACTTGCGAT 

CTAGAANATCAGTTCMGAAGGTMTCCMGGGCCTGATGACTCTTTTGGTAACCAGACA 

CCAATCAAATAAGGGGAGGAGATGAAAATGGAATGATTTCTTCCATGCCACCTGTGCCTT 

TAAGAACTGCCCANAAANAAA 

Sequence 2380 

AGGTACGCGGGGGTGTGCTTAGACCTGACGCTGGGAGGAGATGCTGCCACCTAGGTTACT 

TGTAGGACCCTATACGGCAACCTCCT7TGCCAGGAACTATTTATAAACATCCTGCAGGAA 

AATGAGTCTATATGTCAGAATACACATTTCCCACCTTGCCCAACAGTAGAAAAACATAAG 

AAGAGAAAAACATTAAAAAATGACAAGGAAGTTAATGGAAGTCAGCAATGTGATGGTGTT 

TGGAGGTGGAGCCTTCAGAAGGTAATTAATGCCCTTGTAAGAAGAGGCCAGAGAGCTTGC 

GCACCTTCTTCCTGCCATGTGAGGAGCCAAGAAGCCGGCTGTCTGCAACCTGCAAGAGGA 

CCCTCACTAGAAGCTAGCCATACTGGCATCCTCATCTTGGCTTTCCAACTTCCAGAACTG 

TGAGMGTATATGTTTGTGGTTTAGTCMTGGTCTATGGTAATTTTTTTATAGCAGTC 

AGCCAAGACAGTGCCTCATTTACTACATACCATTTATATTATTATATAGGCTCCTTTCAG 

AAACCCATGTTCAAATAAGAGATMGATACTGAAACACATAACACCTTTCGCTAGTTTTT 

AGTATACAAATATTG 

Sequence 2381 

CCGCGGTGGCGGCCGAGGTACAACATGGGGCTGTTCATCCTTCGTCTTGCTGAAGATGGT 

CCTGCCACCAAAGATGGCAGAATTCATGTTGGTGACCAGATTGTTGAAATCAATGGGGAA 

CCTACACAAGGAATCACACATCTCGAGCAATTGAGCTCATTCAGGCTGGTGGAAATAAAG 

TTCTTCTTCTTTTGAGGCCAGGAACTGGCTTGATACCTGACCATGGTGATTGGGATATTA 

A TAAT CCTTCGTCTTCAAATGTGATTTATGATGAACAGTCACCATTACCCCCATCTTCAC 

ATTTTGCTTCCATATTTGAAGAGTCTCACGTGCCAGTAATTGAAGAATCTTTGAGAGTTC 

AGATATGTGAAAAGGCAGAAGAATTAAAGGACATTGTGCCTGAAAAGAAAAGCACTTTAA 

ATGAAAATCAGCCTGAGATAAAGCATCAGTCTCTTCTCCAGAAAAATGTGAGTAAAGAGG 

GATCCACCCAGCAGTCATGGGCACAGTAACAAGAAAAATCTATTAAAANTAGAAAAATGG 

TGTTACACNAAGAGGTAGATCGGNTAGTCCCAAAAAGCCAGCCAGTCACATTCAGAGGGA 

ACATTTGGATAAGATTNCTAGTCCTNTNAAAAATAACCCCAAA 

Sequence 2382 

GGGCGAATTGGAGCTCCCCGCGGNNGCGGCCGATGGCCTCTTCTAGGCAGATCCGTCTAC 

AACTGGTAACAGCAACCATATCGAGTTTCTCTTTATCACAATACCACGAGGCTTATCATC 

ATTGCCATGCCTTTAGTAACATCACTTTTTTAAATGGGCATTTTCACAATGTAATC 

ACCTACATATAACATGCACTCTTTATTGAAGGCCGTACCTGCCCGGGCGGCCGCTCGAGA 

GACACATTACCTGAATGAGCAGGTGAAAGCCATCAAAGAATTGGGTGACCACGTGACCAA 

CTTGCGCAAGATGGGAGCGCCCGAATCTGGCTTGGCGGAATATCTCTTTGACAAGCACAC 

CCTGGGAGACAGTGATAATGAAAGCTAAGCCTCGGGCTAATTTCCCCATAGCCCGTGGGG 

TGACTTCCCTGGTCACCAAGGCAGTGCATGCATGTTGGGGTTTNCTTTACCTTTTTCTAT 

TAAGTTGGACCCTCGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnNAA 
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TATTCAAGCTTATTCGATTCCGGTCNACCTTCGANGGG 
Sequence 2383 

CCGGGCAGGTACGCGGGGGGGGGGGGATTCTTGATCCATGCACAGCGATGTGAGCTGAGG 

TGCAGGCACCAGACCTAGGAATTCCTAGAAAAATAGTCAGGAAGCATTTAGACACATCAA 

ATGTTAAACGAGTCCTGATTATGATGATAATGATGATGATTTTGGTGGTTGCAATAGCAA 

AGCC7TAAGTATGAAGGAGACGTGCCAGCTGGAAATACAGGTAGACAATGAACAACTGAA 

TTTAGAGGACGMGACATTGAMGCATTGATGCCACCAAATTGAGCCGTTTCATTGAGAT 

CAACAGCCTCCACATGGTGACAGAGTACCT 

Sequence 2384 

CCGCGGTGGCGGCCGCCCGGGCAGGTACACACAAAGACAAACCTGAACTTAATTTCAAGG 

AAAACTTAAACCCATGCAGAAATAATTGGTGAGCCTTCATTTCCCTGACTTCMGTTTCC 

ATGTGAGGACTCATGCTCTCTCCACTTTCTTCTTGGGAGGAGGGAAGATTTACCTAATGG 

GTAMTTTGGGCAAAGCACATTGAGTGTGCTTGTTTGGCTCTGAGTCTCTTTGCAAACAT 

GTGTCTGCCCACAGTGACATGAGTTTGCGTTGACTGTCATGCCTGCAGGAAGCTGCCTGC 

TCCTGTGACCATGTCAAGCAATTCTTTCTTTCAACTGCAACTGTGTGTAAGAGCTTAGTC 

TGAGAAGAAATGTTCAGAAGCTCACTGTGGCTGCACATCTGAGCCATGTCTTCCCATTAG 

TTGTCATGAGTCAGCAATAAAGCGGGTATGTTGATGTCTATCAATCTAAT 

Sequence 2385 

CCGCGGTGGCGGCCGCCCGGGCAGGTACACACAAAGACAAACCTGAACTTAATTTCAAGG 

AAMCTTAAACCCATGCACAAATAATTGGTGAGCCTTCATTTCCCTGACTTCAAGTTTCC 

ATGTGAGGACTCATGCTCTCTCCACTTTCTTCTTGGGAGGAGGGMGATTTACCTAATGG 

GTAMTTTGGGCAAAGCACATTGAGTGTGCTTGTTTGGCTCTGAGTCTCTTTGCAAACAT 

GTGTCTGCCCACAGTGACATGAGTTTGCGTTGACTGTCATGCCTGCAGGAAGCTGCCTGC 

TCCTGTGACCATGTCMGCMTTCTTTCTTTCAACTGCAACTGTGTGTAAGAGCTTAGTC 

TGAGAAGAAATGTTCAGAAGCTCACTGTGGCTGCACATCTGANCCATGTCTTCCCATTAG 

TTGTCATGAGTCANCAATAAAGCGGGTATGTTGATGTCTATCAATCTAATTCCTATGTTC 
TGAACTCAGG 

Sequence 2386 

CGCCCGGGCAGGTACCCGGGAGGCTCGCATGTGTGGGTGCATGCCGTGGCAGGAGAAGGC 

TTCCGAAGAGCCATGCATGCGTCCTCTGGCTGCTCCAGGTGTAAAAAATGCCAGGGAGGG 

GACAATGATGAGTCCTAGGCAGGAAAGGTCCTCTGGAGAGAGGAAATGGCACACGCGCAA 

GCACGGAGGCATGAGGGGCCAGGGTCCTGGAGGACACTGGGCCGGGACAGATCAGACTCC 
CCCCGCGTACCT 

Sequence 2387 

AGGTACGCGGGTTTCCTCAACATGGCTGCGCCCTTGTCAGTGGAGGTGGAGTTCGGGTGA 

GTCACAGAGCTGGGGCGCCGTGGGGATGGATTGAAGTCGTCGGGCCCAGAATTCCTTTCT 

TTGCCGTGGGGCCTGACA 

Sequence 2388 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCNCGCCCGGGCAGGTACGCGGGGGGGGGG 

GGATTCTTGATCCATGCACAGCGATGTGAGCTGAGGTGCAGGCACCAGACCTAGGAATTC 

CTAGAAAAATAGTCAGGMGCATTTAGACACATCAAATGTTAAACGAGTCCTGATTATGA 

TGATMTGATGATGATTTTGGTGGTTGCAATAGCAAANCCTTAAGTATGAAGGAGACGTG 

CCAGCTGGAAATACAGGTAGACAATGAACAACTGAATTTAGAGGACGAAGACATTGAAAG 

CATTGATGCCACCAAATTGAGCCGTTTCATTGAGATCAACAGCCTCCACATGGTGACAGA 
GTACCT 

Sequence 2389 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC I rTTTTTTTTTTTTTTTi 
TTTTGGGGAGAAGATCATGCATTTCCTAGGACAGGACTMTACATTTAGCTGTGGCTATT 
ATCAAAGAGCACCAANACCTTAGGTGGGGCAACCAGACTGGCTATGGGGCTCGATGTAAG 
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CCAGACCCATGACCCATAAACTATTTCAGGTAGGAAGTAAGCAATACTATAACTTGGACT 

CGAGGGGGAGCAAAAGGGCACCCCTCAATGGTGATGGCTGCCCACTTGGTCTCAACACTG 

TAAAACTTGGGACTTGCT 

Sequence 2390 

CTATAGGG CGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTC i ITTTTTTTTTTT T 
I I I I I I I I GGGGGGATTAC AC AATN N AAAAGCCCC AAGGATATTCATCAATTCTATTCTT 
TTCTCTTTATGACTATTTCCTTCTTTATCCTAMTTGTCAAAAATCATCAAGGGGCATTC 
TGATAC AN AAGCACTCTGATACAN AAAC AC N C ANTTAACATG AAAATTAACGGTTAACAA 
AAMGGAAAGTTACAGAGGATAAAATACCTTTTAGCCTTGACCC 

G GG C ATAC AG ATAC ATGCTAATATCCATACCTTGCTAAATATAAAAATACCTGAGAGTAG 

CTGGATGGAGCCCAATTTTGTATANACAGCCTGTTAGTTTCTCCATCTCTATATGCATTG 

GCATATAAAAAGTAAGAGAAAACTAAACAAGTTCTTTTTAAATGTMCTTCCT^ 

AATATGC ATAT AAGACTATGCATTTTAATTTTCATGAATACTGACCTTCTGTAAATGCAA 

ATACCC ATTTTCTAC ATTACAATATTACC C AAGGGGG AA 

Sequence 2391 

CGAGGTACTTI I I I I M I I I I I I It I t M I I N G ATGCTATTACTG AAT AACTCTTTCTGA 

GATGGTTGTTGTA TAGTA AACCATGAGTCTGGCATGTCTAAAATTGTCTTCATTTCTTTC 

CATAMTTTGAATGTTTTGCTGTGTAAGTCATTCAAAGTTGAAGACAATGTCCCATCTTT 

TGACAGAAAGGGAAGGAAGGATATGTGCTCAGCCTGCCAGTTTGACCCAC7TTTGAAAAG 

GGCT7TGTGGTCTCTACCAATGAGAAGTCTGGTGTCAATGTGATTTTCTTAACTTTATAG 

GMTTGTTTGTTTCCCTCTTTTCTGAAAACTTGNGCTTTCATCAGGCTCTACCAGCCTTG 

ATACTCTGTCACTCACTGGGCATGGTACCTGCCCG 

Sequence 2392 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGCCCGGGCAGGTACTTTACTCAC 

CCTTCCTCTGACAGAAAAGGATGAAGTCAAGGGCCTGGTAGAGGCACCACTAAGAAAGGC 

ATCTGAAAGGACCAAAGAGAGTGACCAGCAAGCATTTTTTGCAAGGCTGAGGAGCTGACA 

GCTTCCATGAAAGGCTGGACCACCCAGTGGTGAAAAGCATCATCTGGGTTACCTTGTGCT 

GCCATAAAACACACCACAGACTTGGTGACTTAAACCACAGATATTTATCTTCTCACAATC 

CTGGAGGCTGGAAGTCTGCAATCACGGTGCCAGCATGGTCAGGTTCTGGTGAGGGCCTCT 

TTCCTTCTCACTGTGTGCTCTTTCTTGTGCATGGAGAGAGAGAGCATGAACAAGCCCTCT 

ACTGTCCCTCTTAGAAGGGCACTAATCCCATAATAAGGCATTNCACCCTTCTGACCTCGG 

GCTAACCCTAGTTAAGTACCTCGGCCCGCTCTAANAACTAAGTGGATCCCCCGGGCTTGC 

AANGAATTTCGATATCAAAGCTTATCGATACCCGTCNA 

Sequence 2393 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGAGAACAGGCGATATCTTCACCATGCG 

CACAATGAAATCAATACTCAGAGAAGGCAGATAATTCTCCACGAAGCCAGAAAACTAATA 

MTGAACAACTTGGGTGAAATGTCCCACCAGACGGTGTGATATTTAGTAGCCCAGAAAGC 

TGCCAAGGGGTTGAATGACACTATCTGAAGATATGAACCAGTTTGCTCTCCATAGGGAGG 

ATTTCATCAACAGGAAACAGATGCCTGGAAGGCATTGGATTTGCTAAGTGTCTATGCCAT 

ATGATTCTGCTGTTTGCGTTTGATTTAGAATGCTGAGCTGACTCAAAGTCAAACTATTTC 

ATATGTTGAAATCCTGACTCCCAGCCTGAGGGCATTACGAGGGGCTCTGGGAGGTGATTA 

GGTCATGAGGGTGGAGCCCTCATGAATGGGATTAGTGCCCTTATAAAAAGAGGCATCAGA 

GAGATCGTTTGCTCCTTCTGCCATGTGAAGACACANCAATGGATGACTATCANGGTATTA 

AAGAAAGAACA 

Sequence 2394 

CGCGGTGGCGGCCCGCCCGGGCAGGTACGCGGGGGGGATGAGTCAGGGAGAGCTAGTGTG 
GTAGCAGTTTCTAGAGCTGTTTTCAAGGMTGGAMGAGGAGTGGGGAMGGATTTAGGA 
TCTATGGGGTTGGCTAGGTTTCCCTTTCTGAGTTTATATAATGGTTTCGAGAATAAATGT 
TGAAGGAGCANGAGGGTGTCTTGTTGAGAAGATTTAAAGGAGGGGCTACAAAGTAGAAGG 
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TCATCAATATATTGAATAACGTGAGAAGCACAGGGGTGGAAAGAAAGTAAATCATGAGAA 

AGAGCTTGGCTGAAGGAATGAGGGCTGTCCCTGAAGCCTCGTGGCAGTACCTCGGCCCGC 

TCTAANAACTAnnnnnnnnnnnnnnnnnnnnnnnnnnTCGATATCAAACTTATCCATACC 

CGTCNACCTCGANGGGGGGCCCCGGTACCCACTTTTTGT 

Sequence 2395 

CCGCGGTGGCGGCCCGCCCGGGCAGGTACTGTGATATCCACATATTTTTGAGAAAAATTC 

CCAAGCCAGGCGAATGTGGATTGGAATAAAGACATAGGCAGTGTATACCACCATAGCAAT 

AATGGTTAATAAGATGGCGTTAAACATAGATCGCTCCCAGGGCTCTAAAACAGCACAGCA 

GCTAATGATTTGGTATTGATAGTAGAGCCAGGAGAAATATTCCTTCACACGCCTCAAATC 

CATGGTTGGCTCCTTCAAGCTGCAGTAAGTTTGTCCTAAGAAAGTCCAGGTCTGGTTCTT 

CAGCCTTGCTCCTTCGCGAAATGATCCTGTGTGGGTTAAGTTCTCCTCTCTGGGTTGCTG 

TTTCCTCATCTCCCAGTTGGGTGTATCTCCCTGCGGCTTAGGTGAGCGCCGAGGCTTTGG 

CTCCTCCCCAGCTGCTGCGAGCTTCCCACTGCCCCGCGTACCT 

Sequence 2396 

TGGCGGCCGCCCGGGCAGGTAC I I I 1 II I 1 IGNI I 11 I 1 NANTCN I I I I I ANAACAAACA 
CAAAAAAGTTTTATTTAAAAAAAGAGTTTGAC 

GACACCCTGTTTCCTTGGAAACACAATGCANAACCACAAACTCTGAGACTTATTATAGCG 

AAAGTATTGTCATTCAGTTCAATCTMTGGTATGAGGTTAGCTTTGAAGGCCAAAAGAAA 

CTAGTTTCTAATTCTTATTCTTCTCCCCACTAGACTTGTGGCCTAGGTAATTTTGTAATC 

TTTCTGAGTCTGTTTTCTCATCAGGAGCAGGATAATTCCTAAGTAATAGATCATATGTAG 

AAGTGMTATGATCTCATGGCAGAGGACATNAACCATAGTTMTTATTAAGAATATATTT 

ACATTGAGCTATCCTTTTATNTACTTTAAAAA 

Sequence 2397 

CCGGGGNTTGGGGCCGGGGTCNCGGGGGGAAGGCCAACCCCCCAAAAGGGGGGTTGGCCC 

GAAGGGGGGCCACCCAAAGNTNNCTTTGGNTGGGGGGCCGGCCNTGGTTGGTTGGGGNCA 

ACCCCAANTTTNCAACCNAGGNCAAAAGGGTTCCCAAATTCCTTNTGGGGGGGTNCATTG 

GTTAAGGNTGGGCCAAAGNCCAAAGGGGAAAGNCCGGNGGCCCGGCCCCCTTTTNCCTTN 

TGGAAAAAATTCCNTTCCCAACCCCTTAAAGTTNGNAACCCAAAACCCCAAACNGGNAAA 

ACCCTTTTGNAAAGGTTGGGTTGGGTTGGGGGGTTCCCCCTTTTGGGCCTCCCAAAGNGG 

GAAATTCCCTTCCCCCTTTCCGGTTTTGGGCCAM 

CCAAGGCCCTTNCCGGGTTNTTAAAAAAAAGGGGGGAATTTGGGCCCCCCCGGGCCGGGA 

TTAMCCNCCNTTTCCGGGGGGGGNCNCCCNGNNCGTTTCCTTTTAAGGGAAAAACCCTT 

TAAAGGTTTTGGGGGGGAAATTTCCCCCCCCCCCCCGGGGGGGGGNCCNTTGGGCCNAAA 

GGGGGGGAAAAMNTTTTTNCCNGGAAATTTAANTTTCCAAAAMGGGNCCCT^ 

TTCCCGGNAATTTAACCCCCCGGTTTTCTGGAAACCCCCCTTTCTNGNAAAGNGGGGGGG 

GGGGGGGGGGCCCCCCCCGGGGGGNTTTTAACCCCCCCAAAGNNCC 1 fllllll I GGGTT 

TTTTCCCCCCCI HUM I AAMGNTTGGGGAAAGGGGGGGGTTTTTAAAAAAATTNTTG 

GGCCCGGNCCCGGNCCTTTTTGNGGGGCCCGGGT 

Sequence 2398 

ATTACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCTGGCAGGG 

CCAGTGGCAGGAAGGGAGGGACAAGTGGACAGTGGTGTGTCTGAATTGTAGAGTGTTAGA 

TTCCAGGTCATTCCCATCTGCTCAATTCCATCAGCCGCAGAGTTCAAGCTGTCATCTAAG 

CCCAGGGAGGTGTAAGGAACAGGCCACATCTGGTTCTTGGAAGTATATGCCCAGGCATTT 

AAAGCGGATCTTTCCTCTTTGGGCCATGGGTGGATTTCCTGCTCAAGCAACCTTCATCTC 

TTAAGCTTGTGTTGCTTGGAAATTCTGGGACT 

Sequence 2399 

AGCGGCCGCCCGGGCAGGTACGCGGGGGTTGGAGACCATTGCTCTATAGCAAGACCAGAC 

TTTGCCCTTCCTCCTCTCAGCCTACTCAACGTGAATATAATGAGGATGAAGACCTTTGTA 

ATGACCTTTTCCCACATAATGAATAGCCATTGGGAGACACACTTCTGAACACCACCACTG 
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GAAAATCACACATGCTGAAATGGGAGAGTTCCCTGACCCCCTTGCAGGATATGTGACAGG 
AGTGTGGCTCATCTGTTCAGCTGGAGTGCATACTCAAACCCCTTATGAGACAAGGAGTAT 
GCAGACAGGAAGGTGCAGGGAACTGGGGAAGCAAAATATAAACTAGTTAA 
Sequence 2400 

TATAGGGC GAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC ) rTTTTTTTTTTTTTTTI 
minim iGTAGAGATGGAGTCTTGCTGTTTTGCCCAGGTTGTTTTCGAACTCCTGACC 
TCAAGCAATCCTTCTGTCTCTGCCTCCCAGAGTGCTGGGATAACAAACATAAGCCACCAT 
G CCCAG CCMTCCTCAGTTATmCTAATACTAGATAATCTAATTGCTCACTTATTACAT 
TAl l M iCCAATTAAAGGNGNMGATTTAGACATTAAATTTAAGTTATAAGTGTTTCCTT 
GAACACCTAATTGNAACTGNMCTAGMCAGGGGATCATTTGGTGACTATTGGAAAGTTT 
CAGGTGGATTTTTAAATNATATTTTCAATC 
Sequence 2401 

CGTGTCCTCAGATCTCAGGCTGCTCAGCTCCATGTAGGCTGTGTCTGTAGATGTGTCCTC 

GGTCATGGTGACTCTGCCCTGGMCTTCTGTGCGTTGATTGTTTCACCATCTTCAGGATC 

AAAACCTCCCTTCCACTCAAGCCCTTTTCCAGGGAGCCTGTCCGCACCCAGCTGNCATGG 

GATAMTTTCANGTGAGGGGTGTTATTCCCGGAAAACCTTTGGCAGNGNAAGACCNTTCA 

ACTTGGAGGGCCCCCAAGGGCTTTNTTTCACCCTTCAAGGCCCCCCAGGAACTTGNTACC 

CTTCGGGGCCCGGCTTCCTTAAGTAAACCTTAAGGTTNGGGGATTCCCCCCCCCCGNGGG 

GNCCCTTGCCAAAGGGGGAAAATTTTTCNGTAAATTAATTTCCAAAANGCCCTT^ 

TCCCGGAANTAACCCCCGGTTTCNGGAAACNCCTTNTNTAAGGGGGGGGGGGGGGGGGCC 

CCCCGGGGTTTACCCCCCAAANCCTTTTTTTTG 

Sequence 2402 

TCCACCGCGGTGGCGGCCGAGGTACGCGGGGAGGATTAACAGAAGCAGATTAACCTCAGA 

AATCCTGTCTGGCTGGCAGATTTCAAGTAAAAAAAAAAGGTGGGTTGGGGGGACCCTTTT 

CTTTCTAGTTGTCTTTAAGGGAAAATTAATTTTAC i I I 1 I 11 I i GTTCTGGCCCGAAATT 

TTNTATGGAGNATATCCTCNTCACCTTGTCTNTCCACTTTTGAAACNCGGTTAAAAGGCT 

NCA TCAGCCTGG TCCAGCTTCTGGAANTGNAGGGAGNGGGGGAAGNAAGCCCCCNTGGGG 

GTCTTTTCTTTTTGGAAMGGGNAATTCCCGCNTTGGCTTTGNAGGGGGCTTGCCCCTCC 

CT CATT GGGTGGTGGCCGTTGGTTCCGNTTTTCCTTTCTTTTCNCTTGGACCCGCCATTT 

CNTTTTGGTTGGNATTAATTCCAAGGNAGGGGGTTTAAAAC^ 

GGCCCAATTTTCNCAAAGGGGCCCGGG 

Sequence 2403 

AGGTAC j I I II I I II M I I I II I i 1 IN 1 I GTTTAATTACATTTATTTC ATTGAATAATG 

GATTGTCAACAGTAGTTATAAAATAGTGCATGGCTTAATGTGTTAATTATCATAAGTAGA 

ATCCACAATTAAAGTTTGATATTTGCTGMTACTTTCTCTGCAAGGATATAAAAATTAAT 

C ATCCA C AACCT AATTAACTTAAATTTCAAATTAMTTCACATATGNGGGTGTTTTACTA 

TCTTTTGCTTTTCGGTAATGTTGGTGACATT^ 

CMTGTGAATTAATTGGGTAATMCTTTAGTGTCACCATC 

A 

Sequence 2404 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGATGCTGCC 

ACCTAGGTTACTTGTAGGACCCTATACGGCAACCTCCTTTGCCAGGMCTATTTATAAAC 

ATCCTGCAGGAAAATGAGTCTATATGTCAGAATACACATTTCCCACCTTGCCCAACAGTA 

GAAAAACATAAGAAGAGAAAAACATTAAAAAATGACAAGGAAGTTAATGGAAGTCAGCAA 

TGTGATGGTGTTTGGAGGTGGAGCCTTCAGAAGGTAATTAATGCCCTTGTAAGAAGAGGC 

CAGAGAGCTTGCGCACCTTCTTCCTGCCATGTGAGGAGCCAAGAAGCCGGCCTGTCTGGC 

AACCTGGNAAAGAGGACCCTTTACTTAGNAAGCTAGC 

Sequence 2405 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTATTATTCTTTCC7TT 
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TGTGTATGT1TGAMTTTTCCATTATAAAAAGGACAAAAAAAAAAGAGAGAGAAAACTM 

AAMGAAAAMTAG TGTT GCGCTTGTTAGTAAAACCAATCTTMGGCCAGTTACTAAAAA 

GTAMCTCCTACAATTTTGAATAGCTATTTGCTMTACAGTGATTTGCTGCCTTCTTGGG 

GTTAGTTTAGCCTGTCCATTCCTTATTTAACTTGCTCCCTTCCTTCCCTTTTAATAAAGT 

CCTGCCTMGCTCCAGGCTCTATTTTCCACCTCTCCCCTAAGATTTAGGATTTGTTCCGT 

TTTAAGGTCAGTGGCCCATATGTCTTCCTGTTCTCC 

Sequence 2406 

AATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCTGGACTCATACCTGAAAGCAGTGTT 

CAACCTTAGCAAAATCTCCAACCAGCGCATGAACAATTTTCTACATCACAACGACCTGGT 

TTTCAAATTCAGCTCTCAAGGCCAAATCTTTTCTAAATTTAACCMGAACTTCATCAGTT 

CACAGAGAAAGTAATCCAGGACCGGAAGGAGTCTCTTAAGGATAAGCTAAAACAAGATAC 

TACTCAGAAAAGGCGCTGGGATTTTCTGGACATACTTTTGAGTGCCAAAAGCGAAAACAC 

CAMGATTTCTCTGAAGCAGATCTCCAGGCTTGAAGTGAAAACGTTCATGTTTGCAGGGA 

CATGACACCCACATTCCAGTGCTATCTCCTG 

Sequence 2407 

CGCGGTGGCAGGCCGCCCGGGCAGGTACCCGTTMCTTCCMTTAACTAGTTTTGACAAC 

ATTCAAAAMGAGTNATAMCTNNGGCTTAATTTTAATAATCAANACCCTNCTAGCCTTA 

CTACTAATAATTATTACATTTTTGACTANCCACAACTTCAACCGGCCTACATAGGAAAAA ' 

ATTCCACCCAC TTAAC GAGGTGCCGGCTTTTCGTACCCCTTATTATCNCCCNNGCCCNCG 

GNCGGTNCC NCTT TT NCTT CCCAATAAAAAANTNTTCNTTTCTTTMNGGTAAGACCTTA 

ATTTMNCCCTTTT CCTTTTAA TTATANTTTTTNGGAATCCCTTM 

GCNCCNCTTCCCCTTTTTTTTAACCCCNCCCTTTAACNNCNATTTGNAAGGNCN 

NNCNAAAAAACCAAAATCTTTANAACCCCNTNGNCCCCAACCTTAANAATTAAGNGTT^ 

NATTNGTGTCAAATTCCCCCTrrCCTTTTAATTTTTAAAAA 

Sequence 2408 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGCTCTCACT 

GCTCCCAAAAATGGCGGACGCATTCGGAGATGAGCTGTTCAGCGTGTTCGAGGGCGACTC 

GACCACTGCGGCGGGAACCAAAAAGACAAGGAAAAGGACAAGGGGAAATGGAAGGGGCCT 

CCAGGGTCTGCAGACAAGGCAGGGTAAGGAAGAAGTTCCAGTTGTTGCTTTTATTTTTTT 

TCTCGGTGGGGAGGAAGAAAACACTACTCTCTTAGGAAGAGCACCGAGAAAGTGAAGTGT 
ACCTGCCCG 

Sequence 2409 

TANCACTNCTATAGGGCGAATT GGAGCTCCCCGCGGTGGCCGGCCGCCCGGGCAGGTACT 

"illiillNIIIMlllllll I CCTTTNAATGCTGTTCACTGCTACCCATTTAGTCAA 

CTTGGACCACCCTNTANAGGGGCTTCAACTGTTATTTTCAATACATTGTCACAGGGACAN 

AGGAGGGTGTTAGGGAACTNTGTGACTAAAGAGTTTTCCATAAATGGGTGGACCTCANTT 

TACAAGCATTAANTTATTGGGCCAGGAGGGGGAACANATCATGATTAGGAGAGTGGAAAA 

NAAAGATGACAAATCACTTTAAGTCANACAAACCTTTTTNTNTACCTCGGC 

Sequence 2410 

GTTTACCCGNCCGGGGGGGGNGGAAAACCATATNGTTGGTTGGTCCTNTTAGGGAACCCC 
TTGGTACCCGNCTTGGGGGGNAAGGGGGAAAGGAATTGGCCATGGGCCCCAACGCCTTAA 
GGGTGGTT TAACCATT GTNGGTGATGGGGGAACCCCCCTTTAATTANCCGGGGGCAAAAA 
CACNTANACTTTTTTTGTCTCNANGGNGCAAAC^ 

ATTNCTCCTTGCCATGGGNANAAAAANTGGGAAGCTACCTTAAATNATTTGGTTCCANGG 
CAAAANTAACCCAACCAATTNTTTTCCACCC^ 

GG TTAAT GNAAAAAAAAAACCAATTTANATNGGAAAAAGGNAAAGGAAAAANA^ 
CACTTTTATMNAAANAAANATTTGGNAACCCAAAANGGTGGGAAAAAGGATTTAAA 
TTGGGGGGAAAAAGGCCCAAAAATTGNGTTGGAAATTGGGGGTTGGTTTTm 
AGGNGGGTTGGNGGCAAGNCCNCCTTmCAAAGGAAAAAGGGGTTNATAAATTTTAAAA 
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AATTGGCCCCNCCCTTTTNGGTTATAAAGGAAAAAGGGAAAGGGGGCCCCCAAGAAAAGN 
AAGGCCCTTTTGGGCCGGCCMCCCCCTTTTNCT 

GGNAAGGGGGNAAGGCCCCCAAAAAGGNAAAAAGNCCNCCGGGGGCCTTTGNNTTCCTTT 

GGCCCAAANCCCCTTTGGNCCNTAANGGAAGGGGGGACCCCCC 

Sequence 2411 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTTTACTCACCCTTCCTCTGACAGAAAAGGATG 

AAGTCAAGGGCCTGGTAGAGGCACCACTAAGAAAGGCATCTGAAAGGACCAAAGAGAGTG 

ACCAGCAAGCATTTTTTGCAAGGCTGAGGAGCTGACAGCTTCCATGAAAGGCTGGACCAC 

CCAGTGGTGAAAAGCATCATCTGGGTTACCTTGTGCTGCCATAAAACACACCACAGACTT 

GGTGACTTAAACCACAGATATTTATCTTCTCACAATCCTGGAGGCTGGGAAGTCTGCAAT 

CACCGGGTGCCCAGCCATGGGTCAGGTTCTGGGTGAGGGCCCTCTTTCCTTCTCACCTGT 

GTGCTCTT 

Sequence 2412 

CCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC rTTTTTTTTTTTTTTI 

nrrm i attctttgggcagctaaaatctagg atggtgttaag tttcttcattttgtc a 

ATTATATATAAAAAATTAGAAACAACATGMTCTGCATTTCTTGGATGAGATAGTTAATA 

ACAAACTATTTCTCAATATTTGTATACTAAAAACTAGTGAAGGNGTTATGTGTTTCAGTA 

TCTTATCTCTTAT7TGAACATGGGTTTCTGAAAGGAGCCTATNTAATAATATAAATGGTA 

TGTAGTAAATGAGGGCACTGTCTTGGCTGGGACTGGCTATTAANAAAAANATTACCATTA 

G 

Sequence 2413 

I M I I I I I I I I M II I I I I GCTTTTCTTCTTCTTTCCCTTTCAGGCATCTCCTCGTTTTA 

TTTTCTGTAACTCACAGAGTGCTGCAAAATAATGTTAGGGCTTTGCAGTTTATTCCTTGC 

GATTTAACTGCATAAATATAAAAAGTGCCTGAGCAATAATGCACGTTGGGTTATTCAATA 

ACACAAGGAATGTAAAGCCGATTTCATATCTCCTTGAATCAGGAACNTTAATTAATCATA 

TGATGAGAATGCNACGTTTTNTTTACCANGGNCAAGGAGGCTGCCAAGAANCTTTCTGAG 

GTTTAATAATTGCACCCNCTNTGGAGGGGAGGGAAAAAT 

Sequence 2414 

CTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTTTTT 

I II I I I I I I I I n I I n IGGGCAATACCAGCTTTCACCCTGTCCTCTGCACTGCTTAGCC 

TTGGCTCTGCTGAGTTGNGCAACACATTCTGAGAGATGCCATACAATGCTCCAGGCAAAC 

TTATAGAAAAACAGGAATGAGTGATTTTATACGGGATGTGTTTCAGCTGTCCAATTCAAA 

ATAAATCACGTCCAGCTGCATTCTTTCCTAATTTGNGACCCAAACTGNTTGCACTTTATA 

CATTMTGMTTATTTTTTTAAAAAAGGAAATTAGCCTCTATCATGGAAANACAGTGGCT 

NTCTAACCCCTTTGNGGGTCCTTTCTCCCTTTAAAAAGGC 

Sequence 2415 

CGACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATAGTTCACAT 

TTTTTGGAAGAATTCCACAAAAGAAGAAGAAAAGGAAATCTAATCTAAAAA 

GGAGGGGATAGCAGATTGAAAATGAGAATTTTGAAAGTTTCAATACCAGTGGGGAAAGCA 

TTTGCTAGAGTTTAGAAGGACAGCATATGGTACACCAGATCACGAGACATCGTTTCATAC 

TTCCCAAATAGTTTTATATTTTAGCTTTGMGGTCAGTTACCAGAGCCAM 

AACMGCAGAATTTTATGTCCATTCAAAGAGTCTCTTACACCTTTCTGGGCCTATTCACT 

TGCAGAGAGGAGTCGAAACTGTAACCAG 

Sequence 2416 

ATANGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTNTTTTNI FTTTTTTTTn 

rmnrm i actactttcttctttcttattttctcattggcctcaggtcttggaatatct 

TGAATTCAAACTAATAATTCACCTCCTTCTCAGCACTCTCAATATAGTCTTTTGCCAACA 
CCTGATGAGACATATAACTTATGACCATTGCACTTCTNM I I I I I CCATGGTTTGCATTT 
ANGAGGIMTTTTCTTCAGCTATTANAGCATATCCTTCCTCATNATTTTANANAAATACT 
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TGGTCAAATGACCTTTTAAATAAAAGATGGTTC 
Sequence 2417 

^ TA TAGG GCGAArrGGAGCTCCCCGCGGTGGCGGCCGAGGTACT I rTTTTTTTTTTTTn 

111 l l l I NCAAATG TG AC A GG CATTTATTTTAAC AG AGAGG AAGGGGAAG ACAAAATATC 

ATAAAATGTGTGGGTTTTCCCCANATGACTCTGTTGAAAAAACCAnGAAATGCAGAGCGG 

GGGGACAAATGGAGGAGAGATNTTAACTAGAACATTGCTGACTCAAGGCCGTCCAGCTTC 

AGNGGGAGTTCTGATTAGGCCCCATGTNTGCCTGATGCCACCTGCAAC 

Sequence 2418 

CCGCGGTGGCGGCCGAGGTACGCGGGGACGCTGGGAGGAGATGCTGCCACCTAGGTTACT 

TGTAGGACCCTATACGGCAACCTCCTTTGCCAGGAACTATTTATAAACATCCTGCAGGAA 

MTGAGTCTATATGTCAGAATACACATTTCCCACCTTGCCCAACAGTAGAAAAACATAAG 

AAGAGAAAAACATTAAAAAATGACAAGGAAGTTAATGGAAGTCAGCAATGTGATGGTG7T 

TGGAGGTGGAGCCTTCAGAAGGTAATTAATGCCCTTGTAAGAAGAGGCCAGAGAGCTTGC 

GCACCTTCTTCCTGCCATGTGAGGAGCCAAGAAGCCGGCTGTCTGCAACCTGCAAGGAGG 

ACCCTCACTAGAAGCTAAGCCCA 

Sequence 2419 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGCAGTGGGAAGCTCGCAGCAGCTGGG 

GAGGAGCCAAAGCCTCGGCGCTCACCTAAGCCGCAGGGAGATACACCCAACTGGGAGATG 

AGGAAACAGCAACCCAGAGAGGAGAACTAACCCACACAGGATCATTTCGCGAAGGAGCAA 

GGCTGAAGAACCAGACCTGGACTTTCTTAGGACAAACTTACTGCAGCTTGAAGGAGCCAA 

CCATGGATTTGAGGCG TGTGA AGGAATATTTCTCCTGGCTCTACTATCAATACCAAATCA 

TTAGCTGCTGTGGCTTGTTTTAGAAGCCCTGGGAGCCGATCTATGTTTAACACCCATCTT 
TACTAAACCATTATT 

Sequence 2420 

TGGGGTAAGNCTTTCTCANCTCCGTCCGNGNTTGGGTCAGGGTCTCCGTCTCNCCGGGGG 

TCCANGGGTTTACCCGTCCGGGGGGGGGNNACCNCTTGGTACCCGCCTTGGGGGGTAGGG 

GTAAGTAATGGCCTTGGTCNCCACCCCTTTAAGGGGTNTAACCTTNTGGGTTANGGGGAA 

CCNCCCCTTAATTACCCGGGGCNAAGGCTCCTTCNCCTTTTTGGGGCNCCAAGGGGNAAA 

ACCTTAAATTNTTTATTNANAMCCAATTTCC^ 

CCAAGGGTGGGANAAGGTTANGGTAAAAGGGAAGGTAACCAANGGGGGGGTAATNAATTC 

CTCCCAAAGNAANAAGGGGGTTNTTANTTTGGCCCAAAATANTACCCAMTTTCTAATAN 

GNGAAGNAAAAAGGNAATTTGGAAAGGGAAGGGGGNAAGGTTCCTTNAATTTAAT^ 

TTCCAAGGGAAAAATTAANCCAACCCAAA \ Mill 1 1 CNNCCCCAAACCCCCCTNTTTGG 

GCCCCCCCCAAAAACCCAAGGGTTGAANGNTAAAAANAANAAAMCCCCMTTTTTNTNN 

NNNACNACAANNACNATATAATATCATNANAAGAATAANATNATTATCACATNANNACAA 

T NANGA GAANAATAATNANAANGANNGGGGNTTTNTCCCCCTTTNGGGGGNCCCCCCGGN 

CTTTTTCCNTTTAAANAGAAAAAACCCTTTAAGGGGNGGTGGGGGGGGAATTNNCCCCCC 

CCCCCCCCNGGGGGGGGCCCTrrGGTCCCAAAGGGGGGGNAAAAMTTTTTTCCCGNNNA 

ATTTNNTTTCNAAAAAAGGGNCCTTTT^^ 

NNGNGAAACCCCCCTTTCTTTAAAGGGGGGGGGGGGGGGG 
Sequence 2421 

TGGGGNCCGGGNGTCCGNCTTGCTTTTGCNCCGGCNTGTGCCCTTGCGGGGCTTGCAACC 

NTGGNNACATTCNGGCTTGGNCGTCTTTCNGGGGTNTCTGTTNTACNGAGACCTTGGGCN 

GTGGTCCGTAAGTCCGNGNTTATTNCAANGACATNCAAACCTNCAANATAGGGGGGCCGG 

GGGCTAANAATTAACCGGGGGGTTTTAATTTCTCCAACCAAAGGGAAAATTCCAAAGG 

GGGGGNAATTTAAAAACCCGGCCAAAGGGGNAAAAAAAGGAAAANCCAATTGGGTTNGNA 

AAGGCCAANAANAAAAGGNGGGCCCCCAAGGCCNANNATAANAANGNGGGCNCCAATGNN 

GTAAANANCCCCCGGGTTTANNAAAAAAANAAAGNGGNCCCCCGCCNGGGTTTNTGGNCN 
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TTTGGGGTCCCGGTTTTN 1 I 1(111 I CCCCAAATTAAAAGANGGGCCNTTTCTCCCGGTC 

TCNCCGCTCCTCCTTNGGNAACCGGNAAAGGNCCCANTTTNCNAATCCAAAAAAAAAAAN 

AAATTCGGGGAAACCCGGGCCTTTCNAANAAAGGNTTCNAANNNGTGAAAGGGGGNTTGG 

GGGGGCCCGGGNAANAAANAACCCCCCCCGGTAACCCCAAGGGGGGGAAACCTTTAAATT 

TNATAANAAAGGGGAAATTAAACCCCAAANGGGGGGCCCGGG I I I I I I I I I CNCCCCCCC 

NCTTTGGGGGGGNAAAAAAGNCCCTTTCNCCCCCGTTTCCGGGTTGGGGCCCGGCCCTTT 

CTTTCNCCCTTTGGTTNTTCCCCGGGMCCCCCCNTTTGNGCNCCGGNCTTTTTTACCNC 

CCGGGGGTAATTNACCCCCTTGGTTTCCC^ 

GGGGGGG 

Sequence 2422 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCAGAGTGCACAATGTAAGAAG 

T CACT CACTCTTGGGCATGCGCTAGTTCAGGGTTGGCAGCGACAGTGAGTGCCTTCTTAC 

ATTTTGCACCCTGGATACCTTCCTTGCCTCACCCTAGACTGGCCCAGATGTCTAGTCTAC 

AGNTC ACNC TTATCACTTGCMTTTCTATTTCAATGACTACACTTTTNACTTCCAAGGTT 

TCTATATTTTAATCCCTATCTATTNTTMGMTTCTTTGTTCCCCTTCCCTCTTTTA 

ATTTGATGATTCAAGAATNC 

Sequence 2423 

CCGCGGTGGCGGCCCGAGGTAC I I 11 I i 11 I I 1 ! 1 I I I I I I I I 1 1 I I AACTGCAACAACC 

TAACAGGAAAACTAATCAGAGATCATGAGGAGAGTGGTATATCAGTCAGGGACTAATCAG 

GAACATAATCCACTTGAGAATTTAAAATAGAGAGGGCTTAGGCTGGGGAGTTGGTGACAG 

CATAATGGGAAATCTACAAAGCAAACAGGAATAGGGAGACAACCCAGAGCCCATNTAGGC 

TGGCAGAGTCATCATCTNTAGGCTGGAAGGTCAAAANAAGGTGGTGTTAGTGGAGCCAGG 

AGATGCACAGTCTCCCAGGGGAAGCTGGAGCTAGGGTGTACCTGCCCG 

Sequence 2424 

ATTATTTTGATTTTATATCTGAGTATGTTTNTAGCCAGAAGAGGAGAAGAAAGAGGTTAA 

AAAGCAAAAACAGAATTTATAGATGCACGATATTCTTCTGAACAAGAGAAAGAAACTGGT 

AGGAAAATTTATTTTAAAAAGTGTTGAAGGGAAAGAATCAAGACCACAGATCCAGATCCG 

GAGATTATTTTGCTAAAGAATAGCAATTGTGAGGCATGAGGTGGGAGGGGGGAAGAAGCT 

ATGMCTTAATTTTGAGGTTTCTGAATAAGGAAACTTGAGTGAATTCACTTCAGATGCAT 

TTGGAATGTTNGCACTCCAGAAGATNANATTGTGTGTGCTCTGGAGAGTATTGGAAGA 

Sequence 2425 

A I I I M 1 I I M I M I I I I 1 I I 1 11 1 NCAAANTTCTTTC I ill I 1 AATTT AAATTG AGTTC 

ATTTATTTGAAACANACTGGGCCAATGTCCACAAANAATTCCCGGTCAGCNCCACCGATN 

TCCAAAGGNGCANTNTCAAGGAAGGGCAGGCGTGATGGCTTATTTGTTTTGTATTCAATG 

ATTGNCTTTCCCCCCCGNGTACCTGCCCGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCCnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnCAATTCACANAACATNCCAACnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnNGGNCC 

CACCTGCCCCTTTTCCAATTCGGGAMCCTNTNGTGCCCACTGCATTTANTGNAATTGGN 

CCAACCCCCCCGGGAAAAGGCGGNTTTCCTTTTGGGCCCTTCTTCCGNTTTTNTTCCCCC 

CCACCNCCCTTGCCTTNGNNCT7TTNGGTCGGGGGGNNAGCGGNTTTTNACCNCCCCCAC 

AGGGGGGNTATTTCGGGTTTTTCCCCCNNTTNNNGGGGGATACCCCCGGAAAAAANTTNT 

CCCCNCNCCCCCAAATNGTTTGTACCNTAAAAGGCCCTTTNCCGGGGTTTTTTAAAAGGC 

CCCCCCCCCCCC 

Sequence 2426 

CCGCGGTGGCGGCCGAGGTACGCGGGGGATGTGGGAGGATTGCATTCAGTCTAGTTCCTG 

GTTGCCGGCTGAAATAACCTGAATTCAAGCCAGGAAGAAGCAGCAATCTGTCTTCTGGAT 

TAAAACTGAAGATCAACCTACTTTCMCTTACTAAGAAAGGTATTAAGCGCCTTTCTGAG 
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AGCTCTCAGTGGGCTTCCTAAGCACGTTCACTCTGCCTCCTTTAAGTCTTATATTTATGA 

TCAAGAT GAAAG GGAGGGATGTATCACTGCACAGAGATTCTACAAGTGGATATATAAAGC 

TAATAATGTTTTCAGTGAGCTCTTCTCTGGCAACTACTn*CCAAGTGCTTTCAGCATTCC 

ACCTTGAAGCACAACGTTAATAATGCACMTTTCTTATGTTCAGCCACAGAATGTTCACG 

TGTGATTTGGTTACAACATATTGCTCTATCCAGGCCTCTAACAAAACTGACCTCTTTCAA 

ATGGACTTGCAGTTGCTGAATTCCAAATCAACACTTTATCTTAAAAACAGAAAANGAAGG 

GAANGGTGGANGGAGGGGAACAAGAAANGGTCCTGNCCG 

Sequence 2427 

CCGCGGTGGCGGCCCGCCCGGGCAGGTAC 1 > 1 1 I I 1 I 1 I 1 I 1 It I I I 1 1 I \ CCTTAATAA 

GTG AGGC CAAGTGCTGTGTGTAACTGTGGGGTGGGCAAAATTAGGATTCCTGAATCACTT 

CACT TTTT GTTGCCTTTCTCTTGGGAGAATGGTTACCATTGCAGAGAAGCATAGTTAAAA 

TTGCTTTTGTGTTTCATAAACGAATTCATACTTAGATGTCATTGACCAACAACTAAATAG 

TATATCAGAGATGAAAGGCATCCTAGGGATGACCAAATCAAGCTCTTCATTACTCAAAGG 

AGAAAAAAATCCAGAAAGTGGATGACATATGTTGAACTAGTGATGGAAGTAGAACAGGGT 

AGTGTATTTTCTGCCTCTTAATTCTTCATTTTTTACACTAAACCANATATTAGGTAT^ 

ACTAGGAAGATCGTGAGGTAAGGAGCAAAGGGTTTGGC 

Sequence 2428 

CGCGGTGGCGGCCGAGGTACGCGGGGGAGTCTGATCTGTCCCGGCCCAGTGTCCTCCAGG 

ACCCTGGCCCCTCATGCCTNCGTGCTTGCGCGTGTGCCATTTCCTCTCTCCAGAGGACCT 

TTCCTGCCTAGGACTCATCATTGTCCCCTCCCTGGCATTTTTTACACCTGGAGCAGCCAG 

AGGACGCATGCATGGCTCTTCNGAAGCCTTCTCCTGCCACGGCATGCACCCACACATGCG 

AGCCTCCCGGGTACCTGCCCG 

Sequence 2429 

CGCGGTGGCGGCCGCCGGGCAGGTACGCGGGGGAACTTGTGTGCTTAGACCTGACGCTGG 

GAGGAGATGCTGCCACCTAGGTTACTTGTAGGACCCTATACGGCAACCTCCTTTGCCAGG 

AACTATTTATAAACATCCTGCAGGAAAATGAGTCTATATGTCAGAATACACATTTCCCAC 

CTTGCCCAACAGTAGAAAAACATAAGAAGAGAAAAACATTAAAAAATGACAAGGAAGTTA 

ATGGAAGCAATGTGATGGTGTTTGGAGGTGGAGCCTTCAGAAGGTAATTAATGCCCTTGT 

AAGAAGAGGCCAGAGAGCTTGCGCACCTTCTTCCTGCCATGTGAGGAGCCAAGAAGCCGG 

CTGTCTGCAACCTGCAAGAGGACCCTCACTAGAAGCTAGCCATACTGGCATCCTCATCTT 

GGCTTT 

Sequence 2430 

GACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCACAC 

ACACACAGGCACACGCAGGCACACACGCAGACACACACACACATAAACACACAGAGTTCA 

CTAGTCCGAGTTACTGATTTTCTTAGGATTCTCAAAGTGACAACACCGGAAACAAGGTAA 

TTCATGTTAAAACACAAGGGTTATATCAGTAAGAGATGGGATCCCCGAAGTAAACCGTGG 

AATTTGAATCAAGTTTCGAAGAGCTAAAAAAAGAAATTGGAGTTTCAACATTCACCTTCT 

TGAATCCTTAAGAAATACAGAAGTTCAAAATAGAAAACATTACAGTTTCAGGATACAAAA 

GTAGAAACATCTGAGATTAAGCCTACATTTTTAAAAAAAGATATTGAAAATTACTGTGCT 

TGTAATACCAGCTTTTCAMCATCAATACCAGTATTGGCATTACCTAGATCAMTCTm 

CATCAAAGTTGAAAAAAATAAAGCATTGGGGATAGAAGGACCATCAAAATGTCCAATATT 

GAAATATTGGTCTTTGCATAGTTCCATTAAAAAATTTACCTGCTCTCMTGGTTGGTCAT 

TCTTCAATTTCAANGGCTTTATTTGGAACAGACTTTTGCATTTTCM 

GGGGGTGGG 

Sequence 2431 

GACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC rTTTTTTTTTTI 
MINIMI IGATGTGTAGCCTTGGGTAAACACACATCCTTCTGTGAAACAGTCTCACAG 
AGACTCTCAGAATCCCAAGAATTTTCTTCAACGTTCTTTTGTTTTGATTCTGAAG 
ATCTGATCTATAAAAGGTTAATCACAGATACATTCATGAGAACATTTCTCCACTATACAT 
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TTTAAAAA CATAT ACAAATCTACT7TCATTCACGAATCTGTGGTTCCTCCTCTGTAGCCC 

GTATATTCTTTTTGTTGATTACAATGACTACTTCCTATAGTCAATCCGTTAGAATTGAAA 

TCTGAAAAGTCTGAAAATTMCATTATTTCATTAGAATGCACAAAAATATAMTTTGACT 

AACTTCTATGAATTATTTCTTCACAAAGCAGTCACATACTCTTCTACTCTGAAACAAGGT 

ATATCCTATATTTCCTGTNCATTCANAAAACTTTAATTACCTACCATCTCTCATTTCTTT 
GGAAATTTTT 

Sequence 2432 

AGGTAC n • i M i u l l l l II I I I I I M | [ 1 NNCGNAAANACCACAAACCACCTNTGAAG 

CCMGAAAAAGACGNGATTAGAGGGAAACACATCAGNACCTTTANTCCTCCACTGACTTT 

GGAAAAACMCAAMTGCCAGCTTCCACCANATCATTTNTTCTTAA 

GATTCTTTTGGAGCACTCMTTrrCTGGTCAGGTCMCAAAAMGTAATAAACATC 

ACACCTCCCTCCCCTGTCTTCCTAGCCCCATTCCTGGAGTTATGCAACCCTTGGGGGCGG 

C ATACACTTCAG CTATTNGCTCTG N GTTC ATAATTAG ATTG N CTAG ACAGGAN ATGTTGC 

TGGAGCAATAGG 

Sequence 2433 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATTTCTTCTAATTGATC 

ATATCTGCTTATTTTCCTCTGGATTAAGGATCAAGGAGATAGTATATTAGATGATTTGAT 

AAAmCCAATGCTTTGCAMGTAGTTGMCM 

TACTAGTCAAGACTTGGTATTTCATAGCTGGCTTTCTTATTCTGAAGGTTGTAAAAAGAC 

atatgaagtmtat7taacattgaaaacatgataatcaatttgattatctatgaattgtc 
ctaaac g ttc ag agtaaa gtttc c tttaaagttaaaatctttc aagtg aag g aaattata 

aagtcacatgtaaacaccaaaatmgaggaatagagcaataggatatttttggctttata 
attcmtttaaamttaaggtgcatt™^ 

tccagtaagcagttgctatctatgattcatgaagatcatggggggttccc 

Sequence 2434 

CGCCCGGGCAGGTAC I rTTTTTTTTTTTTTTTTI I GATGCTATTACTGAATAACTCTTTC 

TGAGATGGTTGTTGTA TAGTA AACCATGAGTCTGGCATGTCTAAAATTGTCTTCATTTCT 

TTCC ATAMmGMTGTTTTGCTGTGTMGTCATTCAAAGTTGAAGACAATGTCCCATC 

TTTTGACAGAAAGGGAAGGAAGGATATGTGCTCAGCCTGCCAGTTTGACCCACTTTTGAA 

AAGGGCTTTGTGGTCTCTACCAATGAGAAGTCTGGTGTCAATGTGATTTTCTTMCTTTA 

TAGGAATTGTTTGTTTCCCTCTTTTCTGAAMCTTGTGCTTTCATCAGGCTCTACCAGCC 

TTGATACTCTGTCACTCACTGGGCATGGNATTTGCACCATCCTTTCAT 

Sequence 2435 

CGCGGTGGCGGCCGAGGTACTAGACGGGGCTGTAATCTGGTAACTGTATGTATTTTAGTT 

CTTCAGTCCCTGGGAAGGAGACAGGAGAAGGTGGGAGGGAGGAAGGGGCCAGCTGAAATG 

GAAACAGATCCCTGATCCGGGGCGGCAGTGGAACCCTTCTTGGTGTGCGAGAGCCTGTGC 

ATTTCAGAGGCAGCAAAAMGTAAAAAAAAAAAAMTTGATCTTTGTTTAGATTAACAGA 

CCCCTGACTATGAAGAAGGAAGGCATCCAGACCAGAAACCGAAAAATGTCTAGCAAATCC 

AAAAAGTGCAAAAAAGTGCATGACTCACTGGAGGACTTCCCCAAGAACAGCTCGTTTAAC 

CCGGCCG 

Sequence 2436 

GACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACCTGAGGATCTG 

AAACT7TGGCGACCAAAAGTCCTGATTTGCGTGAGGCACACCAGTTCACACTAGTTGTCT 

AGGTTTMTAATTAATAATGCTTCTTTTTCATTCTCAAAAGAGCCCTGAm 

AATTATATGGTCACCCTACCTATAGCCAGATATTTCTGCACCAAATGCCTCGATTTTGGT 

GGGTGGCTTTMTTAAGGMGAATGCTTTMGTTTTATGACATAACTCTGTTGTTCTTGT 

CAGGAAGTGATATGTATAAAACTATTAATTCATCAGATTATATATAAGGAGATAGGGTGC 

ACTTAAAATTGCTAGGAAGTTGAGATCTCTAGTTGTTTAAGTTGGAAGTAATTGAAGTAA 

CAGTTGGACAAGCTGTCATGATGGACATAGTAGAAGTGCCTTTTAATTAA 
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Sequence 2437 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACACCATCCCCAAGGACA 

CGGAAGTATTTCTCATCCTGAGCACTGCTCTCCGTGACCCACACTACTTTGAAAAACCAG 

ACGCCTTCAATCCTGACCACTTTCTGGATGCCAATGGGGCACTGAAAAAGAATGAAGCTT 

TTATCCCCTTCTCCTTAGGTAAGCTGGACCCACAGTTTCTTTCCCAGACACCAGAGGGCA 

GGTCCTATCCTCMCTTGAGAAAAAAAATGACAGGTCCTTATTAATTGAGCACTTAATAT 

ATTCCAATTGCTTCACCTGCCTTATCCCCTTCCATCTTCACTACAACCCTGTAAGGAGGC 

TTGA GAAAGAAGATTACATTCCCAAAGGCACATCTGGGCAAGCAGGAACTTGGGCAAGTA 

TTTTAACATCTCTAMCCTCAGTGMTTCATTT^ 

ACCGNTGTAAGCCCAGTGCTGTACCTCGGCCGCTCTAGAACTAAGTGGATCCCC 
Sequence 2438 

CCGCGGTGGCGGCCGAGGTAC I I I M I I II i I I 1.1 1 I tl 1 I I GCTGATATTGAGCCATAG 

AGCCAAGAAGATTGCTAATTTTGAAGTGTGCTTGGTCGGGCTTTCTACTGATGATGTTGC 

TATTGATTCAAACAGTCACMGTCTTTAGGATCATTTTTAGAGCAATGGCAAGAGTAGGG 

CTGAAATGAAAATCAGTCTGAGATAATGAACACTCCTGCTCAGAGATCATTATGGGCTGA 

GCTGTGCTGGGAAGGCACTGAGCCTTGGGGGGCTAACACTTGCCTCAGTGGAGTTAGCAC 

TCAGTAACCAGGCAATCACCTCGATCCAAGGCTTTCAACTCTCGTGGAATATCTGTCAGT 

TTGTC AGTT CAGCAAATAACAAGTGTTTTGTTTG NTTCTTTTTTG NTTTCCCCCGG AAC A 

TGGACTTTTAAMCAGCTTMTTAAAAACTATGGCAC 

GGCAAAGACCACTGGGTAGTTGGTGAGAAGCTTATNAAAACAGCTTGCTTTCTCCTGAGG 

GGGGAAGGCTAACNTCCATGGGGGTATGGGAC 

Sequence 2439 

CCCGGGCAGGTACGCGGNGGAGGAGATGCTGCCACCTAGGTTACTTGTAGGACCCTATAC 

GGCAACCTCCTTTGCCAGGAACTATTTATAAACATCCTGCAGGAAAATGAGTCTATATGT 

CAGAATACACATTTCCACCTTGCCCAACAGTAGAAAAACATAAGAAGAGAAAAACATTAA 

AAAATGACAAGGAAGTTAATGGAAGTCAGCAATGTGATGGTGTTTGGAGGTGGAGCCTTC 

AGAAGGTAATTAATGCCCTTGTAAGAAGAGGCCAGAGAGCTTGCGCACCTTCTTCCTGCC 

ATGTGAGGAGCCAAGAAGCCGGCTGTCTGCAACCTGCAAGAGGACCCTCACTAGAAGCTA 

GCCATACTGGCATCCTCATCTTGGCTTTCCAACTTCCAGAACTGTGAGAAGTATATGTTT 

GTGGGTTAAGTCMTGGTCTATGGTAATTTTTTTATAGCAGTCCCAGCCAAAGACAGTGC 

CTNATTTACTACATACCATTTATATTATTATATAGGCTCCTTTCAGAAACCCATGTTCAA 

ATAAAGAGATAAGGATACTGAMCACATAACACCTTTCACTAGTTTTTAGTATACAAATA 

TT GGAGAAA TAGGTTGGTATTAACTATCTCATNCAAGAAATGCCANATTCATGGTGGTTC 

TAAi mm I AT ATATTAATTGACAAAATGAAAAAAACTTAACCC ATN CTAG ATTTTAGCT 

GCCCAAGGAATGAAAAGAATNAAAAAAAAATTTTT 

Sequence 2440 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACAGGAAGAAGTTCCTTGT 

GCTATTTCTCACAA 1 \ i I I CTTTMGTTTGAAACTATTTCAAAATGAAGTTATATAACTA 

CATGAATTGTGAGTTAAAATGCAAGCAAGATCTTCATGCCTGTAATCCCAGCACTTTGGG 

AGGCTGAGGCAGGAGGACTGTTTGAGCCTAGGAGTTTGAGAACAGCCTGGGCAACATAGT 

GAGACCCTCATCTCTACAAAAAATTTAAAAAATTAGCCAGGCATGGTGGCTCGTGACTAT 

AGTCCCAGCTACTCAGGAAATGGAGGTAGGATGATCCCTTGAGCCCAAGAGGTGAAGGCT 

GCAGCAAGCTGTGATCACATCACTATATTCCAGCCTGGATGATAGAGTGAAACCCTGTCT 

CAAAAACAAATAMGCCTMGGGMTATGCAGATTAATGGAAGAGAAGGCTATTTTGTGC 

AGACAGGAAACAGATGAMGACAATTTGACTTTTAAAACTATGAGTAAGGCTCATGCCTA 

TAATCCCAGCACTTTGGGAGGGCAAGGTGTGAGGACTGCTTGAGCCCAAGAGTTCGAGAC 

CAGCCCTGGCAAAACAGTGAGACCCTATTTCTATNAAACATTTAAAATATTAACCAGGTG 

TGGTAATGTGCACCTGTGATCCCCAGCGACTCGGAACCCTGANGGGGGGAAGAATGCTTG 

AGCCCTGGGANGGCAAAGGCTTGCAATGGGNCCGAATNATGCCACTTGTANTCTTGGNCT 
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NGGGGACAAAACCAAGACCCCTTGTNTTAAAAGAAAAAAAGAAAAAAAGCC 
Sequence 2441 

ATAGGGCGAATNGGAGCTCCACCCGCGGTGGCGGCCGCCCGGGCAGGTACACATGTCCAA 

GGTCAGGTCCTGGGTGGTAAAGGTAAATACAAATTGGAAGGGCACTGTGTGAGCCAAAAT 

GAGTCAGATTAGTCATGATTCATTTCCAGTTTGGGTTTTGGGTGGTCTTGGAGAATGTTG 

TAAGCACTGCTTCATTGATAGGTTGATTGAGCCAGACTTTACTCAGCAGCCTGGAAAAGG 

AGAGATGGGCTCTGGGTTCTACCTTTGCTCACTGGTAAGTTGCTMGATTTCAGCCTTGC 

CCTCAAACCCTGAAGTAGTCCTTCATTCACACAGTGGGATCACTCGAAAATGTCAGATGG 

GGAAGTCCATAGGTTGTTACTTTAMGAAAATAGAAAAAATGCTGGAAAAGGTTTCTTCA 

ATTTTMTACCCATGAAGGCCCATGTTTTAGCTTTCCTCCGATGGGCAAACCATACACTA 

ACTTGGGCCTTGTAATCAACAAGCMGGCTAAAAGCTCTCTMGTGCTTGCTGTTTAAAC 

TATTTTGTGTTGGAAGAAGAGTTGGAAAAGAGGCAGTGTGGAGGTGAGGGGAGAAAGTNC 

CTNCCTCNAT 

Sequence 2442 

CCGCGGTGGCGGCCCGAGGTACGCGGGAAGGAGATGCTGCCACCTAGGTTACTTGTAGGA 

CCCTATACGGCAACCTCCTTTGCCAGGAACTATTTATAAACATCCTGCAGGAAAATGCAG 

TGAAGTAGAAGAGACAGGGATATCCCAGAAGGTTATGCAAAACATCAAGAGAAGATGAGA 

GGAGTCTATATGTCAGAATACACATTTCCCACCTTGCCCAACAGTAGAAAAACATAAGAA 

GAGAAAAACATTAAAAAATGACAAGGAAGTTAATGGAAGTCAGCMTGTGATGGTGTTTG 

GAGGTGGAGCCTTCAGAAGGTAATTAATGCCCTTGTAAGAAGAGGGCAGAGAGCTTGTGC 

ACCTTCTTCCTGCCATGTGAGGAGCCAAGAAGCCGGCTGTCTGCAACCTGCAAGAGGACC 

CTCACTAGAAGCTAGCCATACTGGCATCCTCATCTTGGCTTTCCAACTTCCAGAACTGTG 

AGAAGTATATGTTTGTGGTTTAGTCAATGGGTCTATGGTMTTTTTTTATAGCAGTCCCA 

GCCAAAGACAGTGCCTCATTTACTACATACCATTTATATTATTATAT 

Sequence 2443 

CCGCGGTGGCGGCCCGAGGTACTTGCAAGAAACCTCAGGACTTGAGTAACAGCAACATGG 

TTCTTCTGAGCTATGAAGGGGCCCAGATTTAAGGGCTATTTTTGACACCCTAAATGTGCT 

GAGACAAGTCATTAAGGTGGTCCTGCCAGGACACAGCCATCTAAAGCAGCAATCTGCTTC 

TTGCCAGAAAATCTCGTGCCTCTGCAGAGCCTTTTCCAGAATGAACCACACCATGCTGAG 

GAAAGGAGAAAGAGACCACCTACTGCATTTCTGTCACTCGCTGAAAAGGACACTCTGTCA 

GAAAATCTTCTAGCAAACTTCAAAGGGCAAAATCACCCCTTGTTACTGATAAAGCCCAGA 

GAGCTTCAGCAGCTAACATTCCCTGGACAGGGCACAGCAAGGATTTGAACCTAGGTCAAG 

TCTGGCCAGAACACCCACAAGCTTTCCTTAACTCAGTGTGCTATCTCCCCACGACTAGGT 

C^CTACTGCTTTATAATCACCTTTGTAGCCACCAGTGGATTT 

Sequence 2444 

GGTACATATTCTTACTTCTTGTGTAGGAGGCAAAAGGTATCTGCAGGGCTCTGGGCTTGC 

TAACCAAACCGTTAATCGCTTAGAAAC AGC AN GAN NTN NACTAGTG AGATGTTTATC ACA 

TACCTGGACACGAGTTGACTTTCTGTGAGACATCATTCTGGAATGAAAGCCAAAATCTCT 

CTGCTATTCAGTTGAGGCTGAAAAAACAATCACATCAGAATTAGTAAGGCCCATNACAAN 

GAGAGAGAAATGAAAACAAGCCAGTATTCAATGGGGCTAGGGGGAAAATTACCTGGGCTT 

CAGGAATTCAGGAGAGTTGGGGTTTTGAGAAGGTAAAAAAGTGGGCCAAGTGCGGTGGCT 

CAAGCTTGTAATCCCAGCACTTTGGAACGCCAAGGCAGGTGGATCACGAGCTCAAGAGAT 

TGAGACCAGCCTGGCCAACATGGTGAAACCCTGTCTCTACTAAAAACACAAAAATTAGCC 

CGGG 

Sequence 2445 

CTNCCCGCGGTGGCGGCCCGAGGTACGCGGGTTTCCTCAACATGGCTGCGCCCTTGTCAG 
TGGAGGTGGAGTTCGGGTGAGTCACAGAGCTGGGGCGCCGTGGGGATGGAT 
Sequence 2446 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTAAACAAAACTAAAAATAAA 
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AAGTCAAAAA TTGGAG ATGAMTGTATGTTTTGAAACACATTATTTTGGATCTTAAAAGA 

TTTCCATGATATTTTTTGAAGGTGTAATATTAAAATGATGTCTC i 1 1 1 H CTGTCTCATT 

CCAAAGAACCATAAAAAAGCAMTTAAMCAGTGATGAAACAAuA^CGCTTACAGGTTTGT 

ATCTCAGTGCATCTCTGTAGGATCCAAACAAAGCAGCCTGTGCTCTAAGAAAGGCCCTAG 

CTACTCCATCACCCGTAGCTGTAGACTGCTTCTTCAAGT7TATTTTTCAAGGCCCCCGCG 

TACCTGCCCG 

Sequence 2447 

AGGGCGGCCGCCCGGGCAGGTACGCGGGAGAAACCGTCTGAACTATCCTACCCGCCATCA 
TCCTAGTCCTCATCGCCCTCCCATCCCTACCGCATCCTTTACATAACAGACTNAGGTCAA 
CTATCCCTCCCTTACCCATTCAAAATCAAATTGGCACACCCAAATGGGTACCTGGAATCT 
AATTTATTTCTAAAAATATTTTCCCCACCTC 

CNTGGATTGCCAAAATTANTGGGTTGGAAATAAGGTTGGGCCGGMCTCTTCATTNTTAA 
AACCTTGGGGGGCCTTAAAAMTTAATTCCTCTTTTTGGAACCAAGNCC I rTTTTTTTTA 
A NTACN TGGGNCNCCTNCCTTTCTCNAAAGNGTAATTNTNTNTTNTTAAGNNTTCCTTAC 
Nllll I AANGNCCNTTTNCTTTGNTTGGGGNCNCCAAATTTGTTTCTTTGGGGTAAGGGG 
NAAANCCTTTGTMNGGGNANTTGGGGGAATTTTTT 
Sequence 2448 

GGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGCGGCTTCTTAGCTTTAC 

GATGGCAACAAGTATGGCGGCTGCTAGTGGTAGATTTGAAAGTGCGAAGAGTATCNAAGA 

GCGGAAAGAACAGACCCGGAATGCCAGGGCCGAGGTGTTGCGCCAGGCTAAAGCCAATTT 

TGAAAAAANAAGAAAGGCGTAAAGAACTTAANCGACTTCGGGGTGAGGATACATGGATGC 

TACCTGATGTGAATGANAGAATTGAACAGTTCTCACAGGAACACNTCTGTGAANAAANAA 

GAAAGAAAAAAATGACCNAGCTTTTCNAAAAAAAGCNATGGAAANGGAAAAA 

Sequence 2449 

CGAGGTACTTACCTACACAAGTTTTACTATCTTGGMTGTTCTAGAATTCCAGTTAATAA 

GGTGAAAGTGATAAACATGGCTCCATTTAGTTGTTATCTCTAAATGATTTAGAGAAGTGT 

TTGCCAAATTATAATGTGCATATGAATCACCCGGAGATGTTGTTAAAATTCAGGTTCTGA 

TTCAAAAGGCCTAGGGTGGGGTCTGAGATTCTGTGTTACTAACAAGTTGCTAACTGAGGC 

TGATGTTGCTGGTTCCCAGATCACACTTTGTTTAGTAAGGATAAGGGCAGGTTTCCAATC 

TTGGCTGCAATTAGAATCACTTGGGAAAATTAAAAACATCTATATCTGGGTCCCATCCTA 

CCCCCCAGAGATTTGTCTTTMTTAAAGTGGAATGTGGCCGGGTATCAGGATTTTAAAGA 

AGCTCTCCAGCTGATTTTGGCCGTTCAGCTCACCGTTTGAACATGACTGAGTAGAGAATG 

GACCCTGGAAGATGAAANNNNNNNNNNNNNNNNNNNNGTACCTGCCGGGCGGCCGCTCTA 

NAACTA 

Sequence 2450 

AGGTACTCTCCCAGTGACATGCTTCTTGACCACAATGGATGAACTGTGCCCAGCATGCCC 

ACTTTCCAATGCTCCACTGATCCCCATGTTXTGTTTCTGAAGGACAACCAGCCTTGGAAT 

AATGGCAAATACCTTCTTAAGTTCCTACAAAGTATGGTCCCTGGGAAGTTTATGTCTGTA 

AGTCAAACCTTGGGMGTMCTGAGTTTTGATGCCTCTTCCAGCATCATCAGCATCATGC 

TATTACAATCCCAAACCATGGGGGTTTCTCACAGCTTTACACCAAAGGGCATCACTATCC 

CTCAAAGAGAGAAACCT 

Sequence 2451 

CCGCGGTGGCGGCCCGAGGTACTTTN ITTTTTTTTTTTTTTTTTTTl NACTGCAACAACC 

TAACAGGAAAACTAATCAGAGATCATGAGGAGAGTGGTATATCAGTCAGGGACTAATCAG 

GAACATAATCCACTTGAGAATTTAAAATAGAGAGGGCTTAGGCTGGGGAGTTGGTGACAG 

CATAATGGGAAATCTACAAAGCAAACAGGAAtAGGGAGACAACCCANAGCCCATCTAGGC 

TGGCAGAGTCATCATNTNTAGGCTGGAAGGTCAAAAAAAGGTGG 

Sequence 2452 

TCAGTTTGGTTTGTTTGGTTTTGAAAAAGTAGG I I I I I I I I I I AGTATGTGATGAGCTTA 
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TTCTTAAGTGAATTAATAAATACTTTTTAGTATCTGTTTTAATTCTAACAGTATGAGTGT 

TGGTAGTTATATTCCAAATAMCAAAMCTCTTAGAGTCCTCAGTAATTTTTAAGACTGT 

AAAGGGGTCCTGAAAATGATTAMTGTTTGAGAATAACTGCCTCAGAACATTGTTCTTCA 

ACCCTAATCAAACCCAGCACGGCATTTATATAAAAAATGATTTTATAATGCCCATTTGCT 

AATCCTGAAATAAAMTACATAGGTAATAAGAATGACTTANCCAATTTCAAAAATATCAA 

TGGAGGGAATACTAGTGTTTGGAGACCTAACTATACAGGAGAAATAAAAGGAAAGTCATT 

TTTTATAACATTATGCATGCATGTATAAAGCACTCAGATCTGATGACATTGGGAGACACG 

GTGGCATAGCCAGATCCTTACACCTGTTTGTGGACTCACCATGAATACAGCAAGTGGCCA 

MGCAAACCCCCNCGCTATAACTGATAGGAATTTCATTGGCTTNGGTGATGGTCATTTCT 

ANAATGGTAAGAAAATCTTGGTAAGGTTAAATGAACCAA 

Sequence 2453 

CCGCGGTGGCGGCCCGAGGTACGCCACAAGGCATTTAATGCCCACAGTAACAGGGCTGTT 

TGACAGTGGCAGAAGAGGACGGGACTAAAGTTACTTTGTGCTGAGAGGGGGAAAGAAGCA 

CAAAGTTTGGTCTGTTGCGTAATTGAATTTTTAACACTCTTATCCACAACAAACA 

TCGTGTCCTGCTGTGTAAAAGACATCAGATATATTACAGATTTTCAAACAGGCAGGTATC 

AACATTTACATTGTAATTCMTAGACGCTACTACTACAAAGGAGCtTTATTCTTCCAA 

TAATATGGTTGCTGCGGGAACACTGCAGGATGAMCTGACTTTTTTTGTGGTMTGTTTC 

TCCTGGTTGGGAACAACGCAAATTAGCAAAGGCGTAATTTATAACCGCTGCTACCAAAGA 

GTGAAGTATTTGTGGAAGTTCTGACTGGAATCTAACTCGTACCTGCCCG 

Sequence 2454 

CCGCGGTGGCCGGCCCGCCCGGGCAGGTACTTTN nTTTTTTTTTTTTl I NTGCTNGGGG 
NCTTTTTCCCCTAGGATTTCTCCAATGAACACAA 

ATAAAACACATTGAAAAAAMGTAAAAGTAGGTGAAAATGTCAATGAMTTCTGGTTTGT 

AGAACCATCAAAACTTGGAAAAGTTTAATAAGACCACCANNAAGAAGAAGAAGGGCAAGT 

TTCCAAACTGATTTCACAGGGCCAGGAGGGCAGGCCACTGANATAGGTAAACCATGGAAG 

CACAATCAGTCCCATCTCATGGATATTCTACCTGGGGCTTTTACTTTGCGTGGCTCAAAA 

GCATCTGAGTNTTANACCAACAGAAAGATAACTCCNGAAGAAAGATGTAATAGTGTCATA 

TTTGTGTAATTGATTGATCTAAMCTTGGTTGATGAAAAATCCCTGGGACAACAGCAAGA 

ANAGTTGCGTTCCAACCCCCCGAAGTNCTGCTGCTTTCTGAACTGCAAGTATCATATCTC 

TTGCTTTTGATTTTA 

Sequence 2455 

AGGTACTCTTTCTAGTTATTAAAAAGAAAAGTTAACTGTAAAACAGCCTCAGGCAGGTCC 

TTCCGGAGCTATCCAGAAGAAGCCATTGTTATCATAGGAGATGGCAGCTCCATGGGTGTT 

ATTGCCCCTGAAGACCTTCCAGTGGGACAAGATGTGGAGGTGGAAGACAGTGACACTGAT 

GATCCTCACCCTGTGTMGCTTAGGCTACTGTGTGTATTTGTGTCTTAGTTTTTAACAAA 

AAGGTTTAAAAATAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAM 

Sequence 2456 

TANGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGACACCCTAGATCC 

CAAGATCTCCAAGGATTTGGTGGCATACCCACTCCAGCACACAGAAGCATGAGGTTCATG 

ACTCTCCTNTTCCTGACAGCTCTGGCAGGAGCCCTTGTCTGTGCCTATGATCCAAAAGCC 

CGCTTTTGCCCMGATCGGGGAACCCCTTTGCCATTGAAGCATCATCCAGCTTAAAAAGG 

AAA 

Sequence 2457 

AGGTACGCGGGAAGAAGGGTCAGCTCAGGGTGTTGCCATGGGAATGAAATTCATATTGCC 

TMCCGATTTGATATGMTGTGTGTTCTCGAtTTGTGAAGTCCTTAAATGAAGAAGATAG 

TAAAAATATTCAAGATCAGGTTAACTCTGACCTGGAGGTGGCATCTGTCCTATTTAAAGC 

TGAATGCAATATCCATACATCTCCTTCTCCGGGAATTCAAGTAAGGCATGTCTACACCCC 

CTCTACAACAAAGCATTTCTCACCCATAAAACAGTCAACCACTTTAACCAACAAACACAG 

AGGAAATGAGGTCTCTACCACACCTCTGTTAGCAAATTCTTTG 
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Sequence 2458 

AQ GTAC GCGGGGAGGCCATCTCGCTATAGGAAAGGAAAGTGGAACAGCATTCATCCTCAA 

CATTTTACGAAGACAAAATGAAGACTGGAGTAGAAGACTGATCAGTGCAGCATCATCAGC 

ATCATGCTATTACAATCCCAMCCATGGGGGTTTCTCACAGCTTTACACCAAAGGGCATC 

ACTATCCCTCAAAGAGAGAAACCTGGACACATGTACCTGCCCG 

Sequence 2459 

CCGGGCAGGTAC I 1 1 I I N H I I I I 11 I M I M i G 1 M 1 I I I II M H I I I I U 1 1 I I t 1 
rrTTTTTTTTTTTTTI I AAAAAAAAAAAAAAATNAAATTTNTGAAAANGNGGNAANTGGA 
ANTGGTAANNTNAAACCAAMNTNTGGAANTAAAAAMNTTNAAAAAAAATTAAAANTTG 
NGAATTGAAAAAAAAAA 
Sequence 2460 

TCGACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGAGAG 

CTCTTTGTGGCTTGGGTGGTCAGCMGAAT7TTTTAAGCAGAGATACTCTTGATTGGGTT 

GGATAAATAAAAGAGCAAGGAGGACATTGTAGGAGGGGAATTGCTCCCAGCAGAGCCTAG 

MGCAGAGAGATTTTCCAGGGTTCCCAGGAGAAGAGATTGAATAAGTGCAGTGGGCCATG 

TGGTGACACATCTTGAGTGTCCAGCTGGAGAACTGGAAGTGCATATCATAACAGGGGTGG 

AAAACTCAAGAGGTTGATAGGGAATGGTTGAGGTTGCAATGAGGGGACCCGGTGGGGGAC 
TTT 

Sequence 2461 

TTAACGCATTTACTAAACGCAGACGAAAATGGAAAGATTAATTGGGAGTGGTAGGATGAA 

ACAATTTGGAGAAGATAGAAGTTTGAAGTGGAAAACTGGAAGACAGAAGTACCTnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnNAGGAAATTTCGATATTCAAGNCTTATCTGA 

TACNCGTCGACNCTCTGAAGGGGGNGGGGCNCCGGTACCCAGACTTTTGTTCCCTTTAGT 

GGAGGGGTTTATATNAGCCGCCGCTTTGGACGTTATATCATGGGTACANTAGCTTGTATT 

ACANTGGTGGTGANAATTGTNTNATCACGNTTCACCAATTTCTCANCACAAACNATTACN 

NMGTCCCGNTNAGTCNATTAAAAANTTNGTTNAAAGACCTGGGNGGTTGNNCTTAAATG 

GAAGATNGAGGCCTTAAACCTTTCAACTATTTTAA 

Sequence 2462 

TCCGGCGAA I I i I I CTCNCNANNGCGGCGGCCGTGTGGCGCANGGAGGACCCCTTACAGT 

AACCACAGCGTNTTTNAATCCGAAAGGGAAAAGACCCCGCGTACCTTnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnGCCCGGTTCCCTANNTATTTGTTTCCCTTTAGTGAGGGGTTAAATTGCGCGC 

TTGGCGTAATANATGGTNCATAAGCATGNATTCCTGTCGTGAAATTGTNATCCGCTTAAC 

AATTTN CACACAAACAN AC NGAGCCTGGGAGC ATAAAAAGNTG ATAAAGCCTN GGGG N GC 

ACTAATGAGTTGAGCTAAACTTCACAATTAAAT 

Sequence 2463 

TCCACCGCGGATGGCTGGCCGCCCGGGCAGGTACTCTTTCCTGGATTTACCATTTATACA 

AAGAAAATTCTTACCCAGCTGGAAGAGGAGGAACAAAACTTTTTACTATCAGGGCACAGG 

CAAAAAATNCATTTGTGCMGAGAGACATATAAGCAAACATGCATTTACAGTGTAGAGGA 

ANNGGGGACCNCAGTA I I I I I I I \ CTCCTGGGAANNCTTACATTTTTAATTAAATNNCNC 

CTTGNAAAATTTTGTAGNANTANTTAANANGCCTTCCNTTGATTTTCANATC 

NACCTCNNGCCAATGNAATATNCAAAACAACNTTTCANGGTATGTTGGGGAATNGTCNCN 

ACGATTATGTANAAGTAAAAAAGTGGCTCCANGTAAAAAA 

Sequence 2464 

CGAATTGGACTCCACCGCGGTGGCGGCCGCCCGGGCAGGTACCTTAAAAAAAAAACATCA 

GTTNTGGGACATAACAAANTAAATACTATGGAGAAATGGTATCTGGACAGGAACAGAAAT 

GTCCACAACTGCNAGGGATTTTCTTTTACACTTGGCCACAGAGTCGTTAATTGANNCCTA 

CCACTCCTTNAAATATTGGGGATTCTCTTAATTAAGGTT 

Sequence 2465 
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CCGCCCGGGCAGGTACCTTTCTTTGGAATGGGTTTGGTTCGACCGTCATACTTNTTCAAG 

GTGATATANACGCTGCCCGACGTTNCGGCACTTTCTGGAAAAGTNTGGTCAGCTCCGTCA 

GG AAC ATGCTCN NTTCTNC AAC AACACC ATN C NCGGN N N GACCCN N NGN NTACTCNTGGG 

CCGCTTTNTAGAAAACATACGNTGTGNATTNCCACCGGGGCCTGNCAAGGAAAATTACGT 

ATNATTNTAAAGGCNTNTATTCNGNATTACCCCGATACGTAACNCTNCTGAAGNGNGNGG 

GGGGCCACACGNGGTATCTCCCAANACATTANTTNTGGATTTCNNCCCTNTTTAAGTTCG 

GMGGGGGNTTTTAAAANTTTNGNCNGNCNGACNTTTTGNGACCGTTNAAMTNTCAATT 

GGGGGTTCNCAATNACGNCCTTTGNTATNTACNNCATCGGTGGGNTGGAAAAAAANTATT 

GNTTTTAATNACACNGGNTTTCCNAACNAAMTTATTNCCCAACTMCCAAMCNTATTT 

AACCCNTAAAGGCCTCNGGGGNGGGAAGGACNAATTAAAAAAAGTTTG 

Sequence 2466 

CTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGATTGACCCA 

AAACCTGGATGTCCTATCTGTGTGTTCCAGTAGTGCCCTATTTTAAATGTGTCATGATAT 

CTATTACTTTGTGTGAAAATTACCTGTTTGTTTTTTCAGGTTATATAATAGTATAACACT 

GCCMGGAGCGGATTATCTCATCTTCATCCTGTAATTCCAGTGTTTGTCACGTGGTTGTT 

GAATAAATGAATAAAGAATGAGAAAACCAGAAGCTCTGATACATAATCATAATGATAATT 

ATTTCMTGCACAACTACGGGTGGTGCTGMCTAGAATCTATATTTTCTGAAACTGGCTC 

CTCTAGGATCTACTAATGATTTAAATCTAAAAGATGAAGTTAGTAAAGCATCAGNAAAAA 

AAAGGTAAAACAAAATTGCTCC 

Sequence 2467 

CGCGGTGGCGGCCGAGGTACGCGGGGGCAAACGCCGGGAGTAGCCGAAGGGGACTGCCGG 

GAACAGGAATTTCTTCACATGGCTCCTGGAGAAGTGACCATCA 

Sequence 2468 

CCGGGCAGGTACCCGGGCGTGTGAGGTGTNAGTCTGCCCCTACTTGGGGGTGCCTCCCAG 

TTAGGCTACTNGAGGGTCTGGGACCCACTTGAGGAGGCAGTTCTGTCCGTTCTCAGATCT 

CCAGCTTGCGTGCTGGGTGCNTGGGGAGAAACCACTACTNTCCCCGCCGTTACCTNAGGC 

CCGCTTCTTAAGAAACCTANTTGGNATTCCNCCTCGGGTCTTGTCATGGTAATTTCCNAT 

TATTCAAAAGGCTATATTCNGTATAACNCCGTGCTGACNCTTNGNAGGGGNGGGGGGGNC 

ACTCCGGGNTACCCCCAAGNCTTTTTNTGGTTTCCCACTTTTMNTTGNAGNGGGGTTTA 

AAATTAGTCGACCGCCCTTTGGNCNGTTANAANTCAATTGGNGTTCAATAAAGGCCTGGA 

TmCCNTTGTTGGTNGGAAAAAANNTNGTTNTATTNTCCGNCATTCMCCANANTTm 

NTCAACCACCAAAACCNATTANCCGTAAGGCCTCCNGGNGNNAAGTCCAATTAAAAAAAT 

NNTTGGNTAAAANAAGGCCNCNTGGGGGG 

Sequence 2469 

GGCCGCCCGGGCAGGTACGCGGGGANTTNTCTACTGGCGAAACCTGTATCCGGGCCCAAC 

CTGAAAACATCCCAGCCAAGAACTGGTATAGGAGCTCCAAGGACAAGAAACACGTCTGGC 

TAGGAGAAACTATCAATGCTGGCAGCCAGGTTANGAATATAAATGTANNAAGGGGAGTAG 

ACTTTCNCAAGGGGAAAAATGGCTTACNCCAAANCTTGCCTTNCATAGNCGCNCTGCTGG 

GCCAACTTATGCCCTCNTCAGAAAACAATCAACNCTTACTCACATGCAAANGAANCGGGN 

CATTNGGCAATTAACAATGNGGATTGGATGGTAGTACTTGGGCAAAACNCATGAAAAAAA 

AAGGGCCTNG N ACAAATATN CTTAACC AAGGGGGNC CTTCNCTTAAATAGGAATTGNTAT 

NGGAAAACCATTNGGNTTGGCNTTGGAAGGGGGGCCAAAACCCAAGGCNAAGTGGGNNTT 

CCACCTTTTTANCCAACCCTNGGTGNTCCNTNTTGGGTAANGNNAAATGGGGGCNNTNGG 

N CTTCCCTN N AAAAAA 

Sequence 2470 

AGGTACI I I 111 I II M H U I I U M H M I CTAAAAGCAAGGNGATCTGTGATATTTT 
CGTCCTTTTGTATGATTCTCTACTTCTTATTCAAGACCTGTGATTAAGTTGNGTGTATAT 
TGTCTGTAAAATGAAACAAAAGTAGGTCCTAAACATTGTAAGAATATGGTATACTAAAAA 
GCTTTGAAACCAAAAGAAAATTTAAAAAAACAACAACAACAACAATGTAGAGTACCTCAT 
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ACCTMCAGACACJATAGAAA^ 

ATTTTTCACAAAAATAGATTCCTTAAATAANACATGTTGTCTAACAGGTATTTACA 
Sequence 2471 

CCGGGCAGGTACAAACAGGTGCTATCATAATTACCTCTGGGCAACAGGCAAAAATCAAGA 

TTGTCCTTAGTAAACCAGGATTCATACCAGGTCATCTGATGTTTAGTATAGCACTCACCC 

GACTCTGCCTTACATTGCTTATTTAAMTGTCTGCCTCCCCTTCTAGGTTTTATAAAAGT 

TCTTGGCTCACAGTA ACTCTT AGTATMGTTTCTGAAATAAACACTTTGTTATCACTTTT 

GAATTGATGGCTTACATTTTTGTTTTGGCATTTAACT^ 

TGCATTTCAACTCATCTGCAAGCTTATATCCCTCAAAGTTTGCTGCTGGGC 
Sequence 2472 

AGGTAC TTT i i i u i n l l II I ll I I I I t M rACATGTATCAATTTATTTACTTTTGGTT 

TTGCTC ATTTTT ACTGAAMTATGATTATCTATGTGGTGTCATTATTTTTCATCTCAAAT 

AACTTCATTTTTACATAGTGTAGTCCAGCTGGAAACGTTTATTTCATGTTTAT^ 

ATGATATTTTCACTGGATACAGAAACCTGTCTTCTGCCTCGCTCACGCTGGGAGCTGTAN 

ACTGGAGCTGTTCCTATTCGGCCATCTTGGCTCCTCCCCACTAATGCTGTTCTTCTAGAC 

ATGTTTGAAATAGGGGCATTGCTATAATGAATGGCTAGCAAGTAGGACAA 

Sequence 2473 

ATTGGAGCTCCCCGCGGTGGCGGCCGN C NGGGCAGGTACTCTAC ATACCTGTATCTN ATT 

TCATCCTAACAATAGCCACACAAGCTAGACATAATTATCCCATTTTCCAGGAGAAACTGA 

AGTTCANAGCATTTAGTAACTTCCTCAAATTTATACACCTNAATAAGTGGTAGAGCCAGA 
ATTTG 

Sequence 2474 

TACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACGCGGGAGGTGTGCTG 

TGTGTGAGGCGCCTGCCATGGTGATGGCCGTGCACAGCCAGACCATTCAGATCCCACCGT 

GCCCCAGCGGGTGGTCCTCGCTGTGGATCGGCTACTCTTTTGTGATGCACACCAGCGCTG 

GTGCAGAAGGCTCTGGCCAAGCCCTGGCGTCCCCCGGCTCCTGCCCGGAGGAGTTTAGAA 

GTGCGCCATTCATCGAGTGTCACGGCCCGCTCTAGAACTAGTGnnnnnnnnnnnnnnnnn 

nnnnnnnCGATATCAAGCTTATTCGATACCCGTCCGACCCTCGAGGGGGGGGGCCCCGGT 

ACCCCAGCNTTTTGGNTCCCTTTAANTGAAGGG 

Sequence 2475 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGAGAGGAGATGCTGCCACCTAGGTTAC 

TTGTAGGACCCTATACGGCMCCtCCTTTGCCAGGAACTATTTATAAACATCCTGCAGGA 

AAATGGTAGCTGAGACCMTGATGCTGACCTCCCTCAAAAGCTGCATTTCTGMTTTCTG 

AAGGCAAACTGTCTGCCTATATTGTACAGCAAAATAATTTCCTGTATTAAAACGTnTTA 

TTGCTCTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGTACCT 

Sequence 2476 

ACTACTTAGGGCAATTGGAGCTCCCCGCGGTGGCGGCCGGGGGTAGGATGGGAAGCCGTG 
GGGCAAGGGAGGTTGCAGGAAGCCCATCCTTCCCCTCCTGCGGCAGGTACCT 
Sequence 2477 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCCTGCAGGT 

AGGCATTTAAATTGGTGCAACCGCTTTGGATTGCTGCTTTGTAGTATCTGGTGCAACTGA 

AGATGAACATGCCCTGTGACACAGCAACCGCACTTCTAGGTCAATACCCTAATTATATTC 

TTACTGTGGTTCACMGAAGGTATGTAAGAGGTCATTGCCTGAGCACTGTTTAGAATAGG 

GGCAAACTGGAAATCCTCTAAATGTCTGTCAATGAAGGAATAGATAAATTGTAATATGTT 

CATATAAAATACTGCATAAATAAGTGAAATTTATAAATGTACCTCGGCCGCTCTAGAACT 
AGN 

Sequence 2478 

ATACGACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACT 
GCTGGTTCGGGCTGCCATAAAAAGCACCACAGACGACATGGCTTACACCACAGAAACTCA 
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TGTTCTCCCAATCCTGGAAGCTGAAAGTCCAAGATCAAAGTGTTGTCTGGGTTGGTTTCT 

TCCAAGACCTCTACTGTTGGCTTACAGATGATTGTATTCAGTTCACATGATGTTCTTCTA 

TGAGTGCCTGTGTCGATTTTCTGTTCTTATAAAGATATCAGTCAGATTAAATGATGGTTC 

ACCCAAATGACTTCATTTCACCTTAATTACTTCTCTAAAGGCTCTATCTCCAAATACGGT 

CACATTGTGAGTTTCTCAAGGGTTAGGACTTCAACATGTATTATTTGAGGGAGTGGAACA 

CAATTCAACCCCATT 

Sequence 2479 

CCGCGGTGGCGGCCGCCCGGGCAGG TAC \ III! IN HI III I I II II II1GI III 11 II 
M l I t M I I I M II I H I I II II N M I I I 1 1 I I 11 1 M I I M I I I H I 1 I I 1 M M II I 
rTTTTTNNTTTTTTTl iNNNTNNTM I I I I I I II M ICCCCANNANAANCCCNTNNTNNN 
NNAAAAAAAGAACGTGAGGACAGGGCTCCTGCTGCGCAGGCATAAAGCATCCAAGAGTCT 
GCACATACATGCCACACACTAAAAAAAAAAAAAAANTNNAGCNCNANNNNAANTCTGGGG 
GGGGGGGGAAAACTTAAAAAAAAAAAAAANNTTTTNCNAAAANANAAAAANGNNNNTAAA 
AAAAAAA 
Sequence 2480 

CCGGGCAGGTAC I I I I I I 1 M I II I I I I I I 1 I I I I I I G I I M I M i I 1 NTTTNNTTNANT 

N ANTN NTTTNNTTAAATN ATN AAGG AAN AAN N AAAAAAAN AANTN AAAAAAAAAAAAAAA 

AAAAN N AAAAAAAATTN N AAAAAAAANT AAAAAAAAAAAATAAAAAAAAAAAAN N AAAAA 

AAAAAAAAAAAATAMTAAAAAAAANNTAAAAAAAAAAAAAAAAAM 

TN AAAAAAAN NTAATAN AAAAAAAAAAAC AAATTN AAAAAG NTG NGTTAAAAATNTT AAA 

AATAl I \ \ INAAAATTAAAAAMNCGAAAANNCTTTTCCAAAAAANAAGNATAAAAAAAT 

CCAAANTNTTNAAANNANNAAANNGGTTNTTTAA 

Sequence 2481 

CCGCGGTGGCGGCCGAGGTACATCCTGGATGGATGAGAATGGGCACCATATTTTTTATGA 

ATATATA I M MCI II I I! I F GTTTTCTACAGCACCAAAGAAATTCAAATAGGAAAAGGA 

GAGTTGAG AATTG GGAATCAAGAATCAGCCCTGTTTCCATCTTAGCCACACCAACTTATA 

TCTTTATGATTTTCAAAGCTTTTGCCATGTGATTCTGCCCCCACAAAGGCATCGGTATTT 

CCTAAATGGTACCTGCCCGGGGCGGCNCGCTCGAAATACCGAGCCCGGGGAGCATTAAAA 

GTGTAAAAGCCCTGGGGGGTGCCCTTAATGGAGGTNGAGCCTAACCTCACATTTAAATTT 

GGCGTTGCGGCTCACNTGCCCGCTTTTCCAAGTTCGGGGAAACCT 

Sequence 2482 

AGGTACGCGGGACTGTTATTCTCTCCAAAGCTTACCCAGCAATAGGAACTCCCATACCAT 

TTGATAAAATTTTGTATAACAGGCAACAGCATTATGACCCMGGACTGGAATCTTTACTT 

GTCAGATACCAGGAATATACTATTTTTCATACCACGTGCATGTGAAAGGGACTCATGTTT 

GGGTAGGCCTGTATAAGAATGGCACCCCTGTAATGTACACCTATGATGAATACACCAAAG 

GCTACCTGGGATCAGGCCTTCAGGGGAGTGCCCATCATTCGATCTCACAGGAAAAATGAC 

CCAGGGTGTGGGCTCCAGCTTTCCCAATGCCCGAGTCAAAATGGGCCTATTACTTCCTCT 

TGGAGNTATGGTCCCACTTCCTC 

Sequence 2483 

CCCGNGGTGGCGGCCGCCCGGGCAGGTACCGTTCCTCCAGTGCCCAGAGATGCTCTCCGC 
ACCAAGCCACAGATGTGGAGGAGGCAGGTAGGGGGTCAAAGAGGGGTGGTNTCGGTTATT 
CAGGAC'I I M I I M I I I CTTAMTATCCTGNGCTTNTTTCAATCATTTGAAGGTAAAACC 
AGGTCCTGNGANTGGTAAACTG A 1 i 1 1 I GGTTCT 
Sequence 2484 

AGGTACGCGGGTGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCCGTGCTGGACTCC 

GACGGCTCCTTCTTCCTCTATAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAGCAGGGG 

AACCGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAACCACTACACGCAGAAGAG 

CCTCTCCCTGTCCCCGGGTAAATGAGTGCCGACG(3Cnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnCTGCAGGAAATTCGATATCAAGCCTTATTCGATACCCGTCGACCTCGAGGGGGGG 
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GGCCCCCGGGTACCCCAAGCTTTTGTTCCCTTNTAGTGGAGGGGTTAAATTGGCGCCGCT 

TGGGCCGTA 

Sequence 2485 

GAGAGGCGGTTTGCCGTATTGGGGCCGCTCTTCCCGCTTCCTCGGCTCACTTGACTCGCT 

GCGCCTCGGTTCGNTTCGGCTGCCGGGCGAAGGCGGTAATTCAGCTCACTCAAAAGGGCG 
GGT 

Sequence 2486 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGATAATTAACAATTTG 

TGGA MTTGAAAAAGCATAAACTGTGTTATTTGATTAGTAATATGTTCCCTTAAMTTCA 

TTTTGAGGTGTATGTTATACACACAGTAAATTTTTGTTCAGGMTGACTTGCTCATTCTG 

TGTTTTTAAAAATAGGAAATGAGGCATAGTGAGTCATCATTACATCAATTAACCAAAAAA 

TATTTCATCCCCTCCGTCACTGAMTTATCTACTTCAGCCACCTTTCTTATTCTCCGTGT 

TAGGGAGGGCCACGTTTATGGGACTTNTTTAATTTTCCATGTGCCCATTATTTGTCCCAC 

TTACCCGGGCAGTTAGCCCAAAAGGCTAGGCCTGTTTCAGTTCCCACAGGA 

Sequence 2487 

TATACGACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGG 

TGCGATTCGCATATTCCGCGAGATCATCAAGCCAGCAGAGAAATCCCTCCATGAAAAGTT 

AAMCAAGATAAGCGCTTTAGCACCTTCCTCAGCCTACTTGAAGCTGCAGACTTGAAAGA 

GCTCCTGACACAACCTGGAGACTGGACATTATTTGTGCCMCCMTGATGCTTTTAAGGG 

AATGACTAGTGAAGAAAAAGAAATTCTGATACGGGACAAAAAATGCTCTTCAAAAACATC 

ATTCTTTATCACCCTGACACCAGGGAGTTTTCATTC 

GTGGTTTACCTAACATTTTTAAAAGGACCCACACAAGGGAAGCCAAAAATTCTTTTCTTG 

AAAGGAAGGTAAAA 

Sequence 2488 

CCGCGGTGGCGGCCGAGGTACGAAAGAGAGACAAAAGGGTTCTCTTGGAAACAAGAAGAG 

TGACTCCAGATGTGGCCTGAATAATTGCCATGTTAAGTTAATGCAAAAGATCAGAACAGG 

GCTACATTTGCACAGGCAGTTTCTCTCCGGGCCGTAGTTTTCACTGATGATCACCTTTCA 

CAGCATTTTCCCCAACCAGCATTTCACTTAGTCTTCTCTATACCCAGCACCTCCCCCGGC 

ACCCCCGGCAAGCCCACTTATCACTTCCCGACTTTCCAACGTGGCATTCCCGTGGAGGAT 

C CTGT TCCACATTAGGGCGAAAGCAGGGAGAAACACCTGGNGNAGCCAGCCAGGGATGGG 

GTTTTGGGAAAGGAGCCATGCCCTCTGGG 

Sequence 2489 

CCGGGCAGGTACGCGGGAGCAGAAATGATTGCACTATTGATAAATTCCGAAGGAAAAATT 

GTCCATCTTGTCGTCTTCGGAAATGTTATGAAGCAGGGATGACTCTGGGAGCCCGGAAGC 

TGAAGAAACTTGGTAATCTGAAACTACAGGAGGAAGGAGAGGCTTCCAGCACCACCAGCC 

CCACTGAGGAGACAACCCAGAAGCTGACAGTGTCACACATTGAAGGCTATGAATGTCAGC 

CCATCTTTCTGAAATGTCCTGGGAAAGCCCATTTGAAGCCAAGGTGTAAGATGTGNTGCC 

TGGACACCGACAANCAACCCAGGCCCCGACTTCCTTTGGCAGGCCTTTGGCTTCTCTAGC 

CCTCACTGGAACTGGGGGAGNAGAAGA 

Sequence 2490 

CCGGGCAGGTAC I M I I I I I I I I I I 1 I I M I I I I II I I GGGTTTCTGGTAACTTCAAAGC 

TCCCTAAGTCACTAACCTTTCTTCGCCCCTCCTCCACTAAGGGGCTGTCAGGAAGCATCA 

ATTTCANAAAATCTTCTTTAGACAGAACATATTGAGTCTNTGCAACAGCATCTCCCACCA 

CTGTCTGATGCTGANTTGCTGTGGACATGACGCCCAACTCACTATACATCCCGGGACATT 

CAGTCACTTNAGGCTCCTCGGGNCGGNGGGGCCTGCCCTCNCGATTTCTTCCAGC 

Sequence 2491 

GCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTAATAGCTCAAACTCAGAGTCA 

TCGTGCTCCCAATTCCAAAGAGATTCCTAAAAGAGGCAACTT 

Sequence 2492 
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ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAGTGGATACCCCA 

G AATGACCTTC TAGGTCATCCAAAAACC AGAGCTTTTATAACTCATGGTGGAG CCAATGG " 

CATCTATGAGGCAATCTACCATGGGATCCCTATGGTGGGCATTCCATTGTTTTTTGATCA 

ACCTGATAACATTGCTCACATGAAGGCCAAGGGAGCAGCTGTTAGATTGGACTTCAACAC 

AATGTCGAGTACCTGCCCGGGCGGCGCGCTCTAGAACTAGTGGGGATCCCCCGGGCCTGC 

AGGAAATTCCGATATCAAGGCTTATCGATACCCGTCCGACCCTNCGAGGGGGGGGCCCCG 

GGTACCCCAGCTTTTTGTTTCCCTTTTAAGTGGAGGGGTTTAAATT 

CG7TAAANCATGGGTCATT 

Sequence 2493 

CCGCGGTGGCGGCCGAGGTACGCGGGGAGTCTTCACTGCTCTGCGTCCTGTGCTGATAAA 

GGCTCGCCGCTGTGACCCTGTTACCTGCAAGAACTTGGAGGTTCACAGCTAAGACGCCAG 

GACCCCCTGGAAGCCTAGAAATGGGACCACTGACATTTAGGGATGTGAAAATAGAATTCT 

CTCTAGAGGAATGGCAATGCTTGGACACTGCGCAGCGGAATTTATATAGAGATGTGATGT 

TAGAGAACTACAGAAACCTGGTCTTCCTTGGTATTGCTGTCTCTAAGCCCAGGACCTGGA 

TCACCCTGGCCTGGGAGGCAAGGGAAAGGGAGCCCCTGGGAATCCTGGAAGNAGACATGG 

AGATGGGTTNGACAAAACCCCCAGT 

Sequence 2494 

CCGCGGTGGCGGCCGAGGTACCCCCAGTGATGGGACATGGTCGCGCTGGCGGTCGGGGCA 

GGGCAGGAATCGGCACCGATCTGGCGGCGGCGGCTTGGGCGCCTGCACTGTGTGTCCCCG 

CGTACGCGGGGAATTCAGAAATCACTACAAGCAGCATTAAGACTGAAGTTGGAATATTCT 

GTTGACCATAAAACCTTGATATCATTCTGTGTATATAGAATGTAAAAGGAATATTACAGT 

GTTAACTGCCATATATGTAATATACACAAACTCAATTAGCATTGTAAATGGCCCAAAATG 

GCATTCCCCCATGCTTTTTCTGGTTTTTCAAAAAAAATTGGAAA 

TCTTTATCCCCAAACAGGCTGCCCTAATTTTTAAGGGAGTTCTG 

Sequence 2495 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGACCAGTCTT 

GAGAGGTGGCCAGCCAGACTGCCTGTCCACATGCGTGTCAGCACATACAGCCGCTTCCTG 

GMGCCGCCTGGMTGTCTTCACGGCAGCGTTTTGCTCACACAGCANGCTTTTGCACGCC 

CCAGGCAGCCCCGACTGCTGAAATCCAACTTGAGCTGGCTGGTGGTCCCTGGATCCTAGA 

GCCCTTCACTTCGGGTTACTCCCTCTTTCTTGCCTCTATTTCTTAGTTGGAAGAAAATAA 

ACTCACAAATTATTGGTGCAGTAATTTTTTCCCGGGGGAAAAGGTAAAAGGCCTCAGGGA 

ATGCCCCACCGCCCTTTTCTTTCCAAAGGCCTTTGTCTCTGGAGACCCTCTTAAGGTTC 

Sequence 2496 

CTACTTAGGGCAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC rTTTTTTTTTTTTTTl 
I I I I I I I I i IAI I I I I 1 1 1 I I 1 1 1 1 II I II I 1 1 1 1 I I CTTTGGGCAACACTTTATTGGGA 
AANATTTACNCNCGGNGACCTGTCNTAGGCCAAGCGATNAAAAAAGGGCCCCAGGANCNC 
TGGGGTCCCGAGGNGGCTCAAATGGAANCCATGGGA 
Sequence 2497 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGTTTGAGAAGCCAGCGCTCACCC 
ACCCGGGGTCTCTGTGCATTGACCTTTGGGTGCTGACTTGGAGAAAAGCACAAACACGAC 
CAGTCCCATCCTGGCTCCCGTGGGGCTTCTTCTATCTACGCATTGTATCGACTGCATTAG 
TTGGACTAAGATGATGACTCAGTTAAAGGAGGAGACAAATGCTGACTGTCTAAGCAAGAA 
TGGCCCAAGCTGGCAAGAAAAAGCACACTTGCATACATCCCGCGTACCTCGGCCCGCTCT 
AGAACTAGTGGGATCCCCCGGGCCTGCAGGNAATTTCGATATCAAGNCTTATTCGGATTA 
CCCGTCCGACCTCGNAGGGGGGGGGGCCCCGGTACCCCAAGC I \ I I IGTTCCC 
Sequence 2498 

CCGGGCAGGTACTGAGTCAAGGACGTCTTTAACGTCATGGACGGCTCCTTTACCCACGCT 
TTCTAGATCTTCGACTGCATCTrrTCCTAGTTTTCCGAGTCCCCCCACAGC I H I 1 I I GC 
TCCGTCTAGGCCTTTTTCCAGAAGGCTGGATCTCTGCTTCCTTGGCTTTGGTGCCTGTCT 
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GGCTAACCCTGGGTCTTCACCTGCATTTTCCTTTTGAGCTGCTGATGCTTCATGGCAAGG 

GTTCCCCCGATCCTGGGGCAAGCCCCGCGTAACCCTCGGCNCGCTCTAGNAACCTAGGTG 

GGATTCCCCCGGGCCTGCAAGGGAATTTCGGATATCAAAGCTTATCGGATACCCGTCCGA 

CCTTCGGAGGGGGGGGGCCCCCGGT 

Sequence 2499 

TCACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGG 

ACTCAGAAGCTTGGACCGCATCCTAGCCGCCGACTCACACAAGGCAGGTGGGTGAGGAAA 

TCCAGAGTTGCCATGGAGAAAATTCCAGTGTCAGCATTCTTGCTCCTTGTGGCCCTCTCC 

TACACTCTGGCCAGAGATACCACAGTCAAACCTGGAGCCAAAAAGGACACAAAGGACTCT 

CGACCCAAACTGCCCCAGACCCTCTCCAGAGGTTGGGGTGACCAACTCATCTGGACTCAG 

GACATATTGAAGGAAGCTCTATATAAAATGCCAAGACAAGGCAACANAACCCTTGGATGA 

A1TATTTCATCACTTTGGAATGMGTGCCCCACCACAGTCAAGCCTTTTAAAGGAAAAGG 

TGGTTTGCT 

Sequence 2500 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC 1 rTTTTTTTTTTT 

rTTTTTTTTTTTl I ATNGCAAGGACTGAGGCTGGAGCTCCACTTAAGTGGTTTGATCCAA 

ACTGGAAAAGCAACTTCCAGCMGGAACATCCANATAAACCCTTTGCGGAACAAAANAGN 

GG GCCC CCTCCTGACCACAAAGATGGAACCCGCTCTCCTTGATACAATCCACGAATTTAC 

ATCTTTTAATCGGCTGTCCACTCAGGCCTAGGGCGTAGTCAGAAAATGACACCCTCCACA 

CATTGTGGTAAGGCAGNAAATTAAGCCTCAAACCCACCGGGTGACCCTATGTAGGTTCCT 

GGGGTTTTGACCTGGGGGAGCCACCTTNTCAAACTTCCTTATACCTTCCCGACCGAAAAC 

TCACTGGGAGCT 

Sequence 2501 

ACTGTNNCCCGACCCTGCCCGCTTACCGGGATACCTGGTCNCGCCCTTTCTCCCTTCGGG 
GAAGGCGTGGGCGCCTTTCTTCATTAANCTCACCGCTGTAGGGTATTCTCAGTTCG 
Sequence 2502 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCACCATGTGGGACTCCAG 

GCACCATCTGTTCTCCCCAGGGACCTGCTGACTTGAATGCCAGCCCTTACTCCTCTGTGT 

TGCTTTGGGCCACCTGGGGCTGCACCCCCTGCCCTTTCTCTGCCCCATCCCTACCCTAGC 

CTTGCTCTCAGCCACCTTGATAGTCACTGGGCTCCCTGTGACTTCTGACCCTGACACCCC 

TCCCTTGGATTCTGCCTGGGCTGGAGTCTAGGGCCTGGGGCTACATTTGGCTNTCTGTAC 

CTGCCCGGGNCGGCCGCTCGACGGGTTATCCACAGNAATCAGGGGGGATAACCGCCAGGG 

AAAGGAACATGGTGAAGCCAAAAAGGGCCAGGCAAAAAGGGCCAAGGGAACCCGTAAANA 

AG 

Sequence 2503 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTATGATCCAAACACCAAAAGCTGTGCAAGATT 

CTGGTATGGAGGTTGTGGTGGAAACGAAAACAAATTTGGATCACAGAAAGAATGTGAAAA 

GGTTTGCGCTCCTGTGCTCGCCAAACCCGGAGTCATCAGTGTGATGGGAACCTAAGCGTG 

GGTGGCCAACATCATATACCTCTTGAAGAAGAAGGAGTCAGCCATCGCCAACTTGTCTCT 

GTAGNAAGCTCCGGGTGTAGATTCCCTTGCACTGTATCATTTTCATGCTTTGGATNTTAG 

ACCTCGGAACTCGGGGAGGGGAACAATCCTGCCTGCATGGACCCTATTCAAGTTATGGGT 

GGCTAATGTGNTCTGGTGGGACC 

Sequence 2504 

ACTTAGGGCGAATTG GAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTAC rTTTTTTTTl 

nTTTTTTTTTTTTn ICTGCCCAACGAAATTATTTATTGCTGTCAGGAGGCCCAGGACC 

CTCCANACTGAANAGTCCATGGGCCCCTGTGCATGTGANAGCTACACAGGGAATCGCTGT 

CACCTTCCAAGGATAAGCAGACAGGGCCACACATGCCATGGCCACTCGGNGCACAGGGGC 

CAGGCTGNGCCCAGAGACCCAGGCATCCAGCTTGCATTCCAAGTTCCTTACTGAGAGCCC 

CACTGCTTGATGGCAAGNCTCTAGGCATTGCTTGGGGATTTGGGTGACTTTTGCCTCATT 
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GAAANAAANAATGTTCTGGGGNGGTCTGTAAGGGGACAATTGGATCATTCAAGGGAAGGG 
GCC 

Sequence 2505 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCAGAAGTGTCCTG 
GAATGGGGCCCATGAGATGGTTGTCTGAGAGAGAGCTTCTTGTCCTACATTCGGCGGGTA 
TGGTCTTGGCCTATGCCTTATGGG 
Sequence 2506 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCTGACACTGCAGTGAAGGGCATG 
CTAAGTCTAGGCACAGGTCCTGGCAGCAGGAAGGAGACAGAGCCTCTCCCAGGCACACAT 

Sequence 2507 

ACTACTTAGGGCAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACACCACCAA 

AGACCAATAATAAAAAAGAAGGTTCCAGGGAGGGCATGACCTCTGAAATTCCCCATGATC 

TTTGGGTGTTTAMTCCCATTTTCCCTTCTGAGTCATTGGiTCATCCTTGGTCTCAGTAGA 

TCAGGCACGTTAGAACTGTGGCAGAAGAGATAAGAATGAAGCAGCCAAAATTGAGCCTTT 

GCTTCTTAGGTMTGCTGAAAAGTTTCACACTTTTACCTCTGCCTGAGTCCCCCGCCGAA 

TCATTGCGTTACCTCGGCCCGCTCTAAGNAACTAGTGGGATCCCCCCGGGCCTGCAGGGA 

ATTTCCGATATTCAMGCTTTATTCGGATACCCGTTCGGACCTTCGTAGGGGGGGGGCCC 

CCGGGTACCC 

Sequence 2508 

TNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGGGAAACAT 

GCATMGTGCCAATCCTTTGAATGTTCCACGGAAACACTGGTGGACAGATTCTAGTGCTG 

AGAAGAAACACGTTTGGTTTGGAGAGTCCATGGATGGTGGTTTTCAGTTTAGCTACGGCA 

ATCCTGAACTTCCTGAAGATGTCCTTGATGTGCANCTGGCATTCCTTCGACTTCTCTCCA 

GCCGAGCTTCCCAGAACATCACATATCACTGCAAAAATAGCCATTGCATACATGGGATCA 

GGCCAGTGGAAATGNTAAAGGAAGGCCCCTGNAAGCTGATGGGGGTCAAAATGGAAGGTG 

AAATTCAAGGGCTGG AAGGGAAAATAG NCAAAATTC ACCCTAC 

Sequence 2509 

CCGCGGTGGCGGCCGAGGTAC I I I M I I I 1 1 I I I I I I I I I i I I I M I I CTAAACTTCTCT 

TCTCACTTCATTTCATTCATTTGATCTTCAATCACTGATACCCTTTCTTCCAGTTGATCG 

AATCGGCTACTGAGGCTTGTGCATTCGTCACATAGTTCTCGTGCCTTGGTTTTCAGCTCC 

ATCA GGTC CTTTAA GGACTTCTCTGCATTGGTTATTCTAGTTAGCCATTCATCTAATTTT 

TTTTCAAGGTTTTTMCTTCTTTGCCATGGG 

GTTTGATNCGTNCTGAAGCCCTTNTTTTTCTCAACTTCACCCAAAAGGTCATTTCTCCAT 
CAGCNTTTGTTTCTGNATGCCTGGNNGMGGGAAGCCTTGCCGTTTCCTT 
Sequence 2510 

CACTACTATAGGGCGAATTGGAGCTCACCGCGGTGGCGGCCGTGTCCTCAGATCTCAGGC 

TGCTCAGCTCCATGTAGGCTGTGTCTGTAGATGTGTCCTCGGTCATGGTGACTCTGCCCT 

GGAACTTCTGTGCGTTGATTGTTTCACCATCTTCAGGATCAAAACCTCCCTTCCACTCAA 

GCCCTTTTCCAGGAGCCTGTCGCACCCAGTGCATGGATAATTCAGTGAGGGTGTATCCGG 

AAACCTTGCAGGAGACCTTCACTGAGGCCCCAGGCTTCTTCACCTCAGCCCCAGACTGTA 

CCTCGGCCGCTCTAGAACTAGTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCTT 

ATCGATACCCGTCCGACCTTCGAGGGGGGGGCCCGGGTACCCAACTTTTGTTTCCCTTTA 

Sequence 2511 

CCGGGCAGGTACCAAGGCTTAGAGGTCAGACTGAGGAAACAAATGCTTCACAGAGTCCTC 
AACCCATGACCATGAGTTCTTTAATAAGTAAGTGTGTGTGTGTGTGTATGTAAAATATGT 
GAGGCCCTCTAACGCCTCTTTTCTACGGTAGAACAAAGAGAAACTTCAGTTTTTGCCATC 
AATTTATTAAAATAAACATGTATAGCAGGTTTCAACAATTGTCTTGTAGTTTGTAGTAAA 
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MGACATAAGAAAGAGAAGGTGTGGTTTGCAGCAATCCGTAGTTGGTTTCTCACCATACC 

CTGCAGTTCTGTGAGCCAAAGGTCTTGCAGAAAGTTAAAATAAATCACAAAGACTGCTTG 

TCATTATATTAATTGCTAGGAAAG 

Sequence 2512 

GTGGGGCGGGCCCGCCCGGGGGCCAGGGTTANCCAATTGGGGTGGGGCCCAAATTGGAAT 

TTTGGAATGGGGTTAAAAGGGGGMGGGGGAATCCNGTTTTGAACCCTCCGGTTCTGGTT 

TMTTGGTAAAAAAAGGGAATTGCCCGNTAAGGGGGAATTGGGGGAAGGGGGCCGGAATT 

GGAAGGG MCCTTM TGGMTTGMTTGGGCCGGGGGTTANGGGGAATTAGGTTTCCAAG 

AAACCNGG I Mil ICTTAATTTTTTCCCTTGGAAGGCCCGGTTCCTTGGAANGNAATTGT 

TTAAGGTTNATTTTMGGTTTTNAAGTTTTTTNGGTTTGGTC 

GGGAAAAAAAAGGGGGGCCAATTAACCAAAGGGGGAACTTAANGGGGNAAAAGGCCCAAG 
NAATTAAAAGGGGGGAAAAAAAAATTGGAAATTTTAAT1TGGAAAGGGGGGGCCCGGNTT 
GGG AATTCCAANTTGGAAAAAAAGGN GGTTTGG NAATTAAAAAAGGGCCCTTCCTTTTTC 
CTTAATTTGGAMNTAAGGGGGGGGGGGGAAAAMGGNTTAAAAGCCCG 
NGGTTTAAAGGAA MCCCCC CTTTAACCCTTTTTGGGCCGGGCCCTTTGNCCCAATTf 
TTTNGG CCCCCCAAI 111 I I AAAAAAGGGAAATTTMTTTAATTTANGGGGGGGNAAATT 
Mill I AAAGGGCC CCCTTT AANTTAAAAAN I I I I 1 1 I ) AAAAAACCCTTTTTTTNGGAA 
CCCAAAAAAAAAGN M M l I AATTTTGGNAAAAAAAATTGGGGGGGGG I I i ill I I I M C 
CCCAAAAATTTAACCCCC I Mi l 
Sequence 2513 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCCGGCCGCCCGGGCAGGTACCTGTCCTAGC 

AGTGGCAATCTTTTGGGGTCCCCTAAAATAAAGAAAGGCTTATCTCCACACTGTAATGGT 

AGTGATTGTGGATATTCATCTAGCATGGAAGGGAGTGAAACAGGTTCTCGGGAGGGTTCG 

GATGTTGCCTGCACTGAAGGCATTTGTAATCATGATGAACACGGTGATGACTCTTGTGTT 

CATCACTGTGAAGACAAAGAGGATGATGGTGATAGTTGTGTTGAATGTTGGGCAAATTCT 

GAAGAGAACGACACAAAAGGAAAAAATAAAAAGAAGAAGAAGAAAAACAAGATACTGAAA 

TGTGATGAACATATCCAGAAGCTTGGAAGCTGTATTACAGATCCAGGTAATCGAGAGACC 
TCAGGAAATACCCATG 

Sequence 2514 

CCGTAATACGACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAG 

GTACTGCGGCGGGTAGGCCTAGGATTGTGGGGGCAATGAATGAAGCGMCAGATTTTCGT 

TCATTTTGGTTCTCAGGGTTTGTTATAATTTTTTATTTTTATGGGC^^ 

TAGGTGGTAAGTTTGTGTTTAATATTTTTAGTTGGGTGATGAGGAATAAGTGTAAGGAGT 

ATGGGGGGTAATTATGGTGGGCCATACCGGTAAGTATTTAGTTGGGGCATTTCACTGTAA 

AAGAGGNTGTTGGTTCTC7TAATNTTTAACTTTAAAAA 

Sequence 2515 

CTAACACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGATC 

CTTAAAGAAGGTGTGGGGTCTTTCCCAACCTGAGGATTTCTGAAAGGTTCACAGGTTCAA 

TATTTMTGCTTCAGAAAGCATTGTGAGGTTCCCAACACTGTCAGCAAAAACCTTAGGAG 

AAAAACTTAAAAATATATGAATACATGCGCAATACACAGCTACAGACACACATTCTGTTG 

ACAAGGGAAAACCTTCAAAGCATGTTTCTTTCCCTCACCACAACAGAACATGCAGTACCT 

GCCCGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnCGTCGACCTnnnnnnnnnnnnnnnnnnnnnnnnnnnnTGTTCCCTTTAG 

TGAGGGGTTAATTGCGCGCTTGGCCGTAA 

Sequence 2516 

TCTGGGTTGGGGATTATTTCTGGGTTTCTACTTCCTGTTNGAAGAATGTTGGCATGGAAN 
A ^ T QGTA AGTTTGAAGAAGATGAGTGCCGGGGGCCTTCATCTAATCCCTNGGAAAATTGG 
TCTTTTTCCCCACCAATTCCCCTTGGACACCAAGAAATTANTGGAAGCCCANTACAAGGG 
AANTTCTTGNAAAGNAAAATNGGGGGTNCTTCTTTGGCCCACCCTTCCCCAAGTTAAAAA 
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AGGAATTNA I I M I M 1 I I N I AAAAAAAAAAAAAAAGGGGTAAAAAATNCTTAAAAACC 
TTTGGGNGGCC I I I I I N AATCCTTNG N ATAATTGGAAAAAAAANTAACCAGG N AACCAGG 
CCCAAATTAAGGAAAAAAAAAAATTTTTAAAAAAAAAATT^ 
Sequence 2517 

CCGCGGTGGCCGGCCGCCCGGGCAGGTACTGGATGTCAGGTCTGCGAAACTTCTTAGATT 

TTGACCTCAGTCCATAAACCACACTATCACCTCGGCCATCATATGTGTCTACTGTGGGGA 

CAACTGGAGTGAAMTTTCGGTTGCTGGCAGGTCCCGTGGGAAAATCAGTGACCAGTTCA 

TCAGATTCATCAGAATGGTGAGACTCATCAAGACTGGTGAGAAATCATCAGTGGTCATCT 

ACCATCATCAGAAGTCCGTTCCGAAGTCAATTGGGAAGTCCTGGGCTGTCCACATGGGTC 

CGTCATCATCTTCATCATCCCATATCATCCATTGTGGGTCATTGGCTTTTTCGTTGGGAC 

CTTACTTGGAAGGGGTCTCTTTGTTTAAAGTCA7 

CATTTCTGTGGGGG 

Sequence 2518 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTTTTT 
TTTTTTTTTTTTT1 IGGGCTTTTATCCTGTTTGTTGTTCTCTGAGCTTCCTGAATCTGTG 
GTTTCGTGTCACATATTAATTTAGAAAMTCTCATTCATTATTGCTTTGAAT^ 
TATTCCTTTCTCTCTTTCCTCTCCTTCTGGGGTTCCACTATGCATACTTACATCTTTTAT 
AGTTGTCCCACTGTTCTTGGATATTCTGTTCC I N i l I A TTCAGTC I 1 < I 1 CCCTTTGCT 
TTCATTTTTGGAAGTTTCTCTTGAAATATACTGAAGCTCCAGATTCTTTCCTCACCCACA 
TCCAGTCTGCTMTGAGGCTTCMCACCATGCTTCATTTTTGGTMGGTTTCTTGACTTC 
TAGAAI I I I IATTT 
Sequence 2519 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTTTTTT 

rTTTTTTTTTTTTTTTTn CTCGNATCTTATAGGCCAGGCGNGGNGGCTCACGCCTGTAA 

TCCCAGCACTTTGGGAGGTCGAGACAGGCGGATCACGAGGTCAGGAAATCGAGACCATCC 

TGACCAACATGGTGAAACCCCGTCTNTACTAAAAATACAAAAAAATTAGCCAGGCGTGGC 

GGCANACACCTGTAGTCCCAGCTACGCAACTGNGCCACCTCAGAGAAGTANATTTTTAAA 

CAMCCCAATTTTNTCTTGCTGTTAACAACACTGAANACTAGTANCAGAGGAGTGAAAGT 

GTGGGTTAANAGCCTTAAAAACTTGNGGGAAAAG 

Sequence 2520 

ATTGGAGCTCCCCGCGGTGGCCGGCCGCCCGGGCAGGTACAGAAGGGCCATGCTGTTATT 

ACTCTTACACAAGGAGGCAGCCCTCGAGCCACAGGGTCCAGCTGTTGGCTATAATAGCCT 

ACCGGTCTCTGATGATCACCATGTTTCTGGAATTCAAGCCAGGAAGAAGCAGCAATCTGT 

CTTCTGGATTAAMCTGAAGATCAACCTACTTTCAACTTACTAAGAAAGGGGATCATGGA 

CATTGAAGCATATCTTGAAAGAATTGGGTATAAGAAGTCTAGGAACAAATTGGACTTGGA 

AACATTAACTGACATTCTTCAACACCAGATCCGAGCTGTTCCCTTTGAGAACCTTAACAT 

CCATTGTGGGGATGCCATGGACTTAGGCTTAGAGGCCATTTTTGATCAAGTTGTGAGAAG 

AAA 

Sequence 2521 

CCGGGCAGGTAC I ITTTTTTTGTTTTTTTT1 IGTTTGI I 1 1 I I IGI I II I I GAG ATGGAA 

TTTCGCTCTTGTTGCCCAGGCTGGAGTGCANTGGCGTGATCTCAGCTCACCGTANCCTCC 

TGCTTNAGCCCTCCCAAGTAAGCTGGGATTACACGGTGCCATTGCACACTCATGCCTGGG 

ACTMTN GTATANGTAC NNTTGTATTAAATNTTTTTG NTATNTTN TTCCTAG AAGG ACG N 

GTAGTTTTCCTCCATTGTATGAATCAGGGGCCTGGGTTCTCCGAAATTTTCCCAACCCTT 

NAAGGTTGGATTCACNGCCCAACCCTCAAGNCCTTCCCNCAAGTAGNTTGGCNTGGGGGT 

ATNTAACTAGGGNCNTTGNAACCCAACNCGGTANNCCCNCAAACTTNATTGCCNTGGNTG 

NTNTTTTTTTCTTT 

Sequence 2522 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTGTGAATTGGACATCTG 
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TTGTTTGCAGATGTAATTAAGATGGGTCATACTTAAGTAGTGTGGGCCTCTAATCCAATG 

TGACTGGAGTCCTACTAAGGGAGCAGGGTCAGAGGCAGACATGGGAGAATGCCATGTGAT 

GACGGAGGCTGAGATTGAAGTGTTGCAACTGCAATCCAAGGAATGCCAACATTTGGTGGC 

CACCATGACAAGGTAGAAGGAGGCAAGGAAGGCTCCAACCAGTGTCTCAGAGGAAGCATG 

GCCCTGGTGACATGCTGAGTTTACGCTTCCAGCCTCCAGAGCTATAAGAAGGTNAAATTG 

TCTGTTTTCTN/TNGGTTCCCTGTTACAGCAGCCTTAAAAAAAT 

Sequence 2523 

AGGTACCGCATTCCTACTTCATTGCCCCTGATGTAACTGGACTCCCAACAATACCCGAGA 

GTAGAAATCTTACAGAATATTTTGTTGCCGTGGATGTGAACAACATGCTGCAGCTGTATG 

CCAGTATGCTGCATGAMGGCGCATCGTGATTATCTCGAGCAAATTAAGCACTTTAACTG 

CCTGTATCCATGGATCAGCTGCTCTTCTATACCCAATGTATTGGCAACACATATACATCC 

CAGTGCTTCCTCCACACCTGCTGGACTACTGCTGTGCCCCAATGCCATACCTGATTGGAA 

TACACTCCAGCCTCATAGAGAGAGTGAAAAACAAATCATTGGAAGATGT 
Sequence 2524 

TAGGGCGAATCGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGAATGGGG 

AAAGACAATCATTGAATACAAAACAAATAAGCCATCACGCCTGCCCTTCCTTGATATTGC 

ACCTTTGGACATCGGTGGTGCTGACCAGGAATTCTTTGTGGACATTGGCCCAGTCTGTTT 

CAAATAAATGAACTC MTCTAA ATTAAAAAAGAAAGAMTTTGAAAAAACTTTCTCTT^ 

CCATTTCTTC7TCTTCTTTTTTMCTGAAAGCTGAATCCTTCCATTTCTTCTGC 

ACTTGCTTAAATTGTGGGCAAAAGAGAAAAAGAAGGATTGATCAGAGCATTGTGCAATAC 

AGTTTCATTAACTCCTTCCCTCGCTCCCCCAAAAATTTGAATTTTTTTT 

CACCTGTTATGGAAAATGTCAACC7TTGTAAGAAAACCAAAATAAAAATTGAAAAATAAA 

AACCATAAACATTTGCACCACTTGTGGCTTTTGAATATCTTCCACAGAGGGAAGTTTAAA 

ACCCAAACTTCCAAANGGTTAAACTACCTCAAAACACTTTCCCATGAGTGTGATCCACAT 

TGGTAGGTGCTGACCTAACAGAGATGAACTGAGGCCCTGGNTTGNTTGGTCATAATACAA 

AAGGNGCTAANTAATAGGATTTCAANACTTGAANAATGGTGATNGGGCTANAANAATTGG 

AGAANAAATCCTCTGNATTGNGTGGATCGGGNGGGGGATTTTTAA 

Sequence 2525 

CCGCGGTGGCGGCCCGCCCGGGCAGGTACCAACTAACCTCCCGCTCTTTATCTCCCCAAC 

CTTCCCACTCCCCAGCCTCTAGTAACCATTATTCTCCTCTACCTGTGAGTTGATTTTGAC 

CTCCCACATAAGAGAATGTGGCAGTATTTGTTTTTCTGTGCCTGGGTTATTTCATTTAAC 

ATACTGTCCTCTGGTTCCATCCTTGTTGTTGCAAACGACAGGATTTTGTTCTTTTTTATG 

ACTGMTMTATTCCATAGCCATATACATACCATGTTTTCTGTATCCATTCATTCGTTGA 

TTGGCAGTAAGGTTGATTCCATATCTTGGCTGTTGTGAATATTACTGCAGTAAACATAGG 

AATACAGATATCTTTTCCATATACTGATTTCCTTTGTTTTGGATATATACCCAGCAGTGG 

GCTrGCTGGATCATATGGCAGCTCTGATTTTAGGTTTTTAAGGATGGG I iTTTTTTTTGC 

AATGAGAATAAAAATTTTCCATGTATMGAACAGTGTCTTTGMCTTTGTTTATATCCTT 
TGCCCA 

Sequence 2526 

CCGCGGTGGCGGCCCGCCCGGGCAGGTACAATACTTTCTGACTGTCCACTAAAATGTCAA 

ACTATTTCAGACTCCTACTCCTCCACTGTCTTAGTTGATTTTGTGCTGCTGTTATAGAAT 

ACCTGAAACTGGGTAATTTATAGAAAACAGAGATTTATTTCTTATAGTTCTGGAGACTGG 

TCAAGGGGATGGCATCTGGTGAGAACCTTCTTGCTACATCATCTCATGGAAGAAGGTGGA 

AGGGCAAGAGGGCACATATGCCCAAGAAAATGTGTGCACACACAAGAGAGGTGGAGAGAG 

AAAGAGAGAGATAAATAGAGAGACGAAAAGGGGGCCAAGCTCATCCTTTTATTAGGAATC 

CATTCCCAT GATCT CCCATGATAACAGCATTAATCCATTATAAGTGCAGAGCCCTCATAA 

CCTAATCACTTTTTAAAAGTCCCACCTCTCAACACCATTGCATTGAGGATTAAGTTTCCA 

ACACATAAACTTTGGGAAATGCATACAAACCATAGCACTACCTCCACTCACATCTTGGCT 

CTTCAATGTCAAGGTATATTGGTTGGTAAGGGAATATAAACACATGAAGACATGTCTCTC 
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CTTTCAAT CCAM GACTTACTGGAATTGNGTCAAGCTTCTTGTGGTCAGAAATGAGGNCT 

TCTCTTTATTTTTGGC 

Sequence 2527 

GCGTTANACGACTCACTNTNAGGGCGAATTGGAANCTCCACCGCGGTGGCGGCCGCCCGG 

GCAGGTACGCGGGCACTGTAATGCTNACTTANCATTMCCTTTTMGTTAAAGATTAAGA 

GAACCAACACCTNTTTACAGTAGAAATGCCCCAACTAAATACTACACGTATGGCCCACCA 

TTAATTACCCCCATACTNCTTACACTANTCCTGATCACCCAA 

Sequence 2528 

CCGCGGTGGCGGCCCGCCCGGGCAGGTACCCTTTTCAGACTAATTGGAACTTGAGTCATG 

GCTGTCACACTGAAATAATTTGTTCATTCAACTGCAGTGAAAATCAATATAGATCAACAT 

GCATAAAGTTCAGTAACTATTAAATGGGAGAAGCAAGCAAAAGCAACAGGAAAATTAAAT 

CAGGATGCTGAGGGGGGAACAGGGATTGCTGGCCATGAATTTTMTTGAATTTTTGACGG 

GGCAAGCTACGTTACATTATGGCAGCCCCCGTAC I 1 I 1 I i M M N I I 1 1 1 T 1 ( I I 1 1 1 C 

GAGTTTTATGATTTATTTMCTNGTGGAACAAAAATAAACCAGATTAACCACAACCATGC 

CTTACTTTATCAMTGTATMGAAGTAAATATGAATCTTATATGACAAAATGTTTCATTC 

ATTATAACAAATTTCCAATAATCCTGTCAATTATATTTCTAAATTTTCC 

Sequence 2529 

CGAGGTACGCGGGGGGGTGGTGTGGTCCAAAGGACAGGCTGGATGGCGGGTGCATCGGCG 

TGGGCGTGGTCAGCATGTGTCTGCAATGCCTGTGGGCTCTACTACAAGCTTCACAATATT 

AACAGACCCCTGACTATGAAGAAGGAAGGCATCCATACCATAAACCGAAAAATGTCTAGC 

AAATCCAAAAAGTGCAAAAAAAGTGCATGACTCACTGGAGGACTTCCCCAAGAACAGCTC 

GTTTAACC 

Sequence 2530 

CGAGGTACAAAAAACACAAGGAATACAACCCAATAGAAAATAGTCCTGGGAATGTGGTCA 
GMGCAAAGGCCTGAGTGTCTTTCTCAACCGTGCAAAAGCCGTGTTCTTCCCGGGAAACC 
AGGAAAAGGATCCGCTCCCAAAAACCAAGAATTTAAAGGAGTTTCTTAAATTTCGACCTT 
GTTTCTGAAGCTCACTTTTCAGTGCCATTGATGTGAGATGTGCTGGAGTGGCTATTAACC 
TTTTTTTCCTAAAGATTATTGTTAAATAGATATTGTGGTTTGGGGMGTTGAA 
TAGGTTAAATGTCATTTTAGAGATGGGGAGAGGGATTATACTGCAGGCAGCTTCAGCCAT 
GTTGTGAAACTGATAAAAGCAACTTAGCAAGGCTTCTTTTCATTA I MM IA TGT 
Sequence 2531 

AGGTACI rTTTTTTTTTTTTTTTTI I GTTTATG AAG ACTTTTATTAAATTACAGTGTATT 

ACAGATTATATCATAATAATAAGCCTTTCATCTTTAGGCTAATATGATACAAAAACCTAC 

TTGGCCACATTACTTCTTGAGTTrCTTTTTGGGCAGCTTTCTTNTT^ 

CCGCTTCATAGCATTGANCCCGTGATTCTTTGTGAMGTTTGGGGCCCTTTAAGGGATGC 

TGAGGGAGAGCTGCTGGATTCTGAAANNAATTTTGCTNGGTAAGAACCTGCCCGGGCCGG 

CCGCTCTTAGAACTTAGTTGGGATCCCCCNGGNGCTGCAGGGAATTCNATATCNAAGCTT 

ATCGAAACCCGCCGACCNTCNAAGGGGGGG 

Sequence 2532 

CCGCGGTGGCGGCCCGCCCGGGCAGGTACGCGGGGGTCACCTGCGGTGGGCGCACTAGAG 

GTCTTTTGAACTCCTTCGGCTACCGTCGCCGCGTTCTCGCTGTGCACTCTTATTCTGCGC 

CTGCGCGCGGCTACAGCACGGTTCGTTTTTCCTTTAGTCAGGAAGGACGTTGGTGTTGAG 

GTTGGCATACGTATCAAGGACAGTAACTACCATGGCTCCCGAAGTTTTGCCAAAACCTCG 

GATGCGTGGCCTTCTGGCCAGGCGTCTGCGAAATCATATGGCTGTAGCATTCGTGCTATC 

CCTGGGGGTTGCAGCTTTGTATAAGTTTCGTGTGGCTGATCAAAGAAAGAAGGCATACGC 

AGATTTCTACAGAAACTATGATGTCATGAAAGATTTTGAGGAGATGAGGAAGGCTGGTAT 

CTTTCANAGTGTAAAGTAATCTTGGAATATAAAGAATTTCTTCAGGTTGAATTACCTAA 

Sequence 2533 

AGGTACAGCTGCTATCTTATTGGACTACAGTAAATATTTTTTAAAAGGACACCAATGAGG 
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GGCACCATCTGGTGTTAACCTTAACCAGAAAGCTGGTTTCCTCCTCCTCCCCCCCCGCGT 
ACTTTGGCCTCTCTGGGATAGAAGTTATTCAGCAGGCACACAACAGAGGCAGTTCCAGAT 
TTCAACTGCTCATCAGATGGCGGGAAGATGAAAGACAGATGGTGCAGCCACAGTTCGTTT 
TGATTTCCA CCTT GGTCCCTTGGCCGAAACGTCCTAGGCCAAATCGTTGGACCTGCCCGG 
GCGGNCGCTTTTAGAAACTAGTTGGGAATCCCCCGGGCTTGCAAGGGAAATTTCGAATAT 

TCNAAGCCTTAATCCGATAACCCCGTTCNGAACCCTTNGAANGGGGGGGGGGCCCCGGGT 
ANCCCAANC 1 lit! IGGTT 

Sequence 2534 

AGGNCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGCAGGTACTTTNTTTTTTTTTTT 



nrrnm nttnnnggggngggttgagtcggagtctcactctgtcgcccaggctggagtg 

CAGTGGCACGATCTTGGCTCACTGCAACCTCCGCCTCCTGGGTTCAAGTGATTTCTGCCT 

CAGC CTCCCAAGTAGCTGGGACTACAGGCAAGCGCCACCACGCCCAGCTAATTTCTGTGT 

TTTTAGTANAGACAGGGTTTCACTATGTTGGCCAGGATGGTCTCGATCTCTTGGCCTCGT 

GATC CACCCG CCTCAGCCTCCCAAAGTGCTGGGATTACAGGCATGAGCTACCACGCCCGG 

CCTAAl I I I IMCTCTTTATTCCGTGTCTTTAAAACATCCATTCATCTAATTTTTAACTG 

NTTATTCTGNGTCTCTAAAGGATTCATTCATTCACCTMTTTTTCAAATACCGTTCTCAA 

GTCCTGGTTATAGACMGGCAGTTATTGGAAAGCACACTATGGCAGGATTTATGGAGTGT 
ACCTT 

Sequence 2535 

CCGCGGTGGCGGCCGAGGTACGCGGGGAGTGAAACTTTTACCAAAGACTGCTGTCTACTC 

TGAGG GCTGCTACCTGTGAGGCTTCAACTGCATAAAAAGACAGCCTTTGTCACCAAGTCT 

TTTTTCCCCATCTCTCACCCATAACCTGTCACCATGCTCAAGCCTCAATTTCTTTCTATA 

AAATATAAAAACTTTGTCATTTGGCTCTTCTTTCAGTCTCATATTCATAGACCTCTCGTG 

TCCATGTGCATATTGATAMTCTGTATCCCTTTTCTCCTGTTAATCTGTCTATTGTCAAT 

TCATTTCAGCAGACTCAGACTTCAACATTCGGGGGAAAATTTGAAGTTCTCTACAGTTTT 

GGTATCTTTGGCAGGATTGTAAGCAAATCACTCTGCCCACACCCGAGGCTGCAGCCAGAA 

ACCACGGAAACTGACAGACACCAGCNNNNNANAAANNNNNNNNNNNAAAGTACCTGCCCG 

GGCGGCCGCTCGACTTGAGCAGGCTACTGAACCATGAGCCAAATAAACCTTTT 

Sequence 2536 

CCGCGGTGGCGGCCCGAGGTACCACTATAATCTCTAATACTTACTCAGAATTACTGTGTA 

TTTACTTMTTTCTTATTATGTGCCTTATTATGTGCTTMGATACAATAGGTTAGAG^ 

AATCTAAATATCTTGAAAGCTATATTGTGGGCTTGGTAAGCATTTTGTTTTTTCT^ 

TGT TTTGG TAAGGATTTAAAA 1 Mill j CATTGCAATTTTAAGTGGTTTTCAATAAGTAA 

TAGTTTTTATCAAATTTTTGGTGCTTGGTGCAGAGACGGTGTGGGGAAGGGTGAATGGTT 

TTGGGAATAATTCAGTGCACACCTGTAGGCCTCTTTACATTGTGACTGATAGGGGTTATT 

GCATATCAATTTGGGGCTGTAGAGTGCAATCTCAAGTTTCATCTTTTTCACCCATCANAA 

TTTGTCTCAGGATTACTTGGTTTTTTCTCAGTCCTCAAGCCGAGAACTTGCTT 

Sequence 2537 

GCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGAGGTCCGGAGAGACGCAC 

AGCTAAGATGTCAGGACATCCTGGAAGCTGGGAAAGGTAAATCTTGCTACTGCTCACTCT 

TTGGGTCCACGCTGCTTTTAAGAGCTGTAACACTCACCGTGAAGATCTGCAGCTTCGCTC 

CTGAAGCCAGCGACACCACGAGCCCACCAGAAGGAAGAAACTCCGAACACATCTGAACAT 

CAGAAGGGACAGACTCCAGACGCGCCACCTTAAGAGCTGTAACACTCACTGCGAGGGTCC 

G CGG CTTCATTCTTGAAGTCAGTGAGACCAAGAACCCACCAATTCCAGACACA 

Sequence 2538 

CCGGGCAGGTACTCAGGGATAATAAAAATGCAAACAAAGTTTTATGCACATAATACCCAT 
CACAGGGTTACTTATAACAGCAAGAATTAGGAGACCAAAAAAAAGGTAGACAATATAAGA 
GG AATGG TTGGGAAGTCATAATTGGCCATACAATGGATTTTATGAAGCCATTAATAAGGA 
AG ATTTTAATGTAAT AG AAAATAC ATG AG ATAAATTAAGTAAAAAAAG CGTG ACACAAAA 
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TTGCACGTAAAACCTCACCCATCTTAAAAGGATGCTTAATGATACTTTTCACTTTATGCT 

T TCATAATATCTAAATGATC TACAATTAACCTACATTTATTCTCTAATGAGTCAAAAAGG 

CTTTTTTTTTTTTTTTTTn I GAGACAGAGTCTTGCTCTGTCATCCAGGCTGGAGTGCAG 

TGGTGCGATCTCGGCTCACTGCAAGCTCAGCCTCCCGGGGTCACGCCATTCTCCTACTTC 

AACCTCCCAAGTAGCTGCTTTGAACCCGGGAGGCANAANGTTGCAGTGAGCTGAGANCAT 

GCCACTGCACTGCACTGCANCCTGGGAGACAGATTGGGACCCCCTCTTAAAAAAAAAAGA 

AANANGA 

Sequence 2539 

CCGCGGTGGCGGCCGAGGTACGCGGGGGTAGATGGAAGGAAGAACTTGTGTGCTTAGACC 

TGACGCTGGGAGGAGATGCTGCCACCTAGGTTACTTGTAGGACCCTATACGGCAACCTCC 

TTTGCCAGGMCTATTTATAMCATCCTGCAGGAAAATGTCAGCAATGTGATGGTGTTTG 

GAGGTGGAGCCTTCAGAAGGTAA7TAATGCCCTTGTAAGAAGAGGCCAGAGAGC1TGCGC 

ACCTTCTTCCTGCCATGTGAGGAGCCAAGAAGCCGGCTGTCTGCAACCGGCAAGAGGACC 

CTCACTAGAAGCTAGCCATACTGGCATCCTCATCTTGGCTTTCCAACTTCCAGAACTGTG 

AGAAGTATATGCTTGTGGTTTMGTCAATGGTCTATGGTAATTTTnTATAGCAGTCCCA 

GCCAAGACAGTGCCTCATTTAC 

Sequence 2540 

CCCGCGGTGGCCGGCCGCCCGGGCAGGTAC 1 HI 111 1 I I 1 U I i il l I I 1 M I Ml U N 
GATGTTATTTTATTGGGTCGTAAATAAGGAGTAGGTAAAACATTTTGNGCTTTAATAAGT 
GAGGC NCAGCTTAAAAGTTTTAAAC AGC AATA 
Sequence 2541 

CTTAGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCTGCTGAAAGATTATTT 
CTMCAGGCTTGTAGAGAAACGTCGGTTCATGTNAATTANAAATTATGGGGCCACTTTGC 
C ATTCTTC AC ACCTGCAATGMC AGGTGTTTATCTGN NG N NCTG ACTTATCTCTTGAACT 
CCATTTGCATGGTATNGTGGGATGCAAGCTGATGCCCTGTCCAGATCT 
Sequence 2542 

TCGACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGTCACACTCATTTACCCG 
GGGACAGGGAGAGGCTCTTCTGCGTGTAGTGGTTGTGCAGAGCCTCATGCATCACGGAGC 
ATGAGAAGCCCGCGTACCT 
Sequence 2543 

ATACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAMTTAATTTAACAC 

ATCTTTTGATMTCTCATCCTTGGTGTTGGAAAAGACGGGAAAATCCAAAAGTGTCTATT 

TTGTGCCCAAATGCTCAAGTTAATACTCGAGGGCCTTGATCTCTGGCCTCTTGCCATCCC 

TCTCCCCATTATTTCTGAAGAAAGTANCCCATTTTTTGCAGATTCCGGAGGGATGGGGAG 

CCACGGC 

Sequence 2544 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACNATATACAAAGACTNTGAG 

CTGNNTGCCTCCGATGGTTTCCAGTATTGGCCCGTTGTAAAGCTCATTAAGGCCAACTTT 

NACTTTNANTATGTGATTCTGCAGAATTAANTTAAGGAGGCGCTGATCCATGCTGAGAGT 

ATCATN AGAAAANGGCATTMTCC ACAAGGTGCC AANCAAAAGTTGTMTTTN NTTN CAT 

CNTGGCTCTCANGAAGCAANATGCCAANGCNTTAATNTGGGGNACACCAAAGAATCCGTT 

G AAAGGG N AGGTTTGCTTG 

Sequence 2545 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGCACCTTGTGGG 
GCATGTTTGTATTTATTGCTTCATGGCCGACTGGAATCCTGAGTCCTGGGAAGCTGGCAC 
TGCGGGGATCTTGCCCGGTGTCCTGGTCCTCTTGCTTCCGTC 
Sequence 2546 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACI rTTTTTTTTTTTTTTTTTI 
mm I' IGGGTTTTGATTATTTGTGGTGCTAANAAGAAGGGAACATGGTTGGAGGCCCA 
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NTGAGAGAAACAGTGCTTTGAATCAAAGAGCAGAATGATAGAAACTGACTTCANAGCAAC 

TTCTTGGCAGCAGNATC CAAT TTGGAAGTTGAAGGTCTTGTCCTGGAGCCAGATGCTAAC 

CNAAACACAGCAAATGCTTTTCCTAAGGCACAATAGTCTTTTCANTGAGCTCAGGAACCC 

TNTNTATGCCANATCCTCGTNTGAACCTTCTTGCTCCTTCTGTGCNTGAAGATGCCCACT 

CCACAGANCCCGCNGTACCTGCCCNGGCnnnnnnnnnnnnnnnnnnnnnnnnnnnnCNGG 

CTGCAAGGAANTCTATATTAANCTTAATTCGATACCGTTCGACCTNNGANGNGGGGGGCC 
CC 

Sequence 2547 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGGCGGGAGGCTGACGAGAGCCGGGA 

GGCGTTAGCGAAGGAAGAGAAAAACCGAAGACGAAGCCACTACAGCCCCCGCGTACCT 
Sequence 2548 

CCGCGGTGGCGG CCGAGGTA C I I I 1 I 1 I I I I I I 1 I I U t 1 1 I I M I M I CGGGATGAAGG 

TAAACCAACGlTTATTTTTTTTCCAAAACAGAGTCAGTTTGTGCTGGCCATCCAGGAACT 

CACTMCACTACCCCTCT TAGTTT CATCATTAATCTGTTACTGAACTTCCACCTCCAAAA 

NACTTGNATCCTTTTCTTATTTTTGGAGACCGTNTTGCTNTGTCACCCAGGCTGGAGTGC 

AGTGGTGCNATCTTGGCTTACTACAGCCTNAACCTNTCGGGCTCAAGNGATTCTCCTACC 

TNAGTCTCCANAGTAGTTGGGACTACAGGGGTGCACCACCACGCCCAGCTAATTTCTGAA 

TTCTCTGTANAGATGGGATCTTGCTATTGTTTCCCAGG 

Sequence 2549 

CCGCGGTGGCGGCCGAGGTACGCGGGGACTACTTCCACTAAGACTATAATTACTACTGAT 

GGTTATGATGATGTCCCAAACACGCCATCCTAACACCGGCTTCTTACCAGTTTGAAAACT 

TTTAAGGCATTGTTTGACTTCTCTTTMTCCTAGATACATTCTGCTTAAMGTCCT^ 

TATTTTCTTTAAAAGCGTAACATTATTATTTATCCCACAGTTACTACCCTTGCTCTTTAC 

CAGTT^CTTGTGACTGMTTACTGMGCATTTTCTTAGCGGTTTCC 

CTCTTTTTGTAATATACTCAGTATTCTTCTGCCAAATGAATCCCTCCCATCTCTTCCCTA 

TGATATCTMCTTAAGCTCTGACAATTTGAGTGTGGGAGTAGTTAGCAGTGATTCTGGTG 

AGGTAAGCAGGGACCAAGATCAtAAAGGACCTTAAGT 

Sequence 2550 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCAGCGGCCGCCCGGGCAGGTACGCGG 

GGGCCACCTAGGTTACTTGTAGGACCCTATACGGCAACCTCCTTTGCCAGGAACTATTTA 

TAAACATCCTGCAGGAAAATGAGTCTATATGTCAGAATACACATTTCCCACCTTGCCCAA 

CAGTAGAAAAACATAAGAAGAGAAAMCATTTAAAAAATGACMGGGMGTTAATGGAAG 

TCAGCAATGTGATGGTGTTTGGAGGTGGAGCCTTCAGAAGGTAATTAATGCCCTTGTAAG 

AAGAGGCCAGAGAGCTTGCGCACCTTCTTCCTGCCATGTGAGGAGCCAAGAAGCCGGCTG 

TCTGCAACCTGCAAGAGGACCCTCACTAGAAGCTAGCCATACTGGCATCCTCATCTTGGC 

T TTCCA ACTTCCAGMCJGTGAGAAGCATATGTTTGTGGTTTAGTCAATGGTCTATGGTA 
ATTTTTTTTATAGCA 

Sequence 2551 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGCTTCTCATGCTC 

CGTGATGCATGAGGCTCTGCACAACCACTACACGCAGAAGAGCCTCTCCCTGTCCCCGGG 
TAAATGAGT 

Sequence 2552 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGATGTCTAGTGATGAGTTTGCTAATACAATGC 

CAGTCAGGCCACCTACGGTGAAAAGAAAGATGAATCCTAGGGCTCANAGCACTGCAGCAG 

ATCATTTCACCCGCGTACAGTTTAGGGGATCCTTTCTAATGACAGGAAGGCACTGCTTTC 

CTCAACACTGTGATCTGACCTGTGACAAGTCTGTACCT 

Sequence 2553 

CCGCGGTGGCGGCCGCCCGGGCAGGTAC I I I H H M ( I l l n t I U I I I I T GGGCAGCT 
AAAATCTAGGATGGTGTTAAGTTTCTTCATTTTGTCMTTATATATAAAAAATTAGAAAC 
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AACATGAATCTGCATTTCTTGGATGAGATAGTTAATAACAAACTATTTCTCAATATTTGT 

NTACTAAAAAACTAGTGAAGGTGTTATGTGTTTCAGTATCTTATCTCTTATTTGAACATG 

GGTTTCTGAAAGGAGCCTATNTAATAATATAAATGGTATGTAGTAAATGAGGCACTGTCT 

TGGCTGGGACTGCTATAAAAAAATTACGATAGACCATTGACTAAACCACAAACATATGCT 

TNTCACAGTTCTGGGAAGTTGGAAAGCCAAGATNAGGATGCCAGTATGGCTAGCTTCTAG 

TGAGGGTCCTC 

Sequence 2554 

CTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATGTAGA 

AAATCCAGCAACGAAGAGCCATGGCTCACTGCTAATCACATAACACACACAGGAGTCTAT 

TTATG CATGGMTGTGGC AG ATTTTTTAACAAGAAGTCACAACTTGTTATACAC CAGAGA 

ACTCATACAGGAGAGAAACCCCTATCAATGGCAGTGAGTGTGGAAAAGCCTTTTCACAGA 

AGTCACTGCTCACGGTTCATCAAAGAACTCACTCAGGAGAAAAACCGCATGGGTGCAGCG 

AATGTCAGAMGCTTTTAGTAGGAAGTCACTCCTCATTTTACATCAGAGAATTCATACTG 

GAGAGAAGCCGTATGGATGCAGTGGAATGTGGAAAAGCCTTCAGTAGGGAAGTCGCAGCT 

TAAAAGACATCAGATAACGCACACAATAGGAGAAACCCT 

Sequence 2555 

NTTTTTTTTTTTTTNTTNN N N NN N G NG N NC ATATTAATATANGGCGAATG N AGCTCCACC 

GCGGTGGCAGNGGCCGCCCGGGCAGGTNCAGAGGACACACATTGTANACAGGCCTGTGTC 

ATGTTTCCTTACAGTCG I I I i I I ACAGAGAAAAGGGGCATTG I I i I I I CACTGCTTTCTC 

AACANTTCCTTGTGAATAMTGAAACATTTCGGAACTCCCTNGNTGNGCAAANAGCCCTT 

CNACTTTTGNTTNNTTTGCCGGGNTAGCCCNGGGGAACCCATTGTTGGTTTGGGTGGAAA 

TTCGNTGNTTTNNCCTTGGTCGNGGGGGGGCCCCAACNTTTNNACNTTTCAAAAAAATNN 

GNACNAANCACCNTGGGGAANGGGGGCACTTAGNTNTTTCGCTTATCCCCCTTTTAANNC 

CAACCTTTCNNCTCTTTTCNACACCNCAAGTCTCCCTCGGNACCTTNTCCNTTATNGNNN 

CCCNTTCCACCTNNNGCGCCCCNAAAACTCCAAGGNNANCCAACCGCCTTGCAAAAA 

Sequence 2556 

TCACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGAACAATGAGG 

TTTCCCACAGCGGAGTCTCCCTGGGGTCTGTTTGGCTCTCGGTAAGGCAGGCCTACACCT 

TTTCCTCTCCTCTATGGAGAGGGGAATATGCATTAAGGTGAAAAGTCACCTTCCAAAAGT 

GAGAAAGGGATTCGATTGCTGCTTTCAGGACTGTNGGMTTATTTGGAATGTTTTACAAA 

TGCCCGCGTACCTGCCCG 

Sequence 2557 

ATACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACGCGGGGTAGATGGAA 

GGAAGAACTTGTGTGCTTAGACCTGACGCTGGGAGGAGATGCTGCCACCTAGGTTACTTG 

TAGGACCCTATACGGCAACCTCCCTTGCCAGGAACTATTTATAAACATCCTGCAGGAAAA 

TGCAGTGAAGTAAAAAGAAGACAGGGACATCCCAGAAGGTTATGCAAAACATCAAGAGAA 

GATGAGAGGAGTCTATATGTCAGAATACACATTTCCCACCTTGCCCAACAGTNGAAAAAA 

AAAAAAAAAAAAAAAAAAGTACCTGCCCG 

Sequence 2558 

TACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACGCGGGGTAGATGGAA 

GGAAGAACTTGTGTGCTTAGACCTGACGCTGGGAGGAGATGCTGCCACCTAGGTTACTTG 

TAGGACCCTATACGGCAACCTCCCTTGCCAGGAACTATTTATAAACATCCTGCAGGAAAA 

TGCAGTGAAATAGAAANANAACAGGGGACATCCCAGAAGGTTATGCAAAACATCAAGAGA 

AGATGAGAGGAGTCTATATGTCAGAATACACATTTCCCACCTTGCCCAACAGTAGAAAAA 

AAAAAAAAAAAAAAAAAAAGTTCCTGCCCG 

Sequence 2559 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGGCCCTGGGCCAATGCGTC 

CTCGCTGGAGCCTTTGCCTCCTTCTACTGGGCCTTCCACAAGCCCCAGGACATCCCTACC 

TTCCCCTTAATCTCTGCCTTCATCCGCACACTCCGTTACCACACTGGGTCATTGGCATTT 
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GGAGCCCTTATCCTGACCCTTGTGCAGATAGCCCGGGTCATCTTGGAGTATATTGACCAC 

AAGCTCAGAGGAGTGCAGAACCCTGTAGCCCGCTGCATCATGTGCTGTTTCAAGTGCTGC 

CTCTGGTGTCTGGAAAAATTTATCAAGTTCCTAAACCGCAATGCATACATCATGATCGCC 
ATCTA 

Sequence 2560 

GG CGAATTGGAGCTCCCCGCGGTGGCG GCCGAGGTAC I 1 M I 11 I I 1 I I 11 1 1 M 1 I 11 N 

GG1 I 1 i >■ m m u i h i n M 1 | 1 | i M I I M 11 11 11 1 i I 1 I 1 li 1TNGCNCGGNGTT 

TTlTTAANGGAMNANAGCCNCCNACCTNAAAANCNCCCCCCCCTTGCCCNGNACCAAAA 

AAGGAAAAGGGGNAANCNAAACCCCCNCTNGGCCCAGGGGGGGGNAGGGGNTNAAAGGAA 

AACCNGGGCCCGGNACNCNGGCGGGGGNGGGGGGCATCNGCNNGAAAANCNGGGCCTCCA 

NCCCCNAACCTNGTGGGGGANAAAACCNGNCNGGGGGGCCCNCCNANAACCANGGGACCC 

CCNGGGAAAAAAANNAATCCCNAGGGCNAAGNCTNAAAAAAAAAACNGGGGAACCCCCCC 

NAGGGGGGGGGGGCCCCCNGGACCCCCCAANNCTTTTGNGGNTCCCCCCTTAAANGGGGG 
AGGGGGG 

Sequence 2561 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC n rTTTTTTGTTTTTTTTn 

TGAGATGGAGTTrrGCCCTTGTTGTCCAGGCTGGATTGCAATGGCACGATCTTAGCTCAC 

TGCAACCTCTGCCTCCCGGCTTCAAGCGATTCTCCTGCCTCAGCCTCCTGAGTAGCTGGG 

ATTACAGGTGCCCCGCCACCACCGAACCAGCTAAGTTTTTGTATTTTTTAGTAAGAAGAC 

GGGATCTCACTGTGTTAGCCAGGATGGTCTCGATCTCCTGACCTCGTGATCCGCCCGNCT 

CGGCCTCCCAAAGTGCTGGGACTACAGGCGTGATCCACTGCGTCCGGACTTAGTTTTAAT 

ACTATCATTGAATAATTACATACAAAACCTAGATTCTCTGTATATTCCAGATAAAGACAT 

CTAAGACCCGGGAAGTTTAAAGTTCAAAGNTGGTGAACCAGAAAAATT 

Sequence 2562 

CTCCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGATATATAACAATGA 

GGTGCTCCATCAACCACTTTCTGAAGCTCAAAGGAAATCCAAAAGCCTAAAAATTAATCT 

CAATTATGCAGGAGATGCTCTMGAGAAAATACATTGGTTTCAGAACATGCACAAAGAGA 

CCAACGTGAAACACCAGTGGCCAAATNGAAGGGAAGCTGGAACACATGGTNTCAAAACCG 

AACAAGATAATGTGAACAAACACACTGAACAGCAGGAGTCTCTAGATCAGAAATTATTTC 

AACTACAAAGCAAAAATATGTGGCTTCAACAGCAATTAGTTCATGCACATAAGAAAGCTG 

GCCACNAAAAGNCAGGATTACCATTGGTATTTCATTTTTCTTGGGAGAGGGAAAATGCCA 

ACATCATCTTCCTAAAAAGGAGAAAAAATGAGGAGGATATTTTTAATTACCAATAACCAT 

TTTAAAA 

Sequence 2563 

ATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGCTTTCCCCCAGTGCAAAAGACTG 
TT ACTTTAT TATTGTATTC^ 

AAi I I M I ICCTGCGAAGTTTAATGATCCACAAGTGTATATATGAAATTCTCCTCCTTNC 

TTGNCCCCCCTTTCTTTCTTCCCTCTTTCCCCTCCAGACATTCTAGTTTGTGGAGGGTTA 

TTTAAAAAAAACAAAAAAGGMGATGGTCAAGTTTGTAAAATATTTGTTTGTG^ 

CCCCTCCTTACCTGACCCCCTACGAGTTTACAGGTCTGTGGCAATACTCTTAACCATAAG 

AATTGAAATGGTGAAGAAACAAGTATACACTAGAGGCTCTTAAAAGTATTGAAAGACAAT 

ACTGCTGTTATATAGCAAGACATAAACA 

Sequence 2564 

TACTATAGGGCG AATTGGAGCTCNCCGCGGTGGCGGCCGAGGTAC nTTTTTTTTTTTTT 

»i rn \ ii n itttci imm nniii mmiuiiinMiniiiiiinmi 

'"iimiifimiliilll NGGCCCCCCNCCCANNTTNA7TNAAAANGGGNAAAAGG 
NNCNTNTNNAAAAAAAGGGGGGGGGGCNAAGNCNAAAAAANCCCCCAAAAACCCAGGGGC 
NTNTNGGNAANANNAGCNGGCNNAGNGNGGCCNNNANNGGCNAAACCCNAAAAAAAACCA 
GGCCCNNNGGGCCTTNGNAAANCNANCNGGTNANATTNNGGNAAAGGGCCCTTTTCCCNC 
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CCAAACCCNAAAA 
Sequence 2565 

ACTACTT AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC rTTTTTTTTTTTTI 

TTTTTTTGCTGGGTTATGCCCGCCTCTTCACGGGCAGGTCAATTTCACTGGTTAAAAGTA 

AGAGACAGCTGAACCCTCGTGGAGCCATTCATACAGGTCCCTAAGGAACAAGTGATTATG 

CTACCTTTGCACGGTTAGGGTACCCGCGGCCCGCTCTAGAACTAGTG 

Sequence 2566 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATATTCATTGTANG 

ACTTCCTGTGGATGAGCATGAGCTATCATGACCTCTTCTTCTTCCAAGACTGCAGGCTGC 

TGAGAATCAAAGTGGGAGGGCTCTTGTGAGTCTGCTCGTAAATAGCCTTCAGTTCGATCA 

TCTGTATCTACTTTCTCCTCNTTTrGTTGGAAGNTTGAGGATTCATACTTTGGACAACTG 

CTTCTTAATCTCCACATCATCATCATTTTCAGGTTTCTTCTGATTCTTTTTCTTC^ 

TTTAAATTGTTTGATCAATGTGAAGGACATGTTTCAACAAGGAAACCATNAAATACAGGC 

CTCC TAGAGCTGGTTAGACCCTTCCACGTGGGAANTCAAAATAGGCACTTTTCTTCTATG . 

TTTTN G AG AAG AC AG AATG ACTG N AAAAAGGTGGGTC CTCTTTTTTC AT 

Sequence 2567 

CTTAGGGCGAAT TGGAGCTCCCCGCGGTGGCGGCCGAGGTACCACTATGCCTGGCTAATA 

TTTTrATTTTTTTATTTTTGTAGAGACAGGGTCTTGTTATGTTGCCCAGGC 

ACTCCTATCCTCAAGCCATCCTCCCATCTCAGCCACTCAACTTGCTGGAATTACCAGCGG 

AAGCCACTGTGCCCAGCCCCAGACTCCCTTTTACCTAGTATATCATGTCTAATTTCTAAC 

AAAAAATTAAAAAAACATTGTMGAGGAAAAAAAGAAACCCTAAAACAAACAAGCAAACA 

AAAAACCCAAAACAACAAAMGTTTCAAGAAACAGAGCTAGCTGACAGAACTATATTCAG 

ATATGGCAGGTATATTAGAATTATCAGATTGGTAATTTTAAACAACTATGACTAATATGG 

TAAGAGTGATAATGGGAAAAAATAGACAATTTG 

Sequence 2568 

CCGGGCAAGGTACAGTATATGTATTTTTTTAAGTGACCTCCTCTCCTTCCACAGACCCCA 

CATGCCCAAAGGCCTCGGGACTTCCCACCACCTTGCTCCACAGATCCAGCTAGGCCTGAC 

CTGTGCCTCATCCCGTGCCGCTCGGTCTCTGGCTGATCCCGMGGGCTTTTGTCTTTCCT 

CTCGTCAAGTTCTTTTGGGTTGTG I I I M I I G I i I I I I I I AATAACTC AAAAAAAAATAA 

AAGACTTGGAGGAAGGGTGCAAGCTCCCAAGTGCATCTGGGGCACATGTTTCTTGGAAGG 

GACTGTCTCGCCGACACTCGGGATTCCCTCTTGCCTGCATGTTGAGGCTATGGGTGACCG 

TGGTTAGTGGGACAGGCAGTGAGCAGTNGAAAGTGCCTGAGTGCCCGAATTGGTGGGGGA 

AGGCTT 

Sequence 2569 

GCGGCCGCCCGGGCAGGTACGCGGGACAGCGGCTTCCTTGATCCTTGCCACCCGCGACTG 
AACACCGACAGCAGCAGCCTCACCATGAAGTTGCTGATGGTCCTCATGCTGGCGGCCCTC 
TCCCAGCACTGCTACGCAGGCTCTGGCTGCCCCTTANTGGAAGAAATGGTGATTTCCCAA 
N G ACG ANTTCATATTCCC ACAATNTTGG NTCTTAANGACCTTGG AAATTAACC AAAAAGG 
AAAC 

Sequence 2570 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGNCAGGTACACAACAAGCN 

N N GTCTN N C AT AACTG AAAC AG AN G ATTCTG NTTT AG AAN AAN GCCC ATCTG AGCTT ANG 

AGCNTAGAAGGAAAAGAAGAAAATATNAGGAGCTTTGTGCATCrrCTACAATGCCTGCAA 

TTTCANAGCTTTCATCATNGCTTTANGGGAGGANTCTCATANTGAATNACNTTAAACCTT 

TCTCGTCCCAAGATCATCAGNCTANAGTCGGAAAAGAACCACCTGCCTCTGTAGCTGGAA 

N 

Sequence 2571 

CATCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCGAGCATGAACATCT 
GCAGCCTCTTGCAGAATCACCCCAGAAGGGGACTGAATCATGGTCCTCTTGATAGGTATG 
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TTCAGCAGAGTTTCCAGTCCTGAGGTGTATGAGGCCAGCTGGAGCTCATAATCCTTAATT 

CCCGCGTACCTGCCCG 

Sequence 2572 

TCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGTGGCCCACCGTGCAAAGCTTCCA 
CTTGGTGCCTGCGTGGGAACGCACCACCTGCCGTGGAACTTCCTCCTGCGGGGCAGAGGA 
GGGGTCCCCCGCGTACCT 
Sequence 2573 

ACTCACTATAGGGCTTTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGGTTGGTCATTT 
CCTTAACCAAGGATCAANGACAAAAGGCCAGACAGCTCACTGCACTGCCAGGCTGCACGT 
CGGAAGCCGGGCTCCGCGCGACTGCCCCCGCGTACCTGCCCG 
Sequence 2574 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAGCAGGGGTTGCTTTGGTCATTGCTGTGTTGA 

ACAGTGCACCTCCTTTCTTGACTGTATGCACCCATTGTTGATATGACCTCGGGTTCCCTC 

CACAAAAGCCACCTGAAGTGATCTCTTCTTCAAATACATCAAGAGAAACCCCTTCCTGCA 

TTTAGTTTTTGCCAAGTCGACCATCGATAAACTTGCTTAAAAAGCTGGAGGTTAACGGCA 

GTTTCCAGGAACTGTTTCATCACCGCTTGAGNCGGTGCTTTACAAAACTCTCCTCACAGA 

AAGTGATGGGCTCACCAGGTTTGTATCTCAGTGCATCTCTGTAGGATCCAAACAAAGCAG 

CCTGTGCTCTAAGAAAGGCCCTAGCTACTCCATCACCCGTAGCTGTAGACTGCTTCTTCA 

GTTTATTTTTCAAGGCCGAGACCACATCACTTGGTAGGTT 

Sequence 2575 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGAGGGATTTCGGC 
CTGAGAGCGGGCCGAGGAGATTGGCGACGGTGTCGCCCGTGTTTTCGTTGGCGGGTGCCT 
GGGCTGGTGGGAACAGCCGCCCGAAGGAAGCACCATGATTT 
Sequence 2576 

CCGGGCAGGGTCCCCGGNANTTCCANAATTTTNCNTGGCNTCNAAGGTCCTTTGAGGNAG 

AAGGCTCTGGGGCTTCTGCTTGTCCCTTTGGAGGGTGTCTTCTGGGTAGAGGGATGGGAA 

GGAAGGGACCCTTACCCCCCGGCCCTTCTTCCTGACCTTGCCAATAAAAATTTATGGTCC 

CAAGGGGAAAAAAAAAAAAAAAAAAATAATTNGTTCCTNGGCC^ 

ATCCCCCNNGCTTGCAGGAAATTAAANNTNAAGCTTATTNATACCNTTCAAACCTTG 

Sequence 2577 

CCGCGGTGGCGGCCGCCCGGGCAGGTAC I I I I I I I I I I I I (I I I I I I I I I {( I (GTAGGG 

ATGGGGTCTCGATTTGTCTGCCCAGGCTGGTATCAAACTCCTGGGCTCCAGCGATCCTCC 

CGACTCANCCTCCTGAGTAGCTTGGGATTACAGGCACGTGCCCACACACGTGGCATCATA 

CACCCATTTTACAGGTGAGGAAATTGTGGTTCACCGACAGTAAGTGACTTGCCTAGGATC 

ACACAGGTAGTAAATGGTGGAGCCAGTAGTTAAAATCANATGAGTCCCTAAGTCTTCTGT 

TCTCCCTTACTTGTGTTCTGTTTTTTGAATCCCTAANAGCTTTGTGCAATGCCCCTGTC 

GCTGGATGAACCAACAAACACTGCTTTAAAACTCTACCAAAAAGTGAATCCCGCGTACCT 

0 

Sequence 2578 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCCGGGAGGCTCGCATGTGTGGGTGCATGCCGT 

GGCAGGAGAAGGCACGTTTATCCAGAGAGTTGATTCTCCTTAATACCAGGAAACTAGGCA 

ACAGCAGGGAGGGGACAATCATGAGTCCTAGGCAGGAAAGGTCCTCTGGAGAGAGGAAAT 

GGCACACGCGCCCCCGCGTACCT 

Sequence 2579 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC rTTTTTTTTTTTTTm 
TTTTTTN CCTTTTT AAAAAAGTTATTTATTTATTC' I rTTTTTTTTTTTTTI GGNAAGGTT 
GAATGCNCTTTTGGTTTTNGGTCATGTTCGGTTGGNCAAAGATAAAAACTAAGTTT 
GATGAATGCNAAGGAAAAAAATATTTTTCCAAAGTCCCATNGTGAAAATTGTCNCCCATT 
TTTTGGGCTTTTGAGGGGGTNCAGTTGGGGTTGCTTGTCTGTTTCCCGGGTTTGGGGGG 
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Sequence 2580 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC I I 1 M 1 1 1 1 1 I 11 1 1 

TTTTTGCCCCGCAGCTTTTATTAGCCATCGTTACAGTCTCTCGTCCACAAACACTGCCCA 

GCTCCCTTCCCAGGGCACTTTACTGGAGGCAAGTGCAAAACACTGTGGTGCTTTNTATTC 

AAAGAAACAGAAGATGTNGCTGTGGGTGGTANACGGGACAGCCCAGNTTACCCACCAGTT 

TTAC AG N AAGG ATCCCGC AGTG AG CTTTG ATAC AGATATGAGG ATAG ATGACGGTAAAGG 

GGGAAAAGAGTTTAAMTTATCMGTTTCCTC 

GMGAANACTGCTGCTAGCAATTTTCTAGGGACTAAACACACCGATGGGAATATAAATAT 

TTCAAACACAAACCCCTGTTACATTCTTACAGG7TAGTTTC 

Sequence 2581 

CCGGTGGGGCNGGGCCCCACGGTACCCCGGTNTCCCTTCCANATGGGCCGAAAAAAAATA 

TCTTNNTCCCCAANACAAAANCCCNAAAAAT1TTGGGAAAGGAAAGGGGCAAMGGGNG 

CCNNNAAAAAACCCNNGAAAGGGCAATTTTTNGGNAAAGGGCNN I MINI IA AAAATTN 

CCTNGGGAACCCCNTTGGGAAGNAAAAANTGGTTTTTACAGGGGNAAATTGNATTAAACC 

TTNAAAAAATTTTCCTTMGTNGGGTNAAAGAATCTTGGAAGTTACCCTTGCCCCNGGGG 

NCCGGGCCCGGCTTCCTTAAGAAAACCTTAGGTNGGGATTCCCCCCCGGGGGCCTTGCAA 

GGGGNAAATTTCCGMTTAATTCAAAAGGCCTTTMTTCCGMmCCCGGTTCCGAACC 

CTTCAGAAGGGGGGGGGGGGCCCCCCCGGGTTACCCCCAAGCCl I I I 1 1 GGTTTCCCCTT 

TTTTAAGNTGGGMGGGGGTTTTAAAATTTTGGCCGCC^ 

TTCCAATTGGGGTNCAATTAAAGGCCTTGGTTTTTCCCCTTGGTNGGTTGGAAAAAAT^ 

NGGTTTAATTCCCCGGCNTTCAACCAAAATTTTTCCCCACCACCCAAAACCAATTACCCG 

AMGNCCCCGGGGGGAAAGNCCANTTAAAAAAAGGGTTGGTTTAAAAAAGGCCCCTTGGG 

GGGGGGTTGG 

Sequence 2582 

ccgcggtggcggccgcccgggcaggtacgcggggggtccaggtttcagaaggtttgatgc 

tatacattgtgtgaattttgaggttatgatggtggcccagagaaggttgggaggcattag 

tgtagtgttggagggatttgtgtctcacagagcatagatcgtattgtggaatttcattct 

tcacagaagctgtctccctctctgcccagggtgacattccatgggacgacaaggatttca 

gg atgttcttc ctctggactgctctgttctgg ggtgg agtcatgttttacttactgctca 

agagatccgggagXgamtcacttggaaggactttgtcmtaactatctttcaaaaggag 

tagtagacagattgggaagtcgtcaacaagcgtttttgttcgagtgacctttacaccagg 

aaaaactcctgttgatggggtaaatcattttattaa 

Sequence 2583 

GCCGAATTGGAAGCTCCACCCGCGGTGGCGGCCGANGGACTCTGGCGTTGGTAACAATGG 

TTTCCNGGANCTTNGGNTNGTAACNACTGCTTCCNGGGAACTTCTGCGTTGTAACCACTG 

GCTTCCCGGGACTCTGCGTTGTTACCACTGCTTTCCGGGACTCTGCGTTGTTACCACTGC 

TTCCCGGGACTCTGCGTTGTTAACCACTGCTTCCCGGGACTCTGCGTTGTTACCACTGCT 

TCCCGGGACTCTTGCGTTGTTACCACTGCTTTCCCGGGGACTCTGCGTTGTTACCACTGC 

TTCCCGGGACTCTGCGTTTGTTACCACTGCTTTCCCGCGTACTCTGCGTTGGTTACCACT 

GCTTTCCCGCGGTAACCTCTGGCGNTTGTTAACCATTTGCTTCCCGCGTTACCTGCCCCG 

GGCGGCCGGCTCTAAGAAACTAGTGGGAATCCCCCGGGGCTTGCAGGGAATTTCGATAT 

Sequence 2584 

GACTCCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATCCAA 

ATTGGGCTACGAAAAGTGGCAAGTTTGATTGTAGCATCATGTCTTTGAGTGTCCTATTGG 

ACTACAGATTAGAGGATAATAAAGMCATTCATTTGAGGTTTCATTGmGCGGAACTTT 

TCAACGAAATGCTTCAAAAGAGATTTTr4GGTGTCCGTATATACAAATCATTACTGTCTCT 

TCCTGAGAAAGAGGACAAAAAAGAAAAGGATAAAAAAAGCAAAAAAGATGAGAGAAAAGA 

TAAAAAAAAAAAAAAAAAAAAAAAAAAGTCCACCGTTAANAAGTTG 

AGGCAGCANAAGGTTTTGGAGCCAAGGAATGAAATGATGAGGCGTCCTTCAGGTAATGAA 
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CnCAGCTTGCAGTGTGAAAGGGGCAGGGAAGACTGGCCAGCTGTCAAAAACTGGAACAG 
Sequence 2585 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATTTTACAAAATTAAAG 

TCACT TTGTA GGTATATAAAMGTmAAMGTTGGTTTATTMTGAGAGTAGTGNA 

CATTTCTTTTCMGCAGTAATTT/^^ 

ANCTTTNCAGCCTGGGACCAACATNGGNGAAACCNCGTATTCTNCCAAAAATTGAAAAAT 

TAGCNGGGCNTTGGGGGGNTCACNCCCGGNAGCGTGTTTCTTTTGGGNGGGGGAAACCAA 

TGNGTCTAAACNGNNCNGNNGGGCACCNGGGNTATACCNACCTNTGNGGGCCTTTTTCCN 

TAAAAAATCCCNNNAAATTTGGGACCCTNNGCCNNGGGCNGGGCGNTTNTTNAAAACNNN 

NGTGGGGGTCCCCCCCCNGGGCCNGNACGGGAAMTTNTCAATNTTTCCNANCCCNTTTT 

NNGNNTNCCCCCGGGCNNACCCCNTTNANGGGGGGGGG 

Sequence 2586 

TACTTAGGGCG A ATTGGAGCTCCCC GCGGTGGCGGCCGAGGTAC I rTTTTTTTTTTTTTI 
rTTTTTTTTTI iNAI I 1 1 1 I I I I I 1 I 1 I I N \ | \\ \{ | |j || 11 I I CTTTGGGCAACACT 

TTATTGGGAAANATTTACNCNCGGGGACCTGTCNTAGGCCAAGCNANNAAAAAAGGGCCC 

CAGGANCCCTGGGGTCCCNNGGGGGGCTNAAAATGGAACCCATGGGACGGCCNNTNTAAA 
ACTAGNGGATCCCCCG 

Sequence 2587 

GGCGAATCGGACTCCACCTTNGGTGGCGGCCGAGGTACATTTCCTTGTAGACTCTGTTAA 

TCTCCTGCAGCTCCTGN7TTGGTTTCTGGAGCANATGAATCTCAATGAGGAGAGTCCTCG 

TCGGTTCCCAGCCCCTTTATGGAAGCTTTAN N CTC AGAAN CGTC ATACTGAAACAGGCNT 

TTTTCAANAAGGNCCCNAAAAMNCACCCGGTTTTTCCAAGGGTAGGGCCNANAAAAAAA 

GGGCCCNAACTTTNCAANATGCCTTGAAATGCCNAANNGTNTTCCCNTTTTTNAGGGGTC 

CCCCTTCTNCCTNGGGGNTAANNGGCCNAAAAAGGCCAAATTAANTCCCCTGGNACTTCT 

TGGNNGGCCAATTNNGGCCCCCCCNGGCNGGTAANCCCNGGCCCCCGNGGGCCNAAGGNC 

CAGCNTNCNTAANAAAACCNAAAGTNGGGNAATCCCCCCCCGGGGGCCCNTGGCAAGNGN 

AAMTTTATCCNAATTATTCAAAAGGNCNTTTAAANCCNGAATACCCCCGNICCCNAAACC 

CCTTNNGAAGGGGGGGG 

Sequence 2588 

ACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACACTCCT 

CATCATCACTGTTGCTTGAGCACCGAAGAAGAGTGGCCCAGCCTTTTGGGTGCAGCTGTA 

AGGACGTGATGCAGTTTCCGTGGTCACTCTCACCAAGAACTTCAGGGGTTACGACTTCAA 

GACTATTTCGTGGTGAAACTCCTGAACTTTGMGTAGTTCTTCTTGCTCTTAAAATGGGA 

TAGCTGCCTACTTCTAAAGACTTAGTTAAGTCAAGGTCATGCAAAATTAAGAGATATCCA 

GAGGACGTTGAAATCAACATTTTGGAACAATCTGGTGTTAACCTCATTCGCATGAGAAAA 

CGTGTGTGAAAGAATTTCTTATGTGGACACCCATCTTCTGTATACCTGTTAGTGTCCCAA 

ATAATGACATTTCCATCAAATCCTGATGTTACTAAGAGTCTTGTATTAAGTATCA 

Sequence 2589 

AATTGTTGCTCCNNCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGAACATAAATTGATT 

TCTCACTGTT CTGTG GGCTAGATTGGCAAAGATCAAGGCATTGGCATGTTTTATGTCTTA 

TGAGGCCTGCTTTTGACCACTAAGATGGCACTTTGAAAGCTGTGTCCTCACATGGCAGGA 

GGATGGAAGGGGAAGAATGGGTGAATGCAATGTGAAACCTCTTTAATATCATTATTCCCA 

TTCACCAGAGTGGAGCCCTTCTTACTTACCTCCTAAAAATCCTACCTGTTAATACTATAA 

TATTGGTATATAAGTTTTAACATATGAATTTTGGAGAACATAT^ 

TTAATAGTTNCACCAGATGTMTAAATATCAAGCAATATAATTAAATAAATTTCTACCTC 

ATAAGTAAAACTACAGG 

Sequence 2590 

CNGGCGAA I i I I ICTCCTNGCNGCGGCGGCCGAGGTACGAACTTTTCTCCANAGGATANT 
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TAGGTTGCACCNTTGTATTTGTAAAACAGGAGCAAATTTGGACCTTGCCGGGCCAAAGTC 

GTGTCACGTGGAACCTCTTAATCTCAGCATCCGGAGCTCCAGGAAGGGAAAATTTCAAGT 

CAGATAGMTTCTATATATACCATTTCTTTGGMCCTTCAGCCCTCAAGATTCCAACATC 

ATGACCTCAGTTTCAACACAGTTGTCCTTAGTCCTCATGTCACTGCTTTTGGTGCTGCCT 

GTTGTGGMGCAGTAGAAGCCGGTGATGCMTCGCCCTTTTGTTAGGTGTGGTTCTCAGC 

ATTACAGGCATTTGTGCCTGCTTGGGGGTATATGCACGAAAA 

Sequence 2591 

GCGCGGTGGCGGCCGCCCGGGCAGGTACTGTGAGCCAGCACTCCCTGACCTCAGGGTGTG 

TGAGGAGTTGGCACTGTAGAGAGAAACCAAGACTCTTCCTCAATGCCTGCCTTATTCTGA 

GCACCCACCCTTAGCTCTGATAAGGATCCCGCGTACCGTGGCCTACAATGACACACAAAT 

CCCATTAAATAAATTATAAACMGGGTCAATTCAAATTTGAAGTAATGTTTTAGTM 

GAGATTAGAAGACAACAGGCATAGCAAATGACATAAGCTACCGATTAACTAATCGGAACA 

TGTAAAACAGTTACAAAAATAAACGAACTCTCCTCTTGTCCTACAATGAAAGCCCTCATG 

TGCAGTAGAGATGCAGTTTCATCAAAGAACAAACATCCTTGCAAATGGGGTGTGACCNCG 

Sequence 2592 

GGGCGAANTGGAGCTCCTTTGCGGTGGCGGCCGAGGTACTTGCACATCCTTTTAGTTATC 
TTTTTTTATATAAGTAAAGNTCACACACATAMTTCACT^ 

TTGGCAATGAATACAGTGCCTGTTGAGGCCTGAAAGAGAGAGGAAATCAAGAACAGTGTG 

GGAGMGAAAGGCAATCCTTCGTTGAAAAGCTTTTCTCTTAACAATCCTTAGTGCCTGAA 

AAATAAGCCATTTTGATAACAAGAGTATAAAAAGCCTTATGATCTGCAAAGTAAATCGTA 

TCTGGTTTAAATTTCATCCCAAGTCTCAGGTTATGGCTGAGGAGCTATACTAAGGATCTT 

TGNAGCNCATTTCTTTGAATTTTTATAACTAAGACAAGAAAA I I I I I I I ATCAGATCT AG 

GCAGAAATACTGATC 

Sequence 2593 

CCGCGGTGGCGGCCGAGGTACl I I I T I II I M M II I I M M I M I I l I I I NC N I M I I 

m 1 1 1 1 1 1 1 1 1 1 m ii n 1 1 1 1 1 1 1 1 1 1 n n 1 1 1 1 1 1 n 1 1 m 1 1 it n 1 1 n n 1 1 1 1 
rmrn inanngggnaaaanccnncnnnaatannntcccnnncaaaaaaaaanattaan 

AAAANGGGNANANAAAAAAANTACAAANNNAGAAAAACTANAAAAAAAAAANGGGNAAAA 
ANATTNNTTTTTTTCCCCCAAAAMNNNTN^ 

ACANANNGAAAAATCNAAAAAAAGNANNGNAAAAAAAAAATTAAAAAA 
Sequence 2594 

GTGGCNGGCCCGAAAGTAAC I I I I I I I I NTTNNNAAAANAAAAANAAAAAACANANAGAA 

NANAAAANAAAAAAAANAAANAAAACAAANAAAAANAAAACANAAAAAANCANNANAAAA 

ANNAAANATANAAAAAACCAAAAAAAAANAAAAAAANAAANAANNANAAAAANNAAANAA 

AAANATAAAAAAAAAAAAAAAAAANAGAAAAAAANCAAAAANAAAAAAAANAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAANATAAAAAAAAAAAAA 

Sequence 2595 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCGGCAGGTACI I I I I ! 1 1 I I II I I I I I I 
TTTTTNNNGACTGAATGCAGCTTTATTAAAGATACTTTCCATAAACAATCATGGTATTNC 
AGGCNGGACANGGTGCAGACAANTGTAAACAGTATACAACAACTTTC 
Sequence 2596 

CGAGGTACATGAAAAAATTCATACTGGAGAGAAACCCTATGAGAACCCTAACCCTAACGC 

TTCAGTTGTCCCAGTTCTTTCATGAGCATGAAAGGAGTCACATAGAGAAACCCCATGAAA 

GTAAGAMTTTGGGAMGCCTTCAGTCCTTTCTGTTTCTTTCAACTACGTGAAAGGATTC 

ACGGTGGAGAAAGACCCTGTAAGATAATTGGCTTTAAATTACGAGAGACTTGTGATAGGA 

CAGTAAAACCTAGAGTTGGAGTTGGATCTCTGGATTGTGTTATGTCAGTGTTGGTAGGTT 

AGGAACTAGATTTCCCAGAATCCATTCCATTTGTGATTCCATGATACAATTCACCAGTAA 

CCTATCTTTACATGAGATTCNGGAAGTAAGTTAAGAAAGGCATTAGTCATGGTTTGGAAG 
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CACCATACAGGGAGACAGCTGTGTGAATACAGGCTGTATGGACACTTGCTTCCATCCCAT 

TTTCCTGCTTCTTTGGG1TGGCAATCAAGAGTATCCTCAAAACGACTTGACTTTAATTTT 

CT 

Sequence 2597 

CGCGCAGTCCGCCTGACTATACTACCTTTAGTAAATAAACCNCCCTTTCNGGATGCCACT 

ATCTCCGATGGCACTATACCCTGTCTACCTTCTATACCACTAAATTAAACCAACTCCCCT 

CCTCCTCCCAAGCNATAAAATAAAAATATACAAACCCTGGGCCAAAATGANNAAAATCTG 

TCNTTATTATTGCCCCACAATCCTAGCCTCNCGCCGAGACCTCGGCGNTCNAAACTAAGT 

GATNCCCGGCTGAGNATCGTATAAGCTNATCGATCCGCGCCTCGANGGGGNCCGNACCC 

Sequence 2598 

CCGCGGTGGCGGCCGAGGTAC I I I I I I I I I I I I I M I I M I 1 I I 1 I I I 1 1 GCTTCCTTGT 

CTTTGGTTTCATTGTTTTTTCTTTATCCCATTTCCCCCTCTAGTGGTCTAGM 

ATCCTMTACTGCGTTTGTAATACTTGCTCTTCACATATTTGTATGCAGCCAGTCACAAG 

TGGAATGTGACTAGAATCCTTGTTACTCAAGAGGCTGAGGTTGAAGGATCGCTTGAGCTC 

MNAATTTGAGTCCAACCTGGGCAACATGGNGAAACCCCATNTCTAAAAGAAAATNCNCN 

CAAAACTAAGCTTGATTACTTTTATATTTTCTCCAGAAACAAAAAGAAATCNCNCTNTGC 

TGNCATACTCACCCTNCCCACACTGNTCCTTCAGCAACATATACCATTGTTTTTAG 

Sequence 2599 

NGGCGGCCGCCCGGGCAGGTACTGGCTTCCTTGTTAATCCTTTATCCCCTTCAGGTGTTA 

TTCAMTGTCATCTTCTCAAGAAGGCCTTGTGTGATTAGTTTATTGAAATTATTCCACTC 

AACTCCTATGTCTAACACTATTCAAGTCTCCTTTCCTGCTTCAATTTTCCTNCTGGGCTT 

C NTACCACCATCTNAACTTGGGAAACCATGTMTATTGTTG AC ATTATTTTGGGCTTAG N 

ATTAATTTTCACCCCATTTNAGCCATNGTAAAACCTCCACGAAAGGCCTTTTATTTT 

GCAAGTATTTTNCTGCCACCCTAGAACAGTATTTGGGTTATATAAGCAAGGTCACTTTAA 

AATATTTTGTNTGAATGGAAGTGATGTAAGAAGGAAGGGTTCCAAAAAAAGCCCACTTGG 

GAAGAATGGTCATGGGCTTTGACCCG 

Sequence 2600 

CAGGTACAGTGGATTTTTCATTTGCAAAGACGTTAAGCCCTCCAAATGTGCAAATCATGA 

AGTCAGTNGTTGTTCCAGCAAGGTTTGCCCAGCGGTAAAAAACAAGATAAAACTAATGCA 

CTAGCTGAAACCAGGTGGGGAGACCATGTGTGGTAGTGCTTGGGGGTGGAGGGAAACTAT 

TTCTGAAATGAGGACTTAAAGTATAATACCAGCTTCACTGCCTGTTCACATGAGAAACCA 

AAGCTTCAATTTAACTGCANGCAATAGGAGTTTCACACTGTCAGCACCAACTGTCTAANA 

TNCAAAACTAGTATCTAAATGTGTAGGATCAAAACCAAAAATCTGGAGGGATCTAGTTAA 

ACTTCAATATGCATGACCCCAGATTCCCN 

Sequence 2601 

AGGGCNAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCTTCTTTCTGGCAG 

TGAATGGTCTGTATTCCTCTAGTGATGATGTGATCGAATTAACTCCATCAAATTTCAACC 

GAGAAGJTATTCAGAGTGATAGTTTGTGGCTTGTAGAATTCTATGCTCCATGGTGTGGTC 

ACTGTCAAAGANTNACCCCCCNNANTTGGAMGAAAAGCCAGGCAACCTGCATTTAAAAG 

ATGTTGTCAAAGTTGGTNGCAG7TGATGCAGATAAGCATCATTCCCTAGGAGGTCAGTAT 

GGTGTTCAGGGATTTCCTACCATTMGATTTTTGGATCCAACAAGAAACAGGACCANGAA 

GNATTACCAAGGTTGGGCAAGAACTGGTGAANCCATTGTNAGATGCCTGCGCCTGAGTGC 

CTCTGCGCCAG 

Sequence 2602 

CCGCGGTGGCGGCCGCCCGGGCAGGGTACAATACAATCTAGATGACGGTGCAGACTAAGT 
CAAGAACTAAAGTTGTGCAGTAACCCGAGTTAAGGCATGAATGCAGACACACACATGCAC 
ACACACAGCACCCATGCTATCAAGACACAGGAl Mil I CAGTTGCCTCATGAGAGGGCAA 
CCTGGGCTTGGCAGTTAATCAGAACTGCTGAGCATTCCAGAAAATGCCCCCCACGACTTT 
ATGCTAACAGCTGTGTGTATGTTTTAATCAAAAAATTAAAGAAGAAAAAAAAAACCTAAA 
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AAACAAAGAAAAAAACAAACAAAAAATCACCAAAAACCTAGAAACCCCTTAATCTCTTAC 
AATGGCTCTTGAGCATGGAACTCATGTAGCAGCATCAATGGCTGGCTCTTTMCMTTTG 
GAAATAAAAGGGTTGTTTTACTATGTAT7TCTTT 
Sequence 2603 

TGGGAAACCGGAGCTCCCTTTNGGTGGCNGCCNCCCGTACGCGGGATGTGGCAGTGAGTG 

GAGGACTGTGGATGAGTGACCCCNTTAAAAGTAAGATATTGTCACATAAGCTCTTGAGAC 

AGTTMCTCATTATCTGCAGTTTCTTCCATTCCACAATAGGCTACTTATTTTTAATGCTG 

TATTTACATTGAGNAMNACCTCATGTTTNMNTTTTNATAAAGAAAAAAANTATTNAN^ 

TNNTTNTANTTCTACCAGAAAGGGTNGGACNATNTTCNAAAAAGGGCACCGGGTGAAATT 

ACCTNATGGTNCANATGNACCNGCNACTGC7TTNGCCCATATGCCCAAGGGANANTACCA 

NTTGGGNTCANAAAANANCCATGAMNTGGGNGGAAAAAAACTNATTCACTTTCANNAAG 

GATACCTTGGGCCCATTCAANTCAAAGGGTACCTTGCCCGG 

Sequence 2604 

GMTTGGAGCTCCCCGCNGNGGCTTTGTNTMTGATAGGAATTATTCCTTTMTTTAAAA 
AATACTTTCAATTTTTTAACAGTTTTTTCGTTACTTGNCCA^ 
GAATGGATCCTGCCATTGCTTTGGTGCTGGCCATTATCA 
Sequence 2605 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGAACAATCAACAGTT 

TCTGATGTTCCGAGGGACCTGGAAGTTGTTGCTGCGACCCCCACCAGCCTACTGATCAGC 

TGGGATGCTCCTGCTGTCACAGTGAGATATTACAGGATCACTTACGGAGAAACAGGAGGA 

AATTNGCCCTGGNCAAGGNANTTTANNTTGGGNCCCNGGGAGCAAAGTTCTTACAGCTTN 

CCCANTNAAGCTGGGNCCTTTANNACCTTGGGAGNTTTGAATTTATTACC 

Sequence 2606 

TGGTNAATCCNNTTCACAATTTCCACACCAACCATACCGAAGCCGGGGAAGCATAAAANG 
TGTAAnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 
nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 
nnnnnnnnnnAAGGGCGGTTTTGNCGTATTGGGGCCGCTCTTCCCGCTTTCCTCG 
Sequence 2607 

CNAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACGCGGGGGCATGCAGGCGTGCACGTT 

TCCCGGCTCTCCGCnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

n nnn nn nnAGGGTTNAATTTGNCCCNCTTGGCCGTTMTTC ATGGGTC ATMGCTTGTTT 

TCCTGTGTTGAAATTGGTTATCCCGCNTCACAAATTTCCAACACANACATTACCGAAGCC 

CGGNGAAGCATTAAAAGTGTAAAAAGCCCTGGGGGTGCCTTAAATGGAAGTGGAAGCTAA 

CTTCACATTAATTTGCGTTTGCGCTCACTTGCCCGCTTTCCAGTTCGGGGAAAACCCTGT 

NCGTGCCANGCTTGCATTTAA 

Sequence 2608 

CGGCCGCCCGGGCAGGTACACTTATNGTTGAGAGCCANGTCTCCCTTATCATTGGTGAAT 

GAGAATGAGCTACTGAAAACAAAAAGAGGGTCTTCTACTTCAGCCTGTACCCCTAANATT 

TTATTATCAG NAAAGCAG NAGATTNACC N GTCCTTTACTTNTTTTACCNCCCTTTTAATN 

NGGGTAAAAANAAAAANGGTAMTGTTTTCCCCTTATNTAAAANMNATTTCCCTTTCCC 

CNGGTTTTCCTAACCCCCTTGGNNAAAAAACN I I I I I I AATGGGCCTTTTTNTTCCCTNG 

GTTTTNACCCANTAAAANTCCAATGGGGCCAATGGGGGGGGGCCTNAATGNAAACNCCTT 

AAAAAACCNTCAATTTTTNTTTGCAAAAATTTC^ 

NGGGGCCTTANTAAAATNTTTTCANTNCNNTGNGCCNTTAATTTTTGGGAACCANNNCAA 

NANTTTTGGANTAAAATAATTGGGNTTGGGNKnTCNCANATTTTTTCTCCATGNAAGGAT 

CANNNACNNNTTGGNANTAAACNCATAACANTATCTTCTNTGGGGTANTCCCCCACNANT 

NTTATTTCCAAAAATTGG 

Sequence 2609 
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CTTAGGGCAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGGCTGC 

ATGCTGTGGAGGAGGGTTTCAGAGGAGAGAGGTCGGAGAGCAGAGGCCTGAGAAGCCAGA 

GGCAGGTGGAGAGAGGGTGGAAAGTGAGCAGCGGGCTGGGCTGGANCCGCACACCGCTNT 

CCTCCCATGNT AAANN NGCCCCTTTAGAAAAATTTCACAAAGTCCCCCTTCCNAANTAAA 

NNNANATATATATTTTATAAGTTACNCTNNGGCCGCTCTAGAAACTATGGNGGGNANTCC 

CCCGGGGCNTGCAGGGNAAT 

Sequence 2610 

GCAAATTGGAGCTCCCCGCGGTGGCGGCCGGGCAAAGCGATCGGAACCCGGGAATGAAAT 

CGTCCCAGGCGCGGAGTGGAGCGATATTAACGTCCATGTTCTCAGCTTTGCGTTCTTCGC 

CTCCAGAATCTGGCGGCGGCGGCGGCGGCAACCGATCAGCTGAGCTGAGTTGGAGAGTTG 

ACAACCCCCGNGNTNCCNTTTGGCCNGTTCTTAGAAACTAANTNGGATCCCCCCGGGCTG 

CANGGAATTTCNATANTCAAGCTTATCCGATACCCGTnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnAGTAAATCATTGGT 

CATAAGANTGATTTTCCTGTGTGAAAAANTTGNTTAATTCCGCCTTCACCAAATTCCCAC 
CAC 

Sequence 2611 

TCCCCGCGGTGGCGGCCGAGGTACGCGGGTGTTCCTTTTTGTCCAAAGTCTATTTTTATT 
CCTTGATATTTTTCI I I I I I I M I II I rGNGGATGGGGACTTGTGAATTTTTCTAAAGGT 
GCTATTTAA 
Sequence 2612 

GAGCTCCCCGCGGTGGCGGCCGTTTGAGAAGCCAGCGCTCACCCACCCGGGGTCTCTGTG 

CATTGACCTTTGGGTGCTGACTTGGAGAAAAGCACAAACACGACCAGTCCCATCCTGGCT 

CCCGTGGGGCTTCTTCTATCTACGCATTGTATCGACTGCATTAGTTGGACTAAGAAGGAN 

GCNTCANNTTAAAAGGGNGGAGACAAAATGCCTGAACTGGTCTAAAGCCAANGAAATGGG 

CCCCAAAGCTGGGCAAGNAAAAAGCACCACTGCATACATAGGGATACAGAAGGGCAGAGC 

TTCTGCCTGCGGATCTGCAACATTACATNTTGTTTTTGCCTGCAAACTATCAAG 

Sequence 2613 

CCGCGGTGGCGGCCGTGGGACCTGCAATNACTACGCAAACNCTTACAGCTTTTGGCTCGC 

CACCATAGAGAGGAGCGAGATGTTCAAGAAGCCTACGCCGTCCACCTTGAAGGCAGGGGA 

GCTGCNCACGCACGTCACNCCGCTGCCAAGTNTGTATGAGAAGAACATNTNGAAANCCCT 

GGANTNNCTANATGTNAACAACATTGGTGCNTACTTTCTTCTTTCTTTTm 

AGCAAAC N AAACCCTTAANAAATTATAN NCGNTGTGTTACCCTCGGCTCGACTTCTAG N A 

ACCTANGTG NGA 

Sequence 2614 

GAGCTCACCGCGGTGGCATGCGGCCGCCCGGGCATGGTACTCCTGCCAAGGCANGCTCCC 

ACCGCTATGGGCACAAGGAAGTTGTCGTCATCTGCACTGCANNCTTGAGCTGTAGAATNC 

T 

Sequence 2615 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCATTGGTGGCCAATT 

GATTTGATGGTAAGGGAGGGATCG NTNACCTGATNTGTTATATN AAN GANTC AACGGGCT 

GAAC AAGG AACATCAGC ATCCTTTATG AAATGGATCCCCAAC AATN N C AAG AC AN CCGTC 

TGTTACATNNCACCTNNNTNGGCCTNAAAAATGGCAGTCACCTTCATTGGCAATAGCACA 

GCCATCCATGAGCTNTTNAAGCGCATCTCGGATCANTTCACTGCCATGTTCCNCCGGAAG 

GCCTTCCTCCACTGGTACCT 

Sequence 2616 

CGACTACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCCCGCCCGGGCAGGTACGCGGGGT 
CCCAACCAAGCCCTCCAGCAAGGATTCAGAGTGCCCCTCCGGCCTCGCCATGAGGCTCTT 
CCCGTCGCTCCCGGTCCTGGTGGTGGTTCTGTCGATCGTCTTGGAAGGGAGTGGTTTTCA 
NAGACATTTCAGAAAGTGAAGGAGAAACTCAAGATTGACTCATGAGGACCTGAAGGGTGA 



Page 388 of635 



WO 01/51628 



PCTAUS01/00798 



Table 1 



CATCCCAGGAGGGGCCTCTGAAATTTCCCACACCCCAGCGCCTGTGCTGAGGACTCCCTC 
CATGTGGCCCCAGGTGCCACCAATAAAAATCCTACCANAAAAAAAAAAAAAAAAAAAAAA 
AAANGTACCTCGGCCGCTCTANAACTAGGTG 
* Sequence 2617 

TNCTTAGGGCGA ATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC l rTTTTTTTTTTTTTT 
I M I M I M I I I INGC1 I I I i I i 1 1 I 1 I II I M I I I I ! I i I II I I I 1 1 I I I IANCAAAAA 
NAGGGNNGAAGTTTNTTTNTTNTANCCCCNTTGANACNTTTTGAMTTGGMTTGGTA 
AAAAANAAAACAAAAAGCCTTTGAATTGTTTTTGGGGGA 

CNl I t 1 I I I I 1NNCAAATTNTACTGNTTCCAAACNTTTTGGAAANAAANAACTGGAATTT 
TGNNGGNCNCTTGCNCTGGTTGNCNANATTNNAACMGAGGAACCCNTTTTGGGGTTAAA 

I llll I 

Sequence 2618 

CCGCGGTGGCGGCCGAGGTACCTGTGTCAGGAGTCCCCAAGCCCACTCCCAAAATAGAAA 

ATTCACTAAAAGGAGTCAMGGATTCAGCATACAGTAATATTMCAACTMGATTTATTC 

CAGCAGGATACAAAGCAAAATAAGAAAGGAAAAGTCGCATGGGGCAAAATCAGGTGAAAG 

CTTCCAAGAGTCCTCTCCCAATGGAGTTACACAGGATGTGCTTAATACCTCCAGCAACCA 

GTTGTGACAATACATGCAAAGAGTGCAAAGTCTTGTCCAGGACGGATGTTCTTTTTTTTT 

TTTTTTGAGAAAGCCTTGCTCTGTCACCCAGGCTGGGAGTGTGGCACCGATCTGGGTGAA 

GAGCAAGACTCCCGTCTCAAAAAAACAAAAGAAAAGGAAAAGTTCCAGTCCTTGTC 

Sequence 2619 

CGACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTTT 

I I rrn n n i 1 n n 1 1 1 1 1 1 1 1 n iacagatgctttcagtttaatagttgttttaaat 

i i i ( n n rm i i i i i i iaggntttaggctgaaaacntaaagcctgctgctcctgggtn 

tnatttctcagcttntccttacagctntagtacatanaatactcctttttttaagtct 

ntagcttcanaccaaatcggtctgacccaaaatacctgctnntntgganagngggatggg 

gagaaggtaactagaaaatgcaagatcntntttaccccaacagcataaaaccnccctagc 

tggggctntggctgggagggamtgctagtcattttttcc 

Sequence 2620 

attggagctccaccgcggtggccggccgcccgggcaggtacgcgggmgaagggacaaaa 

ctmgggtaagcgaactttggttctaggaatggcaagaccagcaagaagatcaccattgc 

cmctgtagcctttacacaatgtcatagtagcccaaattcagtcagctattgaattaagt 

ttattggcttcctttnncnagntaaaagmtgtatgmtgcttgtctttag 

tcatatggaggttctacmcttcttaga 

agacattcaagagatgggattgaaaaccaagagaaagggcagacaatagaaatatactca 
cagatgattcaggcagtggagttmgaccaggacttgaaataacaatgataattgttcaa 
gaaaatagaagacagggaaaaaaaaaaaa 

Sequence 2621 

nctatagggcgaattggagctccccgcggtggcggccgcccgggcaggtacttttttttt 
rTTTTTTTrm I I caggaactgtataattttgg ATATTGAACTGTTCTAC 1 1 I III U I 
TAAATTTTTATTCCCCCTATCTTTGCCTTGTTCTTCTAAGTCTGAAAAI I i n I CI I I II 
TTCTTTTCAANGTTGNNGTCTAAACTTNTMTNNNAGGTTTAAAGACCTGCTAATATGTA 
TTTAATTTCCAGGTTTTTTNAGTTTTCTATTTCTTT^ 

AGTC AT AT AAT ATCCTG NGTTTCTG AATATT AATCTTC ATTNGTTTTG AAC A I ll I I III 
NTTACCANATGAAGAGGCTGATTGATTNGGAGTTTTCCTGGACAGGGATATNCATGAGNC 
C 1 1 I I I 
Sequence 2622 

AGGTACGCGGGAGAGTTCTGCCTCGCTTCCCGGCGCGGTCGCAGCCCTCAGCCCACTTAG 

GATAATGGCGACAGCTGAGGTACATTGTGATACAACTTCTTCACAGATGAAGGCCAGCAG 

AGACAGCAACAGCTGCAAGCCCTTGAGAACGCGCCGGAGCAATGGGAGCCGGCCCATGAA 
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AAAGTAGGCAGCGAGGCCGCTCCGGGGGCCTCTGGCGGGGCCCGGGTCCTCCTCCGTAGT 

GGGGCTGTACCTGCCCG 

Sequence 2623 

AGGTACGCGGGAGAGTTCTGCCTCGCTTCCCGGCGCGGTCGCAGCCCTCAGCCCACTTAG 

GATAATGGCGACAGCTGAGGTACATTGTGATACAACTTCTTCACAGATGAAGGCCAGCAG 

AGACAGCAACAGCTGCAAGCCCTTGAGAACGCGCCGGAGCAATGGGAAGCCCGGCCCATG 

AAAAAGTAGGCAGCGAGGCCGCTCCGGGGGCCTCTGGCGGGGCCCGGGTCCTCCTCCGTA 

GTGGGGCTGTACCTGCCCG 

Sequence 2624 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGCAGATTCATGTTGTTTCTAATTTTTT 

ATATATMTTGACAAAATGMGAMCTTAACACCATCCTAGATTTTAGCTGCCCAAAGAA 

TGAAAAGAATGAAAAAAAAATCTTTGAAAACCCACMGTGATATGGATCTMTTTATGGT 

TAAATAGATATAGATAACAMCAGAATACGCCTGTTTAAAACTGTTAAAATGACATTGGT 

TCTAATTATACTTTTATTTAMTTGAAAGACAAGGCATTTATATGGTATCTCTAACCATC 

ACAACTTTTGTGTGACAAAMGAMTTATCACCAAAATACACCTCCTTAAGTAAGTGTCT 

GATTTCACACTTCCAGAAAMGTGCTCTTTCTGGTCAAGCCAGCAAGAATTGAGAA 
Sequence 2625 

CCGCGGTGGCGGCCGAGGTACGCGGGCTCCCAAAATCCTGGGGTTACAGATGTGAGCTAC 

CACTCACGGCCCAMTCTTCTTGATCATATGTTTAAATATAT1TTTTAATATTTGGAGCA 

TGAGTTGTCACTTCTTGTTTGCCTTTTTTATAAGGAAATGTTGGAGAGTTACATCATTG 

TAATGTAGAAAT GTTAAGT GGGAAAAATATACAGTTTGGTAAAATAAACTAGATTCTACA 

TTTATTT GTGGG i ITTT1 iCCCCTCCTTTCTTTCCACAGCACTTTTGATATCAAGCAAGT 

GGCTTCCTTTTTGAGATATTAAAAAAAAAAAGAAAAGGAAAAAAGTAAATGAAGCCCAAC 

TACCTAACCCTTTCTTATTTGTATTTGTTTTAGTATTGTGAAGTTGTGTTAAATAGTACC 
TGCCCG 

Sequence 2626 

CCGCGGTGGCGGCCGAGGTACTGTTCCCTTCTGATTTGGTCTAGATAGCAGAATCCATTC 

TCTTCCGTCAAACGGAAGACACAAGGCACCTGAGGCTGATCCTTCCCAGAAATTAACTCC 

AGAGGCTGCCACATCTGGTATGAGCGTCCAAACCCAGCATCGACAATGTAGTTCCTGCCA 

TCAATGGTCACCTGCAGGAGAAGGTGAATCATGCCAGTGCTGTATTTTTTGGCTGGAGTG 

CTGTAAACATACCCTCCCAACATCGTGGTCTCAAAACCAATAGTGGTCAGAGCCCAGTAC 

AGAAGATGATTGACCTGGAGACACC^TCCACCCCGATTTCTTCTCACAACTTGATCAAAA 

ATGGCCTCTAAGCCTAAGTCCATGGCATCCCCACAATGGGATGTTAAGGTTCTCAAAGGG 

AACAGCTTCGGATCTGGTGTTGAAGAATGTCAGTTAATGTT 

Sequence 2627 

ACTACTTAGGG C GAATTGGAGCTCCCCG CGGTGGCGGCCGAGGTACI nTTTTTTTTTTTi 
rTTTTTTTTTI lAAl 111 I U 1 H I || | n M 1 1 I H 1 1 1 1 I NCACTCAGAAGGGGNCTT 

TATTCATACTTGTTGACATTTCCCATGGGGAATGGTTNAAGGGAGGGGCAGGGAGGGCTN 

ANCGGNAACCCACGGNTTGGTTJTCCCTCCATCCCNNGGGGAAAAAAAACTCTGGTAGGG 

NCATNCTAANNAAANGTGANCAGGGGANCCATCANTATGTGGGGCTGAAATNCACTGGNC 

AGTGACTTNAGGCMCCNGAGTCCTTTTGTCAGGAAATGCAATTTNTAAGNGGAAAAAAA 
AN C AAG NCCC AAAGAAA 

Sequence 2628 

CCGCG GTGGCG GCCGAGGTACTCCGAAACAAGTAGAAMGTGCTGTTTGAGGGATTTTAT 
TAAATC i urn I AATGGAATGTGGTACAGTTAGCTGTCACTCAGCTGACACCATGATGT 
GGCAGCAGAGAGGGAMCCTACAAGTGGTTTGCCTCATTGCCTTTGCCACATCTGAAGTT 

CTCAGCAGCACTACCTTAGACTTCATGAGCTAATAGGAAACTTTTTATGGTGTAAATGCT 

GTAAGACTTTGTACCTGCCCG 

Sequence 2629 
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GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTAC I I I I I I 1 I I I I M I I 

I 1 I I I I II I I IAGTAGAAGCAGGGTTTTGCCATGTTGCCCCTTGCCAGAATCAATTCTAA 

TTATAGGAAAAAAGTTTCTGAGTAATCAAAATTTTGCACATATTTCTGGNGGATTTCATA 

GTTTAGCTTAGGAAACTTGAACTCTTACTGAGTTGGGTCTGCCCATGTANGATGCCTGGT 

GTTGGTGTGTGCGCTCTCTTGGTTATAGAATAATCCACCCGAATTCTTCTACCATCCAGC 

TCCATTCCATTTGCCCTTTCCATAGCCTATAAAGAACAGGCACAAAAAGTCACCGTATAA 

TTCACCAGTCTTGAAGTAATCATTACTTAAAATTAAGCNCAAAGTNTATGTAATCCAAAT 

AMCTCCTGATTTTCAAAATGACAGCAATGTAGCTTGTACAGGGCA7TTGTAT 

Sequence 2630 

CCGCGGTGGCGGCCGAGGTACTTCCTCACAGTTCTCACATATGGAAAGGATACACACTTT 

GTAG AAAC AAG AACTTTATGTTATCCAAGTTCTAG GATAGCC ATGAG CTCC A A .TTATCTC 

AGAGCTCTGAGTCCTCTACTCAATACCCATTGAGATTTATGTGTTCTGAGGCTTTTGTCT 

TCTAGCTACTTACTTCATTCTCCATGGGTAACGGTCATTCATCCACATTAACTMTTTCC 

TCACTCCAAGCTCTTTTCTAGAGATAATCTCCAGTCCCTGTGCAGAAACTGTCATTGCAC 

T1TCTGCTGAAATGGCAGTTTCTTCTCAGCAAGGTGAGATTATGGAATCCAGMTCTTTT 

TTCAGGGGTCACATGCCCATTTCCCCACTTGCATGAATGTCGACACTGCAGCCACAGTTT 

TGGCCGTAAATGTGAATTTGGCAAGTAACCA 

Sequence 2631 

CCGCGGTGGCGGCCGAGGTAC I 1 I I I I 1 I I I I I I I I I I I I GTTGCTTAAAACAACAGCCA 

TTTTAT7TAATATCTCATGATTTTGTGGTCAGGAATGTGGACAGAGCTCAGCTGGGAGAT 

TGTTCCACTCCATGTAGCATCAGAAGACAGGTCTNTTGGTGATATTCAGCTGGAGAATAA 

GTTMTATGCAGAGTTCAAGATGGTTTTCACTCACATTGACACCTTTGGCAGGGATGGCT 

GGAGGGCTGGGCTCAGCCAGATCAGTTAACTGGCGTGCCTGCATATGCACTCTCCACCAT 

GGTGGGTCTCTAGAATTGACTCAAATTCTAAGAATTTGGCGATTGTATCCCGTCAGTCCA 

AGCAAGCTGGTTTACCGGGAACCACCCNAGNCTGCAAGGGCCAGGGCANCCGTGCCTGCA 

AGGCCGTANCATATTGTTTGCTG 

Sequence 2632 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACTTCAGTCTCAATTCTG 
GAACTCTAAGAAAAAACGTTCCAGACTTCTACGCAGCTGCCACCTCGGAGGACGGGAGAG 
CGGGGGACGAGGAAAGATCAGGATAAGACCCTAACTCCCACAAATGACTCCAGGAAAGGG 
AGACCACCACTCCCCGAGCTTTGGAGCGCCGCGCGGGACCCGCGTACCTGCCCG 
Sequence 2633 

ACTNCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC 1 1 MINIMUM 
M M M M M M I GGGAGGCTGGAGTGCAGTGGCGCAATCTCTGCTAACTGTAACCTCTG 
CTTCCCAGGCTCAAAGGATCTTCCCACCTNTCCTTTGTTTAGCTGGGACTACAGTAGGGC 
ACCACCATGTCCAGCTAATTTTCG I MM! I ANTAAAGATGGGGGTTTTGCCATGTTCCT 
CAGACTGGTATCAAACTCCTGGCCTCAAATGATCTTCCTACTTAGGCTGGCCAAAGTGCT 
GGGATTACAGGCATGAGCCACCATGCCTGAACTTTAAAMGCATTTCTACTATATGTTTT 
TCTTFrTGATTACCAAAAMCTTNTATTAAAAATCTTTTAAAGTTCATTC 
GGMTTAGTCAACTTTAANACATGTTATTTAAAATTAGCCACTTGGGGAAACAA 
Sequence 2634 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGAAAACGGAGTCCCTGAAATAACAGAT 

GCAGCCACAGATCAGGGCCCTGCAGAAAGCCCACCCACTTCCCCTTCATCAGCCTCTCGG 

GGTATGCTGTCTGCCATCACCAATGTGGTTCAAAACACAGGTAAAAGTGTCTTAACTGGA 

GGCCTTGATGCGTTGGAATTCATCGGCAAGAAAACCATGAATGTCCTTGCAGAAAGTGAC 

CCGGGCTTTAAGCGGACCAAGACGCTCATGGAGAGAACTGTTTCCTTGTCTCAGATGTTA 

AGGGAAGCTAAGGAGAAGGAGAAGCAGAGACTGGCACAGCAGCTCACGATGGAGAGAACC 

GCGCACTACGGGATGCTGTTTGATGAATATCAAGGCTTGTCACACCTGGAAGCCCTGGAA 

ATTCTGTCCAATGAAAGCCGAAAGCAAGGT 
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Sequence 2635 

CGGCAGGTACTTi »M M M M i I > 11 1 I 1 t I 1 1 I 11 I 1 MI GN i U I II 1 1 1 1 T1GGGA 

NANACCANATTNTAAGGAANATCCCCTCGGGGGGAGCTCCATGGTGTGCATTTTGGCTTT 

CANGGTATAAAGGGGGAAACAATAGGGAAAGACTTGGNGCAAAAGGGAAGGAACANAAAC 

NGACAGATGANAAAAGGMGATGACANCATTGCAAGTGTTTNTTTAAATCCTNGCCANGC 

CAGGCTTTGTTTAGCTACCCTGGTCTNTTNCTGCATACCCCAACCCCCGTTTCCTGGGGA 

ATGGGGAGCAAAAAACAGGATCANAGCATGATCTGAAAAAAACACTGAACCTTCTCCCTG 

CTGGAACCCCAAACCCACCCCGGGT 

Sequence 2636 

CCGCGGTGGCGGCCGAGGTACGCGGGATATACAGAAATGAAATGACACTTCTGGGAGGCA 

GTAGAAGCAGGAAGTCAATGAATTGAGTAGAGGGTCCCATTCCCTCAGGCTGTCATTGAT 

CAGTGACAATTTATAAAAACAAACTGCAAAGTCTGTGGCAAGTGGCTGCCTGCTTCCTAN 

AAAGGAGCCCATGAAAGGTTAAACTCTGTGGTCGGTATTTGCAAGCGCCGGGCGTGGTGG 

CTCACGCCTGTAATCCTAGCACCTTGGGAGGCCAAGGCAGGCGGATCACCCGAGGTCAGG 

AGTTTGAGGATTTGCAAGCAAAAGGTCCTCTCCTGAGTCTTTCCCA 

Sequence 2637 

TNCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTAACAAATGCAGTAGC 

CMC AAGATTACCATGCAATCATTAAGGAGAACCAAAGTAAGAGAGCCACTCAAACCAGA 

TTTTGAACGCTACTAAAATTAAAGTAGTTCTTTGATGAATATGAATGAGTAGGGAAAGGA 

TTCTTTGTAATAGTGATACCTCTGGGGNTAANNAGAAGGGNGGTATGTGAGTTTTAGTCT 

ACAGATTATGGCAAATTCAGTGACAACAATCAAATGGTCTAAGATTGACAGTGGCACAGT 

TTTACTCTGTGMGGTAATGTTCAGGACAAATTTCAAGAAMCTAGAAAACCATT^ 

CAGCTGAAATCTTTCCCTAACCATTGTTATTTCCACTTTTAAGTCCTCAAGAGATGAGAA 

AAGGGAGGTAAGGCTTCCTTATACATTTCCTGCACAATGAAACATTTTTCCTCCTCCAGG 
CAAAGATTCAAG 

Sequence 2638 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGAGGTATG 

AGCGCCGGGACCTGTGACAGGGCTGGTAGCAGCGCAGAGGAAAGGCGGCTTTTAGCCAGG 

TATTTCAGTGTCTGTAGACAAGATGGAATCATCTCCATTTAATAGACGGCAATGGACCTC 

ACTATCATTGAGGGTAACAAGCCAAAAAAACTTTNTCTTGTCAACANGAACAAGTCATCG 

GCTATTGTGGAAATATTCTCCAAGTACCT 

Sequence 2639 

GG CGAATTGGANCTCCCCGCG GTGGCGGCCGAGGTACANAGTGCTTTTCTGTTTAGTTTT 
TACi I I I I I IGTTTTGTTTTTTTAAAGATGAAATAAAGACCCAGGGGGAGAATGGGTGTT 
GTATGGGGAGGCAAGTGTGGGGGGTCCTTCTCCACACCCACTTTGTCCATTTGCAAATAT 

A ^^GGAAAACNGCTCNTTTTAATTCTGATTGATCAGCCAAAACNGTCCCTGCCCGGGC 
GGCCGCTCTANAACTA 

Sequence 2640 

ATCGACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGTG 

MGG TGAGAGAAAAGCTGCCAAGCTGCATTTTAGTCAGTTGGGCAACAGTGAAAATTTTC 

TTTTAGAGGTGGAAAAAAGCCTCTCTCTAGTGACCTGGGTAGTTTGCACGGTTTGGCTGG 

AAACCACAGTCCCCCCATCTCTGCCAGAACCCCCCATGTGGCCACTGTCCTCAGACAGCT 

CCTGGAGCTTGTGGATAAGCACTGGAATGGCTCCGGCTCCCTCCTCCTCAACAAGAAGTT 

TCTCGGAAAGTTTGAAGCAAAAACTGGTCAGAGTGCTGGAGGAAAACCTCATTTTGTCAG 

AAAAAATTCAACAGTTGGAGGAAGGTGCTGCCATCTCAATTGTGAGTGGGCAACAGTCAC 

ATACTTATGATGATCTTCTGCACAAAAACCAACAGCTGACCATGCANGTGGCTTGCC 

Sequence 2641 

CGAATTGGTTTTCCCCGNGGTGGCGGCCGCCCGGGCAGGTACCAAAGGTGCAATTTATGA 
CTCCAGAAGGANTTCAAAACACATCGGAATCATGACAGCAAGATAACCCCAAGACAGTCT 
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TTTATCAAGTTTTGACGGTTTATGCTGGGCACTGCCAGACTGAACCAGAGCTGGAACACA 

ATAATGTTCGAGCGGNCCGCCCGGCCAGGTACTN N N N NTTTTTNTTTNTTNTTTGGATAA 

TACTGTTTATTGTCCACGGACCGAGCGATTGCTCAAGAATGATNCAAAGCATCAGAGACA 

TGCNCAGTCTGCTTGTCMCTTTCAACAACTCTTGTGTGTTCCCAGCACCGGAAGGCCTC 

AGACTTCATATTCCAGCATAAACACAGTGCTCCCCTCCCCCAAAANTCCAAACGGAAACA 

AAATCAAC AG AGC TT 

Sequence 2642 

CTTACTTCACTATAGGGGGCCGAAATTGGGTAGCTCCACCGCGGGTGGGGCGGGCCGCCC 

GGGGCAGGGGTTACGCGGGGGGGGGGAA7TTGGGGTNGGGCCATTACCCCACTTCCAAGC 

AACAACCAAGGAAAGCCATTGMGGGTTTTCAATGGMCTTCTTCCTTCTTTCCC7TGGA 

ACANGGCTTCTTNGGGCANGGGAAGCCCCTNGGGTTCTTGGTNCCTTANTGGAATCCNAG 

AAANGGGCCCGNCTTTTTTGNCCCCCAAAGGGAANTCGGGGGGGGAAMCCCCTTTNGGC 

CNTTGAAAACCNNTTAMCCMNCCTTTAAAAAAAGNGGGGAAAAAATTTNCCN 

NAAAAAAANACCCCAAAGGGGTTTTTAACCCCCAAAACCACNNGGGNACCCCAAAAAGAC 

CCCCMGGGGAAAAGCCACNAMGAMTTCTCCAAGCCCCTTTTTTTGGGGGAAA^ 

AAGGGGCCCNTTATTAAACNGNGGAGGGCCAAAAAAAAAAAAMGGCCNTTGTTGGNGGG 

GGGGGGGGAACCTTTTNGGGGAAAAAAMCCTTTTTGGGGAAAA 

Sequence 2643 

NACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTGCAGAAGCCAG 

GGCAGTCTCCACAGCTCCTGATCTACTTGGCTTCTAATCGGGCCCCCGGGGTCCCTGACG 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnCTGCATGCAAACCTCTTACAAACTTGCGGGATTCGGCGGAG 

GGGACCAAGGTGGAGATCAAACGAACTGTGGCTGCACCATCTGTCTTCATCTTCCCGCCA 

TCTGATGAGCAGTTGAAATCTGGAACTGCCTCTGTTGTGTGCCTGCTGMTAACTTCTAT 

CCCAGAGAGGCCAAAGTACCTGCCCG 

Sequence 2644 

AACACTTTTTAGGGCGAATTGGAGCTTACCGCGGTGGCGGCCGAGGTACGCGGGGACACC 
CTAGATCCCAAGATCTCCAAGGATTTGGTGGCATACCCACTCCAGCACACAGAAGCATGA 
GGTTC ATG ACTCTC CTCTT 
Sequence 2645 

AGGTACI I I I I I I I I M 1 M 1 1 1 1 1 I 1 1 I I I I GTTTTATAACCAGTGCTTTTGGTAACTA 

ACAGAATGAGAACTCACTTATTACAGTGAAGATGGCACTGAAACAATACATGAGGGATCT 

GCCTCCATGACCCAATCACTTCCCATTAGGCCTCACCTCCAAAATTGGGACTCAAATTAC 

MCATGAGGTTTGAGGGGACAGACATTCAAATTATAGTAATTACGAAGTAAAATGAGCAA 

CTGTTTTCTTTTGCTATCCAATCATGATGGATTGGATATGTGTGATGAAGGANAGGGAGG 

TATGAAGAATGACTTTNANATGTGTAACTTATAAAACTGGATACTATTCACTGAGAGAGG 

AAACACTGAAATNGACCAGAATTGAAGATGATGAGTTCC 

Sequence 2646 

CCGCGGTGGCGGCTCGCCCGGGCAGGTACATAGACCCAGCCCAAGGTTGGCCTTTGGGGT 

CTGATAGCATTCACAGAGCAGGAGA7TCACCCTTGGAAGCAATTGAATGTGTAAACTCTG 

AGAGCAGACACTGACAATGCAATGCCTGCATACCTGACTCCTGGGCTGCCTTCCTTCTCA 

GCAGAGCCCACTGAAAGTCTCACGTGTGCCTTTAACCAATGAGGATGGCAGGCCTTATCT 

GTGTTTGAAGTGGACAGGTCATTGCTACCCCTGTTGGAGAAGCTATGTAAGAAGGAGTAG 

GCTGGCTTCTGGTCTAGTTCGACTCATTCTTTATAGGGACATTTCANAATGATGAGTTGG 

CTAAAATTTACTTTAAAGGGGAMTGTAAMTTACCTCTTCTTTTCCTAT^ 

CTTTCCCATCCTGAGCCCTCTGGCAAGGCCTGTTGCAGCTCMGAAGTTTTTGGTCAGAT 

TACAAAGTTACCAAATGGTCAACAGGGAAGGACCCACAAGCAGATTCCAAAAACTCTTTC 

ATTCACTTGAAGTTAAACTGAAAATGGC 

Sequence 2647 
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CACCGCGGTGG CCGGC CGCCCGGGCAGGTACGCGGGGGCCTCACAGATGATGACTTTCTT 
CATCTTCTTGCTCTTTTTCCCATCCTTCACCGGGGTCTTGTGCACCCTGGCCATCACCAT 
CTGGAGATTGAAGCCTTCANCTGACTGTGGCCCTTTTCGAGGTCTGCCTCTCTTCATTCA 
CTCCATCTACAGCTGGATCGACACCCTAAGTACCTNGGCCCGCTCTAGAACTA 
Sequence 2648 

CGCCCGGGCAGGTACACATGGGGTTTCACCATGTTGGTCAGGATGGTCTCGAACTTCAGA 

CCTCAGGTGATCCGCCCACCTTGGCCTCCCAAAGTGCTGGGATTACAGGCATGAGCCACC 

NNACCCAGCCAGTTAATTTTTCTATTAACTACAGACCTANTTAANATTGAGGCAAAANAA 

ATGGGTCCTTGGGATTTGAAAATTACTATNCANTTTGGAAGTTNAATTTGCAACATANAT 

TGTCTGTTATTAAATTACTAGATATAATATCNCATAGGTGGAAAGAMGGTTGCTTAATT 

AAAGATCTAAGTTACTAGTCATGGTGTCAGATATNGANAATGATTGAANGTTATNAAGAN 

TCNCACACCAG ATGAGT AAATTGNTGTTTTCNNGAAGAAGTTACATNAANGTNACCGGAG 

TATATTTCAGCA I III! I GNTANATTAAAAMTTTGTNAAGCTATTTTCAATTTCAGGGA 

AATACTCTTAACNMNTTTACCGGGTAAMTGACCAAATTCCAGGTTTAATTTTCACATG 

TTMCACCCTCC TTGAG CCCCTTTATTTTTTTAAAGCTCTTNAATNTATT^ 

AATTMCCNTTTCTTTTAAGGAAGMTTTNAAAAA! 11111 I GAAAAAAAGTTGGAAATT 

CNTGGAANAGGCCCCTGGTTATNGTTTTTCAAATT 

Sequence 2649 

GGCGAATN GGAGCT CCACCCGCGGTGGCGGCCCGCCCGGGCAGGTAC 1 1 WWW I NTNT 
TTTGTTNTN I nil iGGAGGACTGGGAAATAGAGGACTGGNAATTTGAGCAAATTNTGAG 
GAATCTTAAATTGCTATTAGGACACCAATATGACGCATCAATNAATCATTATTT 
Sequence 2650 

GNAGCTCCACCGCGGTGGCGGCCCGAGGTACGCGGGGAGTCTTGGTTTTCTGCTAGTGCT 

GCTGCTGGGAGGACGACGGACGGCAGCGGCCAAGCAAGAAGAAAGACGTGGCAGCAAGCG 

GGAGTCGGGGATAGTGTCATCGGTTCGGTAAAGTTTCTGTTCTCTGATTTGGGGATATTC 

CCTGCCCAAAAAAAAAAAAAAAAAAAAAAANGTCCCTGCCCG 

Sequence 2651 

ACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGT 
ATGATTGTATATGTGTTACTCCGATATGTAATCCATTTCACTGGCTGAGTTTGGCCCCTA 
GCCATGTGTTAATATAAAGTAGGCATGGCTTCCCAATGGAAATCTCTGAGAATGACAGTG 
GAGTTGTGCAAGCATTTTACATTGCCACATAAATCGAAT^ 

ACGGCACA TTTAG GCAGTTGAATATGACCGTTACCTTGCAGACTAAAAGGTTGAAGGCCC 
GAMCTAACTTTTAGCTAACAATAAGGGCTGTGCCCCAATC 

CTGAGAAAGGTTTGGATTGACNTGAAATATTTTCCTCTACAGGTCAAGGGACTTTGGCAT 
GTGGTGGCTGAAAACTGGAGC l Hill I GTGT 
Sequence 2652 

CGACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGATGTTT 
GCCACTTTGCAAAGGAGCTCACTGTGGTGTCTGTGTTCCAACCACTGAATCTGGACCCCA 
TCTGCGAATAAGCCATTCTGACTCATATCCCCTATTTAACAGGGTGTCTAGTGCTGTGAA 
AAAAAAMTGCTGAACATTGCATATAACTTATATTTGTAAGAAAATACTGTACCTGCCCG 

0 

Sequence 2653 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTCCGAAGATGGGCTTGT 

ATCTGGTTTCGGACGGACTGTTAATGACMTTTGATCGACGGGAATTGCACACCCCANAA 

TCCACCACAAAAGAAAAAGGTTACAAATTTAAC^ 

nrm icataatactactgaggggaattggtnagaatgtatnnatgtaaggcnttctta 

ATTT AAG TT ATT AAAGTTT AC ATTTTT AAT ATTTTTAAAC CTTTTGTAAATGCTTG G CTT 

AATTAGAAAATGTTTACAGAAAAGTAAAAAAATTCTAGTAATATGGGAAATCCTTGTAAG 

CAGCATGGTTTCAGAAAAATCTCAAGATGATTTATTTCACCAAATTGAGTNTTT^ 
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AACTAGGNAACTTCCCCAACCAAAAACACANGACNTTGAATAATATTTTTGTGTTATT 
Sequence 2654 

ATTGTTATTCC G NTTTAC AAATATN ACN ACATN NATATN N AGCCCGGG AGCC ATAAAGGN 

GTAAMGCCTTGGGGTGCNCTAATGAGTGGAGCTAACTCACATTAATTTGCGGTTGCGCT 

CACTTGCCCGCTTTCCAGTCGGGGAAACCTGATCTGTGCCAGCTTGCATTAAATnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnNGTnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnNTCGNTTGCGCTCGGTCGTTTCGGCTGCGGNGAGCGGTATCAG 

Sequence 2655 

CCGGGCAGGTACTCTN ITTTTTTTl NTTTTNTTTAGGCAGNTNAAATCTAGGATGGTGTT 

AACCTTGTCTTCATTTNGCCAATNATATNTNAAAAATNANAAACNACATTGAATCTGCAT 

TTCTTGGAN NAGANNGTN N ATMCAMCTATTTCTCAATATTNNGATACTANAAACTAGN 

GAAGGNGNCATGGGNTCCAGCNTCTTATCTNTNATNTGAACATGGGTTTCTGAANGGAGC 

CTATATNATAATATAAATGGTNTGNNNAAAATGAGGCACTAGCNTTGNCTGNNACTGCTA 

TAAAAAAATNACCA 

Sequence 2656 

CCGGGCAGGTAC U II I I I I M I 1 II I I I I I I I I AGGCAGCTAAAATCTAGGATGGTGTT 
MGTTTCTTCATTTTGTCAATW^ 

TTGGATGAGATAGTTAATAACAMCTATTTCTCAATATTTGTATACTAAAAACTAGTGAA 
GGTGTTATAGAGTTTCAGTATCTTATCTCTTATTTGAACATGGGTTTCTGAAAGGAGCCt 
ATATAATAATATAAATGGTATGTAGTAAATGAGGCACTGTCTTGGCTGGGACTGCTATAA 
AAAAATTACCATAGACCATTGACTAAACCACAAACATATACTTCTCACAGTTCTGGAAGT 

0 

Sequence 2657 

CGCAAGGAAAGAACATGTGAGCAAAAAGGCCAAGCANAAGGCCANGAACCCGTNAAAAAG 

GCCCCGTTGCTTGGCGTTTTTCCATAAGGCTTCCGGCCCCCCTGGACCGAGCATTCACAA 

AAAAATCGACCGCTCAAAGTCAAGAAGGNNGGCGAAAACCCGGACAGGGACTATAAAAGA 

TACCAGGCCGTTTTCCCCCTGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCGACCCTGGC 

CGCTTACCCGGATACCTGTCCCGCCTTTCTCCCTTCNGGAAAGCCGTNGGCCGCTTTCTT 

ATAAGCTCAACGCTTGTAGGTATCTTCAAATTCGGTGGTAAGGTCGNTTCGCTCCCAAGC 

TGGGCCTGTGTGCACCGAACCCCCC 

Sequence 2658 

CCGCGGTGGCGTCTACCTCATTGTCAGCCTCCTGCTCCCCATTTTCCTCATTAGCATTCC 

CGTTAGCAGGGGCGTCTCTTCCATTTTCTGCCTCTTCCACAACTTCCTTCTTCTCCTTTA 

AGTCCTTGGTGGTGATTTCGGAGCTGGTGTCTACGGCTGCGTCTGACCCCGCGTACCTCG 

GCCGCTCTAAGAACTAAGTGGATCCCCCCGGGCTGCAGGAAATTCGATATCAAAGCTTAT 

CGATACCGTCGACCCTCGAG 

Sequence 2659 

CCCCGCGGTGGCGGCCCGCCCGGGCAGGTACGCGGGAGAGTTAGTTGATTTCAGTAGTTC 

AACAATGTCAGTATAAACCCATATTTTGCTTAGCCATATATTGCTCAATTCTGCCATCCT 

TGGGACTGGCTTCATCCTCAGGCTAGTAGCAAAATGGTTGCGGTGATTCCAGGCATCATA 

TCATATCCATAAATAACAACTTCCAAAAGAAGAGGAAAAGTATCTCTTCCTGTGCCACTG 

TTTTTGGATAGAGGAAACTTTCTACAAAAACCTCTAGCAAACTTCTTTCCCATCT^ 

GCCAGAATTGGGTCATAGGCTAGTTCCTGAACCAGTCACCANTAAATAACCCTTAGGTTA 

ATCACGTTTCTCAATCTGGAATGGTGTTGGCTTCTACTGAGGTATATGGTTGTATTGGGG 

AGAGATTCTGTTAAAGAGGAAGAGGGAGAGAATTGATGCTAGGAAAGTAATTTGGAAAAC 

CATMCTCTCTTCAGTGGTATTGGNTTTNTCAATAAACAGTNTTAGCAAGTTA T I I FTTA 

CTTTTTCAAAA 

Sequence 2660 

TNGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGA I ITTTTTTTTT C TTTTTTT C TTTTn 
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TTTTTTGAGAGGGACTCTCGCCCTGTCACCCAGGCTGGAGTGCAATGGCACGATATTGGC 

TCACTGCAACCTCTGCCTCCTGGGTTCAAGTGATTTTCCTGCCTCAGCCTCCTGAGTAAC 

AAGGANTAAC C AGGCATG NGCC ACCACNCCC AGCTAATTTTTTTGTATTTTTAGTAG AGG 

CAGGATTTCACCATGTTGTTCAGGTTGGTCTCAAGCTCTTGACCTCATGATCCATCTGCC 

TCAG CCTCCCAAAGTGCTCGGATTACAGGCATGAGGCACTGCGCCCAGCTGACTGATTTC 

TTTTGGCTAAACACTTTTTAAGTAGATGATCTATTTTTTCCCCCGTTTATGTATC 

ATCACCAGGAAAATTGTAACTATACTCAATAATATGGGCTTTACAATTGGACAGACACTT 

TATTTACG 

Sequence 2661 

AATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTAC ITTTTTTITTTTTTTTTTTI 

TTTTTT71 I ACTCTCCTGTTGCTTTTCTTCTTCTTTCCCTTTCAGGCATCTCCTCGTTTT 

ATTTTCTGTAACTCACANANTGCTGCAAAATAATGTTAGGGCTTTGCAGTTTATTCCTTG 

CNATNTAACTGCNTNAATNTAAAAAAGGGNCCTNACCATTANNNCNCGTTGGGGTTATTC 

AAATAACCCAAGGGAATGTAAAGCGATTTCATATCTCCTTGAATCAGGAACATTAATTAA 

TCATATTGATGAGAAATGCMCCGTTTTTATTTACCAGGGCAAGAGGCTGCCAANATCTT 

TCTGAGTTTAANAAATTGCANACTCTGNGGGAGGGGAGGGAAAAATTNATTTAGTTNTTA 

AATTGGAGGGGG 

Sequence 2662 . 

GCCGAGGCACCI I I i I I I i M I i I I I I I I M I I NGGGGGGATGGGTATNTNCCATGTNAT 

ACCAACAGNTCATNAACTCCTGGGCATAANCCAGCCTCCCACCAACATCATTAACTTAAT 

CANAGCACCAAGTGTCACATATGGTAACACANTCACAACTTCCCANGAATTNAAATTTTT 

GGANNAGGTNGTACATTTTTTTNTGGGNCCTTTTNTTTAATTCNT^ 

NAGGATCTTGGACCCAACAAAANAAAAAAATTGGGCAGCCCAACACTTNTTTTTNCTTCC 

CTTTCCTTTTCCTTTTAA 

Sequence 2663 

CCGCGGTGGCGGCCCGAGGGTACAACAACAAAGTCTGCCCACCTTCTTGTGTGCACCTCT 

GTGATCAGAAGTAGCAATCAGTGATCGGATGCAGATCTCTAATATTTGGAAGACAACATC 

CTTTTTGCCCACCCTGGAGCCCATAGGCTGCATGCAAGTTTCTCCAAGAACTGGGGGATG 

GGTAGCTGCTACCGTTTTAAGAGCTAAMTTAAAATTAATTGTGCTTTTACCTTATACCT 

TCCCCTGGAATTTGCAAGCTTACAGTAGACTCCAGAGTTGCAAAAATAGTTACATTAGGC 

AGATTTTGCCAGTATAATTGTTGTCTAGGAGCACGGCCAGACTCCTGGTGCTTTCTACCC 

TTCCATCTTCCCCMCCGCTAGTTATTGAMCTTACTTTAATACAGGAATGTAAGCAGTT 

TACTTTGTGTTTTTTCATTTTCATCACTTATGAATAACTCAGCTAGTGAAAMCTTCT^ 

GAGTAMTCATTTTTCTTTCAA^ 

TTTCAGAGAAAGTCTGCGGCCAGTTTGATTTTTATTTCTTTTCAGATT 
AGATCTGGGATGCTTTAAATCTTTTTCCTTTTAATCAAATAACCCATGAAMTATCTTGG 
GTGTGCCCGCGTACCTGNCCNGGGCGGGCCGNTCT 
Sequence 2664 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGNCCGGNCNGGTACGCGGGGGGAGA 

TGCTGNCACCTANGTNACTTGNAGGACCCTATACNGCAACCTCCTTTGCCAGGAACTATT 

TATAAACATCCTGCANGAAAATGANTCAAGGAAGCTTTTCTTTTGAGCTATTTACAGC^ 

NTAGCAATTGAGTAAAGTATACTCCTGTGAACAAAATNNTGGAACATATNNGTNNNTCTC 

TAACTGATTNCTCCANAATTTGGAACTAGTTCAGNGAANTNNAAGAGACAGGGATATCCC 

ACNAGGTTATGCAAAACATCAAGAGAAGATGANAGGTCAGANATGGGAAGAAACAAGAAC 

TTTGACATGCTTGGTGTNCTTGCCCAAGCTTTGAAGAAGTTTACAAAGTCTATATGTCAG 

AATACACATTTCCC 

Sequence 2665 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAATGTGT 
GGCAGTCCAAAATTACGAGGAAAATGAGTTCCCTTCATGGGTCACATCAGCAATTTTTTT 
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TTCCCCTTTTGAGACAGAGTCTTGCTCTGCTGCCCAGGTTGGAGTGCAGTGAGACCCCGT 

CC CAAA AAAAAAAAAAAAATGCAAGTCTGGAAAAAATATAGCTNGGGAGTCAGTAATGCC 

AGTTTTAGCTGAGGGAGTTAAGATCATTTAATCAGTGTTAAGGAAGAAACTCAATAGTAG 

TGACCACCAATACTTAAGGAAAAAGGATAAAGTACCTN 

Sequence 2666 

CCGCGGT GGCG GCCGCCCGGGCAGGTACTTGAAGTGTGACTAATGTGACCAAGGGAACTG 
AATTTTAAATTTTATTTACTTTAAATTAATTTAAACTGAAGATA 

GGCTACCACATTGGATACAGCCTTAGATTCCTCCCAGACCAGACAACCTTCTGTGGCCTT 

CCAANGTTGNAACAGCAAACTCACACT.GTGGGCAGCCAGGCCTGGAGTGGGGCCCAGCAA 

GCCGCATCCTGGGGTCAGGTCTGGCCAACTAATTATCTACTGTCTATGGTCACATTAGTG 

CTACAATGGCAGCATCGAGCAGTTGCAACAGAGATGGTGTGGCCTGCAAAACTGAAAATA 

TT CATA TTTACTATCTGGCCCTTTACAGAAAAACATGCTGACCCCTGCACTACAGCAATT 

AGTTTTGGAGAGAACATTCCTGATAAAAGCTTACATCTTCTCTCA 

Sequence 2667 

CGACTCACTATAGGGGC GAATT GGGAGCTCCCCGCGGTGGCCCGGCCCGCCCGGGCAGGT 

ACAATCTCTGGCCCTACATTTTCTAAATGTTATGCCACCCCGACCAAGGGGCAACTCCTA 

CAAAGCCAGGCAAAATAATAAAATCATATTTGTCTCTAGTGGAATGGATAACTATGCCTA 

AMCTGTGCCCTTTGAAAAGCAACTAGAGAGATAATTTCTGAAGTGTTTGTCCCTAGCTG 

AATGTGTGGCAAAATTCTAAACTCCCTGAAGTGTGAAAGTGGTTTCCAAGCCACATGCAC 

ATCCAGTAGTGGTAAAGGGTGAAAATCTAACTGGCTAAGAGGGCTTCATAGCAACATTAA 

CCAAAAAGTGGTTTATGTAGTCTTTGCCTGCTTCATAATTCCCTANGCATTCTATGCTAT 
TCTGCACCT 

Sequence 2668 

CGCCCGGGCAGGTACATCCAAATTGGGCTACGAAAAGTGGCAAGTTTGATTGTAGCATCA 

TGTCTTTGAGTGTCCTATTGGACTACAGATTAGAGGATAATAAAGAACATTCATTTGAGG 

TTTCATTGTTTGCGGAACTTTTCAACGAAATGC 

ACAMTCATTACTGTCTCTTCCTGAGAAAGAGGACAAAAAAGAAAAGGATAAAAAAAGCA 

AAAAAGATGAGAGAAANGATNAAAAAAAAAAAAAAAAAAAAAAAAAAAAAM 

GTTAAGAAGTTGAGCTTTTATCTTAGAGGCAGCAGAAGGTTTGGAGCCMGGAATGAAAT 

GATGAGGCGTCCTTCAGGTAATGAACTTCAGCTGCAGTGTGAAAGGGGCAGGAAGACTGG 

CAGCTGTCAAAACTGGAACAGTCCAGTGAGTATGTNCAGGCCCGGGCTTGGGCAGTGACC 

ANGGCAGGGAGCACCATCAATTTCTGCGGTAGAACCTTCTAGAGGAAGACCANACAGAGG 

GTCAACAAGAGTTGAAAGGAGAAAAATATTGAGGATGANGTCCAAGTTTTTAGTTTCAAA 

ANACTAGGCATATGGGTAATNCCGCGTCCTTGGCCGTTNTANAACTAGGGATCCNCGGGC 

CTGAGGAATTCGATATCAAGCTTATTCGATCCCGCCANCCTCAGGGGGG 

Sequence 2669 

TCACTATAGGGCGAATTGGGAGCTCCCCGCGGTGGCGGCCCGAGGTACCTGAAAGTGACG 

GGGAGAATGGAACCAAGTTGGAAAACACTTTGCAGGATATTATCCAGGAGAACTTCCCGA 

ATCTAGCAAGGCAGGCCAACATTCAAATTCAGGAAATACAGAGAACGCTGCAAAGATACT 

CCTTGAGAAGAGCAACTCCAAGACACATAATTGTCAGATTCACCAAAGTTGAAATGAGGA 

AAAAATGTCAAGGGCAGCCAGAGAGAAAGGTCGGGTTACCCACAAAGGGAAGCCCATCAG 

ACTAACAGCTGATGTCTCGGCAGAAACGCTACAAGCCAGAAGAGAGTGGGGGCCAATATT 

CMCATTCTTAAAGAAAAGMTTTTCAACCCAG 

CATAAGTGAAGGAGAAATAAAATACTTTACAGACAAGCAAATGCTGAGAGATTTTGTCAA 

CAACATTTGAATATTATAACCTGCAGGGGTCTGTCCTGCAGATGGTTGAGACCATAACCA 

GATACTGATATTCAGTGAAAGAGCAGCAAGGATTTCCGAGCTGA7TACCAGACCCCAANG 

GAAGATGCTTGTAAAGAGCCCNCCAGCCACAGNCCTGGCTTGCTNGCCCTGGGGGCATTT 
ATTTAGTAA 

Sequence 2670 
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AGGGGCGAAT TGGGAGCCTCCACNCGCGGTGGGCGGCCCGAGGTAC I rTTTTTTTTTTn 
liltl I NTTTTGGGCCTAATTGCTTTGGGCTGAAGACAGAGTGAGGCGATGTACTGTAAG 
GCGAACACCTTNTGCACAGCAAACCCGTNATTCGCTCACCTGTNTGNGGTNAGGCTCTTG 
TAGAGAGTGANCACTCACTANTAGTTTTTAATAATAAACCANGTAACCAAAGTGAGATGA 
AGGAACTA I I I I IGGA 
Sequence 2671 

CCGCGGTGGCGGCCGAGGTACAAGGCATTTAGAGAAAGCTTTTTGATTCCATTCAATCCT 

GGAMCCMCATGTTGMGGGCCAGAT7TTCCACCATTTTATAGGGATGCAAACTTTAGC 

TCAGAGAGGTTAAATGACTTGCCAACAGTCACACAACTAGCAC7TAGAGGAACGGGAAGG 

CTAACTCGGAGCTGATGCTGCAGCTTACACTCACTCCCCACTCCCACGCAACACAAACTC 

TTTCTAGGCAGCTTCGAAGGTACCTGCCCGGGCGGCCGCTCGA 

Sequence 2672 

AGGTACAACGCCTGGAACGAGAAGCGCAGGGTCTACGAAGAATAGGGTGAAAAACCTCAG 

AAGGGAAAACTCCAMCCAGTTGGGAGACTTGTGCAAAGGACTTTGCAGATTAAAAAAAA 

AAAAAAAAAAGCCTTTCTTTCTCACAGGCATAAGA 

AC I I I I 1 ACC AACGGTC AGTTTTTACATTTTATAGCTGCGTGCGAAAG GCTTCCAGATGG 

GAGACCCATCTCTCTTGTGCTCCAGACTTCATCACAGGCTGCTTT7TATCAAAAAGGGGG 

AAMCTCATGCCTTTCCTTTTTAAAAMTGCTTTTTTGTATTTGTCCATACGTCA^ 

CATCTGAGCTTTATAAGCGCCCGGGAGGAACAATGAGCTTGGTGGACACATTTCATTGCA 

GTGTTGCTCCATTCCTAGCTTGGGAAGCTTCCGCTTAGAGGTCCTGGCGCCTCGGCACAG 

CTGCACGGGCTCTCCTGGGCTTATGGCCGGTCACAGCCTCAGTGGGACTCCCACAGNGGC 

CCCTGTANCCCGGGGCAAGCAGGAACAGGGTCTTTCTGCATCTGTTTTCTGGAGGAACTC 

AAGGTTTGGGTTGCCCAGAAAAAAATGTGCTTNATTCCCCCCTNGGTT 

Sequence 2673 

CTACTTAGGGCGAATTG GAGCTCCCCGCGGTGGCCGGCCGCCCGGGCAGGTACTTTTTTT 

n n m n n 1 1 n 1 1 1 1 iaggncgaatgcagggttttattgagtggtggaggtggttct 

CTGAAGGATGGATGCAGAGCTGGAAGCGGGGATGGCACGGGAAGATAATCTTCCTGCTGG 

GTGGCCAAACCTNTCCGACCACCCTCGGCTGAACTCCTCTCGGCGTTCAAATGTTCCTCT 

TCTCTCTTTCTCTGCTGTATCGTTTCACCATTCATCTGCTTGTCTGGGATGGTGAANACA 

ACTGTTTGCTCATTTGCTTGGCGATAGGTGATTGTCTCGCCGCTCAGCTATAGGCGATGG 

TCTCACCACTCGGTGATAGGCGATTGTCCCATCTAGGTCGCCAAAATGTGTCTGGAATTG 

GTGGGTTCTTCGTCTCACTGACTTCAANA 

Sequence 2674 

CCGCGGTGGCGGCCGAGGTACAGGAGACTTTCTGATTTCCAATCTTGGCTCAGGTCAGAA 

GAAAAAGGGGAAAGGTTACATTCCTGGAAAGAAAATACAGCCTATTTGAGGGCATGCCTG 

ACTTTCAGTCCATGCACTAGTCCTCTTGGCTCGTAGAGTTTTCAGTGCCCTGAAACCATG 

ATCCCTCTGCCTCCATGGTCCTCTGAGTGTAGTTATTATGTCTGCAAACCAGTAATGTCA 

CAAGTCAAAGGGTCTGGGGCTTCCCAGGGAACCACTATGACTGAGTATATTCTGATTTGA 

GAAACCTGTGACAAGTCTTCTCAGCACTCGCCTTCTAATTTTTGGAGCCGATGCCTGGGT 

TCCCAGCATCCAGGTCACAGTAGCTGGAAATCATGCAGGAAGGGCAAAAAGCCAGCAGCC 

CAGAATGGACTGTCTCTGCAGCTCTGGCACATTCTCTACACATCGCCACACCACTCAAAT 

GTCCTATAAAATATCAAAACCAATAACCTGGACATGACCCGATTAATCTGTACCTGCCCG 

GGCCGGCCCGCTCTAGAACTAGTGGGATCCCCCGGGGCTGCAGGAAATTCGATAT 

Sequence 2675 

CCGCGGTGGCGGCCGAGGTACTTCAGTTTCTCTAGATTACCATGTAAGACAGCTCTGTGG 

ATCCTCTTCAGATGATACGGTTTAATGGGGTATTGGGGAAATGCGAAGCCATCCGAGCAC 

AAGCGCTCCATGAGGGTGGGCCACCTCTCCCGCTTGCCGTCTTCCATAATCGTCGGCTGC 

AAATTGTAGCCTGCAGCCGTATTTCAGCTCGCCTTCGGGGATCGCCGCCTCCGAAGAGCA 

ACAACGAGCAAAGCAGTCTGTCCACGGACCTCCGCACAGACTCTCAGCGCCTCCCGCCTC 
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TCAGCAGAAACGCCCAACAGAAGGGTTAGAACCAGCGAGCACGCGCACCTTAGCCGGCCC 

TCCCCGCGTACCTGCCCGGGCGG 

Sequence 2676 

CCTGACGCTGGGAGGAGATGCTGCCACCTAGGTTACTNACTCGGTCCNNNTACGGCAACC 

TCCTTTGCCAGGMCTATTTATAAACATCCTGCAGGAAAATGAGTCTTATATGTCAAAAT 

ACACATTTCCCACCTTTGCCCANCNGTNNNAAAACCNTAGGAAGGANGAAAAANCATTTA 

AAAAATGACACNGGAATGTTAATGGAAGCAATGTGAtGGTCGTTTTGGAGGTGGAACCCT 

TTCMN AAAGGTAATTAAATGCCCTTGGTTAAGAAGAAGGCCAAAAGAAGC7TGCGCACC 

I I l I I ICCTGCCATGTGAGGAAGCCAAANAAGCCGGCTGTCTGCAACCTGCAAGAGGACC 

CTCACTAGAAAGCTAGCCATACTGGCATCCTCATCTTGGCTTTCCAA 

Sequence 2677 , 

CCGCGGTGGCGGCCGAAGGTAATCGGGGGCCAGTCTTTATACTGCTGACAGTAATAAATT 

CCAAAATCTTCAGACTGCAGTCCACTGACGACGAGAGTGAAGTCTGTCCCAGACCCAGAG 

CCAATGAATCTGGCTGGGACACCAGTGGCCCTGGTGGATGCAGCAAAGACGAGGAGCCTG 

GGAGCCTGGCCAGACTTCTGGTGGTACCTGCCCG 

Sequence 2678 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGCCCGGGCAGGTACTTTTTTT 

M 1 I I I 1 1 1 M M M I M 1 1AI t 1 I 1 1 11 11 I 1 H M I 11 I I I M I CTAAGNCATTACTT 

TTTATTTTGAAGGATTTGNGAAACTNTTCACATNATGGNGANAGTTTGTNTGATTAANAA 

AAANCAGNTTTTTCATNAAATGCTTGGAGGNGAACNAGTTNTCACCCTGNGANANCCNAC 

CNTCCCATTAACTTNGAAGTTTNTNTTGATTAATANAAAAAAAAAGGGGAGGGNGAAAAA 

AAGGNGGAACATGCTAAAAACCTTNTGACAATCNTNCAAATGTCCCGCGTACCT 

Sequence 2679 

AGGTACTTAACTAGGGTTAGCCGAGGTCAGGAGGGTGGATGCCTTATTATGGGATTAGTG 

CCCTTCTAAGAGGGACAGTAGAGGGCTTGTTCATGCTCTCTCTCTCCATGCACAAGAAAG 

AGCACACAGTGAGAAGGAAAGAGGCCCTCACCAGAACCTGACCATGCTGGCACCGTGATT 

GCAGACTTCCAGCCTCCAGGATTGTGAGAAGATAAATATCTGTGGTTTAAGTCACCAAGT 

CTGTGGTGTGTTTTATGGCAGCACAAGGTAACCCAGATGATGCTTTTCACCACTGGGTGG 

TCCAGCCTTTCATGGAAGCTGTCAGCTCCTCAGCCTTGCAAAAAATGCTTGCTGGTCACT 

CTCTTTGGTCCTTTCAGATGCCTTTCTTAGTGGTGCCTCTA 

Sequence 2680 

CCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCCATACAGTTTTTCTAAT 
TTTCTAATTAAATGTAAGTATTGTGAATATTTTC^ 

AGTTTGATAGTGGTATAAACTGTTTATATTTTGTATTCTATCAGGAAAGTATTACAGTAA 

TGGAATAATTATMTTTCATTGAGTGCGTCTCCTCTTTTTMTATGMTTTTGAGCAAGT 

TTTTAGATTCATTTATATGMGTGTCCATTTTTGTAGGTAAAAGTAGTTGAAAA^ 

TGTATGGTTCAACCTAATAACAGAAAATATCTAGCCTTAATTAGAAAAATAAGTCATAAA 

GACAACTTT GTTT CAAATAAGGGGAAATACGATAGCAGTCATGAAAAACTGAAGGATACA 

AAGAGAATATTTTAATTGGTAGCCATACAMGTGGAGATATTTTAGAATCTATTTCATT 

TCTAATATATTTGAGTCATTTTATTTTGGGTAAAATAAATTATAGGTGGAGCATGGTGGC 

TCACGCTGGTTATCCCAAAGCGCTGGGATTATAGGCCGTGAGCCCGCCACTCCTGGCCGA 

AATCTGTTTTAAGTATAGAACATCATGAAATTGGAT 

Sequence 2681 

CCCGGCCGGGCAGGTACCATGGTTGGMTGATAAANGATATTGTCATTTTTGTTAGCANT 
GAAGTGAGTGGCATCTATGTATTTTTTAAGGTATATAATGAAATTGTGCCTAGGGGAGTN 
ATAATNCACTCTATGTA 
Sequence 2682 

CGCGGGATGGAAAGAAATGAAGACANTTTTAGACNTGCTAGACTNATGGTTGACTATACA 
N CAAC C ATCTCAG AAAGAGTTATTC AGATATAGCTTC AN ACTGATAN NTAAATC ATATAA 
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ANTAATGTGGTANTCAAAATANGAGTTANGTAACTACTGACANATATAAGGAAAGTCGTA 

CCANTTCNGAACTAAAAACAATGGTCTATGTTGCTGGANGAACAATGTGGGGAGGGT 
Sequence 2683 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGAAATG 

AMTTGGAGCTGGACCATTTAGTCAGTTCATTAAAGCAAAAACTCGGCCATTACCACCCT 

TGCCTCCTAGGCTAGAATGTGCTGCTGCTGGTCCTCAGAGCCTGAAGCTAAAATGGGGAG 

ACAGTAACTCCAAGACACATGCTGCTGAGGACATTGTGTACCT 

Sequence 2684 

CCGCGGTGGCGGCCGAGGTAC I I I I I I 1 I I I I I i I II i k I I I I I I I NCTCCACATCTTCA 

CATTCTTTCTGAATAACAAGCCTAAGTCCCCATGACCTGATCACCTGTCAGGTGATCACC 

TTGGCCATCACCTTGGCCTTTATTCACCTAGTGATGCTTCTCATAGTAATGGATTTGTTG 

TCTCCANATGTATTTGAGTCACTGCATTTTTGGAATGGACTTTAAGATGTAAAGGNGTAA 

TTCTACCTGAGCAGAAGTGACAAGGGGCCTTTCCATNTGCACCCCCAGCC 

Sequence 2685 

CCGCGGTGGCGGCCGCCCGGGCAGGTAC I I I I I I M I J 1 I I I I M I I I I 1 I G AGACAGGG 

TCTTGCTTTGCTGCCCAGGCTAGAGTGCAGTGTCGCAATCTCAGCTCACCGCAACCTNTG 

CCTCCCAGACTCAAACC ATCCTA CCCTTAATCTTCCAAGTAGCTGGGACTACGGGTGTGT 

GCCATCATGTCCAGTTAATTTTTAAATTTTTTGTANAGAACAAGGTCTNACTATGT^ 

CAGGCTGGGTCTTAAACTNCTGGGCTCAAGCAATNTGCCTGCCTCCTTCTCCCAAAATGC 

TTGGGA TTAGAGGCATGAGCC ACAATACCTGGCCACTATGCAATTTGTATGTATTTTTAA 

ACGATAA I rTTTTTTTTTTTn I AAACAGAGTTTTGNTTTNTCNCCGAGGCCANAAGNGC 

AGNGGGCACAATCTCAGNTANACTGCAACTTCAAGCCTCCCAGGT 

Sequence 2686 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGACGCGGGGAGCTACAACAGGC 

AGGCAGGGGCAGCAAGATGGTGTTGCAGACCCAGGTCTTCATTTCTCTGTTGCTCTGGAT 

CTCTGGTGCCTACGGGGACATTGTGATGACCCAGTCTCCAGACTCCCTGGCTGTGTCTCT 

GGGCGAGAGGGCCACCATCAACTGCMGTCCAGCCAGAGTGTT7TTTCCAAGTTCCCAAC 

TATAAGAACTTCTTTAGCTTGGTACCTGCCCGGGCGGC 

Sequence 2687 

CCGCGGTGGCGGCCGAGGTACTCGCGGGCCTTGGAGGGGGCCACCGACAGAGCTGACATC 
ACCGTCATGATGCCTGCATTCCAGGAGTCACGGATGGGAAAGTAAATCTTTGGAGGGGCT 
GGGAGCTGGGGTATCTGCAGGCCTGTCTGCTGACACACGTACTAGAATGTTGGCTATAAT 
AGTTCTGTTCTTACAACACATGAAATTTmCGTTTTATTTT^ 

ATGTTCATTTCTACTCACAMCATGTTCTTGGTGTATTTCTTATGCAAACAATCTTCAGG 
CAGCAAAGATGTCCGTTACATCTAMCTTGAATAATAAAGTTTTACCACCAGTTACACAT 
NNAAAAAAAAAAAAAAAANGTACCTGCCCG 
Sequence 2688 

AGGTACTTTGGCCTCTCTGGGATAGAAGTTATTCAGCAGGCACACAACAGAGGCAGTTCC 

AGATTTCAACTGCTCATCAGATGGCGGGAAGATGAAGACAGATGGTGCANCCACAGTTCG 

TGTGATCTCCAGCCTGGTCCCCTGGCCAAAAGTCCGAGGGATACTGCTACTCTGTTGACA 

GTAGTAAGTTTNCAAAATCTTTCAGGTTGCAGAACTGCTTGATGGNGAAGAAGTGAAATC 

TGTCCCAGATNCACTGCCACTTAAACCTTTGATGGGGACCCNACTTTTGCAAACTGGGAT 

TGCANCNTAAGAATGAGGGAGTTTAGGGGGGCCTTTCCCTGGTTTCTTGCTGATNCCAAT 

TTTAAATAGATATTAATGGACTTGACTTGCCCC 

Sequence 2689 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGTTGTATCNGG 
ACTTATGGTGGCCACCAAATATGAAGTGAGTGTCTATGCTCTTAAGGACACTTNGACAAG 
CACACCAGCTNATGGAGNTGNCNCCANTCTGGNNAANGCNNGNCCACCAAGAAGGGCTNG 
TGTGACAGATGCTACTGNNACCACCATCACCATTAGNTGGAGAACCAACACTGAGACCAT 
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